Contact BackType for event and ticket information.

Looks like this event has already ended.

Check out upcoming events by this organizer, or organize your very own event.

View upcoming events Create an event

Cascalog Workshop

Saturday, February 19, 2011 from 10:00 AM to 4:00 PM (PT)

San Francisco, CA

Ticket Information

Ticket Type Sales End Price Fee Quantity
Early bird Ended $120.00 $0.00
Regular ticket Ended $150.00 $0.00
SHARE THIS EVENT

Event Details

The goal of this workshop is to learn how to use Cascalog to build complex data processing workflows on top of Hadoop.

Cascalog's tight integration with Clojure lends itself to lots of powerful techniques which will be covered in this workshop. I will be using real BackType code as illustration of these techniques.

We'll spend a short amount of time going through Cascalog's features and spend most of our time learning techniques to use these features to build real apps.

 

Requirements:

1. Bring your laptop.

2. You should have a basic understanding of Clojure (e.g., have gone through the Programming Clojure book)

3. You should know how to use leiningen to build Clojure applications.

4. No prior understanding of Cascalog necessary, but you'll get more value if you go through the tutorials and experiment with the playground beforehand.

 

Agenda:

1. Incremental development using emacs and leiningen 

2. Basics of Cascalog

3. The Cascalog query planner in depth: Cascalog -> Cascading -> MapReduce

4. The when, how, and why of Cascalog’s custom operation types 

 

Lunch Break

 

5. Making queries dynamically: :<<, :>>, construct, and associated techniques

6. Abstraction and composition: functions and predicate macros

7. Understanding the performance of Cascalog queries 

8. Custom taps

9. Unit testing Cascalog queries

10. Exporting data with ElephantDB

 

Notes:

1. ElephantDB will be open-sourced sometime prior to the workshop.