Category Archives: java

How to use a raw MapReduce job in Cascading

Cascading is a great abstraction over MapReduce. However, sometimes you may have code for an existing MapReduce job or want to drop directly to Hadoop for efficiency. Even if you’re using raw MapReduce jobs, Cascading can still be useful in planning the overall data pipeline. The code below is an example of how to use a [...]
Also posted in cascading, hadoop | Leave a comment