Cascading is a feature-rich API for defining and executing complex, fault-tolerant data processing workflows on a Hadoop cluster. See https://github.com/cwensel/cascading for access to all WIP branches.
A test harness for testing binary compatibility
ANSI SQL for Cascading on Apache Hadoop
Deploying Apache Hadoop in a virtualized cluster, as easy as 1-2-3.
HBase adapters for Cascading
Tutorials for Cascading, Lingual, Pattern and other projects
Source examples to support the "Cascading for the Impatient" blog post series
Integration for Cascading and Apache Hive
Cascading schemes and taps for JDBC
A simple command-line interface for building high-load cluster jobs.
Memcached/Membase/ElasticSearch integration for Cascading
A simple kind of social recommender
Machine Learning for Cascading
A Fluent Java API for Cascading
Cascading.Multitool is a sed- and grep-style command-line tool for Apache Hadoop.
Sample applications using Cascading
The Scalding tutorial as a standalone SBT project
Standalone project for running the Cascalog tutorial
A simple Hello World Cascading project to ease the start of a new Cascading application
Annotations and Classes for managing and executing dependent processes
Serializer and comparator for using Thrift objects in Cascading or Cascalog
All the Cascading taps you need and love.
This project is deprecated; please use https://github.com/Cascading/cascading-jdbc
Cascalog for the Impatient
[DEPRECATED, please use https://github.com/magro/kryo-serializers] Extra tidbits for Kryo.
Cascading plus City of Palo Alto open data
Kryo Integration for Cascading.