- cloudera/hue 1,785 Let’s Big Data. Hue is an open source Web interface for analyzing data with Hadoop and Spark.
- cloudera/flume 851 WE HAVE MOVED to Apache Incubator. https://cwiki.apache.org/FLUME/ . Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log…
- cloudera/sqoop 157 Sqoop has moved to Apache!
- ogrisel/pignlproc 156 Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
- yahoo/howl 73 Common metadata layer for Hadoop's MapReduce, Pig, and Hive.