Forked from yc-huang/Hive-mongo
hive storage handler for connecting with MongoDB
Forked from mongodb/mongo-hadoop
MongoDB adapter for Hadoop
Forked from nathanmarz/storm
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
Forked from cwensel/cascading
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows on a Hadoop cluster. See https://github.com/Cascading/cascading for the release repository.
Forked from fxsjy/jieba
Forked from codelucas/newspaper
Simplified news extraction, article extraction and content curation in python. Built with multithreading, 10+ languages, NLP, ML, and more!