Stephen Merity Smerity
- cc-warc-examples 27 CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop
- pubcrawl 18 *Deprecated* A short and sweet Python web crawler using Redis as the process queue, seen set and Memcache style rate limiter for robots.txt
- cs205_ga 15 How deep does Google Analytics go? Efficiently tackling Common Crawl using AWS & MapReduce
- keras_qa 10 Keras solution to the bAbI tasks using recurrent neural networks - merged as an example into Keras mainline
- cc-mrjob 7 Demonstration of using Python to process the Common Crawl dataset with the mrjob framework
Repositories contributed to
- fchollet/keras 2,636 Theano-based Deep Learning library (convnets, recurrent neural networks, and more).
- kylebillings/flask-test 0
- frankmcsherry/blog 69 Some notes on things I find interesting and important.
- json4s/json4s 500 A single AST to be used by other scala json libraries
- icoming/FlashGraph 143 A SSD-based graph processing engine for billion-node graphs
Contributions in the last year 142 total Sep 4, 2014 – Sep 4, 2015
Longest streak 4 days September 8 – September 11
Current streak 0 days Last contributed