Stephen Merity (Smerity)
- San Francisco, California
- smerity@smerity.com
- http://www.smerity.com
Popular repositories
- cc-warc-examples (20 stars): CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop
- pubcrawl (18 stars): *Deprecated* A short and sweet Python web crawler using Redis as the process queue, seen set and Memcache style rate limiter for robots.txt
- cs205_ga (15 stars): How deep does Google Analytics go? Efficiently tackling Common Crawl using AWS & MapReduce
- Hip-Flask (6 stars): *Deprecated*
- Snippets (4 stars): Useful code snippets that I'd rather not lose
Public contributions
- Year of contributions: 243 total (Jan 2, 2014 – Jan 2, 2015)
- Longest streak: 7 days (May 7 – May 13)
- Current streak: 0 days