Miscellaneous tools for processing WARC files from the CommonCrawl
Go

README.md

Warc Tools

Some rather use-case-specific tools for pulling stuff out of the Common Crawl data on AWS.

License

This code is Licensed under the MIT License

Copyright © 2013 Kevin Bullaughey