Deployed to https://salty-tor-71342.herokuapp.com
- url
- Does the body include my name? (Chris)
- Does the website ust bootstrap?
- What email addresses, if any, does the body contain?
- Can I fetch this url?
-
Better robots.txt parsing. Currently, it is very naive; Currently:
- It just assumes that the path is allowed unless it's explicitly disallowed.
- It does not take "User-Agent" into consideration
-
Better UI?
-
Integrate Sidekiq background job to process batch data?