Spider that crawls the home page of TechCrunch (http://techcrunch.com/)
Python
Latest commit 83565ea Aug 1, 2012 bahtou commit

README.txt

The Scrapy framework is used to scrape information from the home page of TechCrunch.
For each post, the spider extracts the author, the author's link, the headline, the headline link, and the time posted.
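
The five extracted fields can be pictured as one record per post. The sketch below is illustrative only and is not taken from this repository: the names PostRecord, clean and make_record, and the field names themselves, are assumptions. In a Scrapy project this would typically be an Item subclass; a plain dataclass is used here so the example runs without Scrapy installed.

```python
from dataclasses import dataclass, asdict


@dataclass
class PostRecord:
    """One post from the home page. Field names are hypothetical."""
    author: str
    author_link: str
    headline: str
    headline_link: str
    time_posted: str


def clean(value):
    """Collapse the stray whitespace and newlines that scraped text often carries."""
    return " ".join(value.split())


def make_record(raw):
    """Build a PostRecord from a dict of raw scraped strings."""
    return PostRecord(**{name: clean(raw[name]) for name in PostRecord.__dataclass_fields__})


if __name__ == "__main__":
    # Made-up sample values, not real TechCrunch data.
    sample = {
        "author": "  Jane Doe\n",
        "author_link": "http://example.com/author/jane-doe/",
        "headline": "Example   headline",
        "headline_link": "http://example.com/2012/08/01/example-headline/",
        "time_posted": "2012-08-01 09:00",
    }
    print(asdict(make_record(sample)))
```

Normalizing whitespace before building the record keeps later steps, such as database inserts, free of formatting noise from the page markup.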

The data is then stored in a MySQL database using the MySQLdb module.
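
The storage step might look roughly like the sketch below. It is not the project's actual code: the table name posts, the column names, and the helpers open_store and save_post are assumptions, and the standard-library sqlite3 module stands in for MySQLdb so the example runs without a MySQL server. Both modules follow the Python DB-API, but MySQLdb uses %s placeholders instead of ?, and MySQL spells the duplicate-skipping insert as INSERT IGNORE.

```python
import sqlite3

# Hypothetical column names for the five scraped fields.
FIELDS = ("author", "author_link", "headline", "headline_link", "time_posted")


def open_store(path=":memory:"):
    """Open the database and create the posts table if it does not exist."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS posts ("
        "author TEXT, author_link TEXT, headline TEXT, "
        "headline_link TEXT UNIQUE, time_posted TEXT)"
    )
    return conn


def save_post(conn, post):
    """Insert one post (a dict keyed by FIELDS); skip it if the link is already stored."""
    # Parameterized query: values are never interpolated into the SQL string.
    conn.execute(
        "INSERT OR IGNORE INTO posts VALUES (?, ?, ?, ?, ?)",
        tuple(post[name] for name in FIELDS),
    )
    conn.commit()


if __name__ == "__main__":
    conn = open_store()
    # Made-up sample values, not real TechCrunch data.
    post = {
        "author": "Jane Doe",
        "author_link": "http://example.com/author/jane-doe/",
        "headline": "Example headline",
        "headline_link": "http://example.com/2012/08/01/example-headline/",
        "time_posted": "2012-08-01 09:00",
    }
    save_post(conn, post)
    save_post(conn, post)  # a repeated crawl does not create a duplicate row
    print(conn.execute("SELECT COUNT(*) FROM posts").fetchone()[0])
```

The UNIQUE constraint on the headline link is one possible way to keep repeated crawls of the home page from storing the same post twice.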

Check out:
    http://scrapy.org/