Change the repository type filter
All
Repositories list
183 repositories
spidermon
PublicScrapy Extension for monitoring spiders execution.shub-workflow
Publicdateparser
Publicpython parser for human readable dates- Parse numbers written in natural language
- Extract price amount and currency symbol from a raw text string
web-poet
PublicWeb scraping Page Objects core librarypython-crfsuite
Publicstreamparse
PublicFormasaurus
Publicsplash
Publicextruct
PublicExtract embedded metadata from HTML markuppgcontents
Publichcf-backend
Publicshublang
Publicwoodpecker
Publicvaranus
Publicscrapyrt
PublicHTTP API for Scrapy spidersportia
Publicsklearn-crfsuite
Publicwebstruct-demo
PublicHTTP demo for https://github.com/scrapinghub/webstructautologin
Publicpython-intercom
Publicluigi
Publicmrjob
Publicdocker-custodian
Publicaduana
PublicFrontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even when making big crawls (one billion pages).