-
Notifications
You must be signed in to change notification settings - Fork 0
slifty/rdiscraper
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This is a suite of scrapers which return article URLs which will be used to feed into the MediaCloud system. We are focusing on the following organizations: - New York Times - Chicago Tribune - Washington Post - LA Times The scrapers use a consistent API which takes in: - A start date - An end date And returns (in XML format): - Article URL for each article in that date range ==== RETURN XML STRUCTURE ==== <articles> <article> <url></url> </article> </articles>
About
Scraping news since 2011
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published