Add news scraper for Santa Clara county #64

Mr0grog · 2020-05-27T07:04:27Z

News scrapers live in the news directory. You can follow the San Francisco scraper as an example.

Santa Clara County

The COVID-19 homepage has a list of announcements near the bottom we can scrape (no equivalent RSS I can find).
The Public Health Department has a newsroom we can scrape. Can’t find any RSS or Atom feeds for it. :\
The Office of Public Affairs also has a newsroom of the same format with slightly broader coverage. As far as I can tell, though, the Public Health Department one pretty well covers all the coronavirus-related stuff.
There are some SOAP services linked from the COVID-19 page, but they seem to require authentication to access.

The text was updated successfully, but these errors were encountered:

This news page is populated at runtime via JavaScript, so we are using Selenium to load it. Fixes #64.

Unfortunately, the page we're scraping gets populated at runtime via JavaScript, so, like Alameda, I wound up using Selenium. This also fixes missing support for tags in our RSS output (the news items here have “categories,” like “press release” or “announcement,” and this code sets those as tags on each news item). Fixes #64.

Mr0grog added enhancement New feature or request news Related to scraping news (rather than data) labels May 27, 2020

Mr0grog mentioned this issue May 27, 2020

Identify news sources for each county #39

Closed

10 tasks

Mr0grog added a commit that referenced this issue May 30, 2020

Add news scraper for Santa Clara County

47f653d

This news page is populated at runtime via JavaScript, so we are using Selenium to load it. Fixes #64.

Mr0grog mentioned this issue May 30, 2020

Add news scraper for Santa Clara County #70

Merged

Mr0grog closed this as completed in #70 Jun 1, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add news scraper for Santa Clara county #64

Add news scraper for Santa Clara county #64

Mr0grog commented May 27, 2020

Add news scraper for Santa Clara county #64

Add news scraper for Santa Clara county #64

Comments

Mr0grog commented May 27, 2020

Santa Clara County