Skip to content
scraper for new (jun15) ssci site
JavaScript
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data
samples
.gitignore
INSTALL
README.md
getManifest.js
illegible.js
package-lock.json
package.json
screenshot.js
settings.json

README.md

illegible-us is a scraper for the hearing archive of the Senate Select Committee on Intelligence (SSCI)

the scraper collects hearing-related media (PDF documents and video) and metadata (location, time, witnesses, media-associated metadata).

illegible-us is written in node and has a number of dependencies beyond npm's scope: ffmpeg, youtube-dl, exiftool, and puppeteer (a headless chrome; fwiw i'd prefer puppeteer-firefox but setting up proxy is too annoying)

You can’t perform that action at this time.