Develop high level script(s) for managing scraping/archiving #52

zschira · 2022-09-13T15:01:31Z

The FERC datasets will need a script to manage scraping both the DBF and XBRL data. It may also be useful to create a single high-level script for scraping data from all sources.

zaneselvans · 2022-09-13T17:46:46Z

We already depend indirectly on the click and typer CLI frameworks, and I think they both provide hooks for tab completion and hierarchical scripts, which might be useful in this context. I've often imagined having a hierarchical script for PUDL with a unified interface and help messages, like:

$ pudl scrape ferc1 ferc2 ferc6 ferc60 ferc714
$ pudl archive ferc1 ferc2 ferc6 ferc60 ferc714
$ pudl datastore update-cache ferc1 ferc2 ferc6 ferc60 ferc714
$ pudl ferc2sqlite settings/ferc2sqlite.yml
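
A minimal sketch of how a hierarchical command layout like the one above might be wired up. To stay dependency-free it uses `argparse` from the standard library rather than click or typer, so it is a stand-in for the idea and not the frameworks mentioned. The `build_parser` and `describe` helpers are hypothetical names, the dataset list is copied from the example commands, and the subcommands only report what they would do instead of scraping or archiving anything.

```python
import argparse

# Dataset names copied from the example commands above.
DATASETS = ["ferc1", "ferc2", "ferc6", "ferc60", "ferc714"]


def build_parser() -> argparse.ArgumentParser:
    """Build a hypothetical hierarchical `pudl` command-line parser."""
    parser = argparse.ArgumentParser(
        prog="pudl", description="Illustrative top-level command (assumption)."
    )
    commands = parser.add_subparsers(dest="command", required=True)

    # `pudl scrape ...` and `pudl archive ...` both take one or more datasets.
    for name in ("scrape", "archive"):
        sub = commands.add_parser(name, help=f"{name} the given datasets")
        sub.add_argument("datasets", nargs="+", choices=DATASETS)

    # `pudl datastore update-cache ...` is a second level of nesting.
    datastore = commands.add_parser("datastore", help="manage the datastore")
    datastore_cmds = datastore.add_subparsers(dest="subcommand", required=True)
    update = datastore_cmds.add_parser("update-cache", help="refresh cached data")
    update.add_argument("datasets", nargs="+", choices=DATASETS)

    # `pudl ferc2sqlite <settings file>` takes a single path argument.
    ferc2sqlite = commands.add_parser("ferc2sqlite", help="convert FERC data")
    ferc2sqlite.add_argument("settings_file")
    return parser


def describe(argv: list[str]) -> str:
    """Parse argv and return a description of the work that would be done."""
    args = build_parser().parse_args(argv)
    if args.command == "ferc2sqlite":
        return f"ferc2sqlite: {args.settings_file}"
    label = args.command
    if args.command == "datastore":
        label = f"datastore {args.subcommand}"
    return f"{label}: {', '.join(args.datasets)}"


if __name__ == "__main__":
    print(describe(["scrape", "ferc1", "ferc2"]))
    print(describe(["datastore", "update-cache", "ferc714"]))
```

Each subparser gets its own `--help` output, which is roughly the unified help behavior described above. A click or typer version would express the same tree with command groups, and would be the place to look for tab completion.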

zschira mentioned this issue Sep 13, 2022

Improve and Automate raw data archiving/access catalyst-cooperative/pudl#1418

Closed

12 tasks

zschira self-assigned this Sep 13, 2022

zschira transferred this issue from catalyst-cooperative/pudl Sep 13, 2022

zschira changed the title ~~Develop high level script(s) for managing scraping~~ Develop high level script(s) for managing scraping/archiving Oct 11, 2022

zschira mentioned this issue Oct 24, 2022

Develop high level script(s) for managing scraping/archiving catalyst-cooperative/pudl-archiver#5

Closed

zschira closed this as completed Nov 21, 2022
