This repo contains a Python module that provides the pd-extract-archive command. The command extracts Proactive Disclosure (PD) records for a specific PD type and organization from archive tar.gz files.
- Clone the repo into any directory:
git clone https://github.com/open-data/pd-archive-extractor.git
- (Optional) Create a python virtual environment:
cd pd-archive-extractor
python3 -m venv pd_extract_venv
. pd_extract_venv/bin/activate
- Navigate into the directory, and install the module via pip:
pip install -e . && pip install -r requirements.txt
- Confirm that the package is installed:
which pd-extract-archive
- Run the command, for example:
pd-extract-archive --type=grants --org=tbs-sct --input=./my-archive.tar.gz --output=./grants.csv
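
For readers curious what this kind of extraction involves, here is a minimal Python sketch. It is not the module's actual implementation. It assumes, hypothetically, that the archive holds one CSV per PD type named `<type>.csv` and that each row carries the organization in an `owner_org` column; the function name `extract_records` is also made up for illustration.

```python
import csv
import io
import tarfile


def extract_records(archive, pd_type, org, org_column="owner_org"):
    """Return the rows for `org` from the `<pd_type>.csv` member of a tar.gz.

    `archive` is a path string or a binary file object. The member naming
    scheme and the `org_column` default are assumptions made for this sketch,
    not the documented layout of the real archives.
    """
    wanted = f"{pd_type}.csv"
    source = {"name": archive} if isinstance(archive, str) else {"fileobj": archive}
    with tarfile.open(mode="r:gz", **source) as tar:
        for member in tar:
            # Compare base names so members inside subdirectories still match.
            if not member.isfile() or member.name.rsplit("/", 1)[-1] != wanted:
                continue
            raw = tar.extractfile(member)
            # utf-8-sig tolerates an optional byte order mark at the start.
            text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
            return [row for row in csv.DictReader(text) if row.get(org_column) == org]
    # No matching CSV member was found in the archive.
    return []
```

Under those assumptions, `extract_records("./my-archive.tar.gz", "grants", "tbs-sct")` would return a list of dictionaries, one per matching row, which could then be written out with `csv.DictWriter`.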