This program uses
pdftotext to read a PDF file, extracts
Perma.cc links from it, then uses Perma's public
look up the URLs originally archived. It exports a CSV file with the
Perma links and URLs.
You will need
pdftotext, which is in various packages; try
brew install poppler on a Mac, or install
poppler-utils in Linux.
There are various ways of setting up a Python virtualenv. Try
python3-venv, then run
python3 -m venv env source env/bin/activate
Once you've activated the virtual environment, install required packages and the program itself like this:
pip install -r requirements.txt pip install --editable .
At this point, running