No description, website, or topics provided.
Python Shell
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
citation @ 4fd4746
convert
fetch
.gitignore
.gitmodules
README.md
go.sh
requirements.txt
reset.sh

README.md

pacer-recap-citations

  • Create a python virtualenv, activate it
  • pip install -r requirements.txt
  • Create a nodeenv, activate it (yes, they can both be active)
  • Run go.sh

That will give you a data/ directory that contains mirrors of the files listed on pages like this one for docket cacd.518379. Alongside the mirrored files will be a grayscale pgm image file for each pdf page and a plain text version of each pgm image.

TODO:

  • Extract legal citations from text files
  • Extract dates from the *.docket.xml files