Skip to content

Latest commit

 

History

History
25 lines (16 loc) · 558 Bytes

README.md

File metadata and controls

25 lines (16 loc) · 558 Bytes

corpus-unpdf

Github CI

Parse Philippine Supreme Court decisions issued in PDF format as text; hopefully, this can be utilized in the LawSQL dataset.

Documentation

See documentation.

Development

Checkout code, create a new virtual environment:

poetry add corpus-unpdf # python -m pip install corpus-unpdf
poetry update # install dependencies
poetry shell

Run tests:

pytest