Skip to content

justmars/corpus-unpdf

Repository files navigation

corpus-unpdf

Github CI

Parse Philippine Supreme Court decisions issued in PDF format as text; hopefully, this can be utilized in the LawSQL dataset.

Documentation

See documentation.

Development

Checkout code, create a new virtual environment:

poetry add corpus-unpdf # python -m pip install corpus-unpdf
poetry update # install dependencies
poetry shell

Run tests:

pytest

About

Parse Philippine Supreme Court decisions issued in PDF format as text

Resources

Stars

Watchers

Forks