Skip to content

aphp/edspdf-poppler

Repository files navigation

Tests Documentation PyPI Codecov DOI

edspdf-poppler

edspdf-poppler provides a Poppler-based PDF parser component for EDS-PDF. We only provide prebuilt binaries for linux and macos. If you are on windows, you will need to build this lib from source.

Beware, Poppler is GPL-licensed: edspdf-poppler is therefore also GPL-licensed, and any model depending on this component must be too.

Getting started

Install the library with pip:

$ pip install edspdf-poppler

Usage

from edspdf import Pipeline

model = Pipeline()
model.add_pipe('poppler-extractor')

model(pdf_bytes)

Visit the documentation for more information!

Acknowledgement

We would like to thank Assistance Publique – Hôpitaux de Paris and AP-HP Foundation for funding this project.