Skip to content

Latest commit

 

History

History
17 lines (12 loc) · 386 Bytes

parsers_pdf.rst

File metadata and controls

17 lines (12 loc) · 386 Bytes

Parsers - PDF parser

.. automodule:: MordinezNLP.parsers.process_pdf
   :members:

Example usage for TXT files:

from io import BytesIO
from MordinezNLP.parsers import process_pdf

with open("my_pdf_doc.pdf", "rb") as f:
       pdf = BytesIO(f.read())
   output = process_pdf(pdf)
   print(output)