A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Updated Jul 8, 2024 - Python
Parser for Consolidated Account Statements (CAS) generated from CAMS/Karvy/Kfintech
Python PDF parser for scientific publications: content and figures
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
Analyze PDFs. With colors. And Yara.
Dedoc is a library (service) for automating document parsing and converting documents to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
Batch-convert PDF to text, extract data from PDF in Python
A python client for the Sypht API
An investigation into PDF encryption
PDF parsing toolkit for preparing an academic text corpus
📜 parse your Caisse d'Épargne PDF statements to CSV!
Scanipy stands for "scan it with Python"—it's your smart Python library for scanning and parsing complex PDF files like books, reports, articles, and academic papers. Utilizing cutting-edge Deep Learning algorithms, Scanipy transforms your PDFs into a treasure trove of extractable information: tables, images, equations, and text.
PDF Parser based on VirusTotal API
Fix links in PDF files, rewrite links, extract text annotations, remove pages
A Python library for exploring PDFs with ease.
Automation for admin tasks, client intake, allocation of available markets, and dissemination of submissions to insurance carriers.