A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
-
Updated
Jul 28, 2024 - Python
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
A Python tool to help extracting information from structured PDFs.
PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取
Parsing resumes in a PDF format from linkedIn
A PDF parser written in Python 3 with no external dependencies.
Written in python, for checking reference lists in systematic reviews and literature reviews, helps with reference list searching both backward&forward by extracting references and creating search queries, ranks articles by relevance to improve screening efficiency, download full-text pdf of research articles in batch.
Malice PDF Plugin
An ultimate pdf file disintegration tool
A collection of PDF data mining scripts for various IMRT QA vendors
Upload your resume and check out your best matching jobs!
A tool for calculating activity points from certificates of co-curricular activities for colleges under KTU university.
A Single Library Parser to extract meta information,static analysis and detect macros within the files.
Representation of women in board of directors data scraping and visualization project for Van Leer "She Knows" (Yodaat יודעת)
This is source code for transforming PDFs from the Mamluk journal project to Simple Archive Format import objects for knowledgespace.uchicago.edu
Add a description, image, and links to the pdf-parsing topic page so that developers can more easily learn about it.
To associate your repository with the pdf-parsing topic, visit your repo's landing page and select "manage topics."