A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
-
Updated
Jul 28, 2024 - Python
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
A Python tool to help extracting information from structured PDFs.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
Purpose for make more natural TTS services by modifying scripts.
A tool for calculating activity points from certificates of co-curricular activities for colleges under KTU university.
PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取
Upload your resume and check out your best matching jobs!
A collection of PDF data mining scripts for various IMRT QA vendors
Representation of women in board of directors data scraping and visualization project for Van Leer "She Knows" (Yodaat יודעת)
An ultimate pdf file disintegration tool
Written in python, for checking reference lists in systematic reviews and literature reviews, helps with reference list searching both backward&forward by extracting references and creating search queries, ranks articles by relevance to improve screening efficiency, download full-text pdf of research articles in batch.
A PDF parser written in Python 3 with no external dependencies.
Malice PDF Plugin
A Single Library Parser to extract meta information,static analysis and detect macros within the files.
This is source code for transforming PDFs from the Mamluk journal project to Simple Archive Format import objects for knowledgespace.uchicago.edu
Add a description, image, and links to the pdf-parsing topic page so that developers can more easily learn about it.
To associate your repository with the pdf-parsing topic, visit your repo's landing page and select "manage topics."