Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
ocr openai claude camelot pymupdf pypdf ocr-python markitdown gemini-pro gemini-ai llama-parse omniai unstructured-io docling llama-vision mistral-ocr smoldocling llama4
-
Updated
Jul 1, 2025 - Python