smoldocling

Here are 2 public repositories matching this topic...

genieincodebottle / parsemypdf

A collection of PDF parsing libraries, including AI-based tools such as docling, claude, openai, llama-vision, and unstructured-io, alongside pdfminer, pymupdf, pdfplumber, and others, for efficient snapshot, text, table, and metadata extraction.

ocr openai claude camelot pymupdf pypdf ocr-python markitdown llama-parse omniai unstructured-io docling llama-vision smoldocling

Updated Jun 29, 2025
Python

PRITHIVSAKTHIUR / Multimodal-OCR2

A comprehensive multimodal OCR application that supports both image and video document processing using state-of-the-art vision-language models. This application provides an intuitive Gradio interface for extracting text, converting documents to markdown, and performing advanced document analysis.

pillow image-analysis gradio video-understanding document-retrieval ocr-recognition huggingface-transformers vision-transformer qwen2-5-vl smoldocling

Updated Jun 25, 2025
Python
