smoldocling

Here are 2 public repositories matching this topic...

A comprehensive multimodal OCR application that supports both image and video document processing using state-of-the-art vision-language models. This application provides an intuitive Gradio interface for extracting text, converting documents to markdown, and performing advanced document analysis.

  • Updated Jun 25, 2025
  • Python
