A self-hosted PDF OCR API that converts scanned documents to markdown. Powered by PaddleOCR-VL, runs on GPU via Docker.
-
Updated
Apr 19, 2026 - Python
A self-hosted PDF OCR API that converts scanned documents to markdown. Powered by PaddleOCR-VL, runs on GPU via Docker.
Local OCR Studio: local OCR web app with AI proofread, translation, and summary
Multilingual structured OCR (11+ languages, CJK-tuned) — MCP server with verified per-character bboxes for AI agents
Add a description, image, and links to the multilingual-ocr topic page so that developers can more easily learn about it.
To associate your repository with the multilingual-ocr topic, visit your repo's landing page and select "manage topics."