Docling Project

Welcome to the Docling Project

This is the GitHub organization of the Docling open-source project.

Docling

Docling is our main open-source package. It is a powerful library that simplifies document processing: it parses diverse formats, including advanced PDF understanding, and provides seamless integrations with the gen AI ecosystem.
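
As a rough illustration of the document processing described above, here is a minimal sketch that converts one document to Markdown. It assumes the `docling` package is installed (`pip install docling`) and uses its `DocumentConverter` class; the helper name `convert_to_markdown` is hypothetical and introduced only for this example. The first conversion may download model files, so treat this as a starting point rather than a production setup.

```python
import sys


def convert_to_markdown(source: str) -> str:
    """Convert a document (local path or URL) to Markdown with Docling.

    `convert_to_markdown` is a hypothetical helper for illustration only.
    """
    # Imported lazily so this file can be loaded without docling installed.
    from docling.document_converter import DocumentConverter

    converter = DocumentConverter()
    # convert() parses the source and returns a result holding a DoclingDocument.
    result = converter.convert(source)
    return result.document.export_to_markdown()


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python convert_example.py <path-or-url>
    print(convert_to_markdown(sys.argv[1]))
```

Passing a local file path or a URL to a PDF as the single command-line argument should print the Markdown rendering of that document.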

We support an amazing community which helps us drive forward the adoption of Docling. Give it a try and join the community!



The key repositories of Docling are:

  • docling - The home of the main docling package.
  • docling-core - The definition of types, transforms, serializers, etc. If it has to do with the DoclingDocument, you will find it here.
  • docling-parse - The backend PDF parser used by Docling.
  • docling-serve - The FastAPI wrappers for running Docling as a REST API and distributing large jobs.
  • docling-ibm-models - The AI models powering Docling.
  • docling-sdg - Synthetic data generation (SDG) on documents for dataset generation for RAG, finetuning, etc.
  • docling-mcp - The definition of tools with the Model Context Protocol for document conversion, manipulation and generation agents.

LF AI & Data

Docling is hosted as a project in the LF AI & Data Foundation.

IBM ❤️ Open Source AI

The project was started by the AI for knowledge team at IBM Research Zurich.

Pinned

  1. docling Public

    Get your documents ready for gen AI

    Python 30k 1.9k

  2. docling-serve Public

    Running Docling as an API service

    Python 387 75

  3. docling-core Public

    A Python library to define and validate data types in Docling.

    HTML 134 54

  4. community Public

    4

Repositories

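Showing 10 of 19 repositories. Where available, each entry lists the primary language, star count, license, fork count, open issues, open pull requests, and the date of the last update.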
  • docling-workshops Public

    Docling workshops

    1 CC0-1.0 0 0 0 Updated May 18, 2025
  • docling Public

    Get your documents ready for gen AI

    Python 30,030 MIT 1,893 356 (9 issues need help) 27 Updated May 18, 2025
  • docling-core Public

    A Python library to define and validate data types in Docling.

    HTML 134 MIT 54 27 10 Updated May 18, 2025
  • Python 116 MIT 26 21 8 Updated May 17, 2025
  • docling-eval Public

    Evaluation framework for document processing models and services.

    Python 15 MIT 6 3 3 Updated May 16, 2025
  • docling-serve Public

    Running Docling as an API service

    Python 387 MIT 75 36 7 Updated May 16, 2025
  • Go 2 Apache-2.0 7 1 0 Updated May 16, 2025
  • Python 5 MIT 2 4 2 Updated May 15, 2025
  • docling-snap Public Forked from huggingface/HuggingSnap

    docling-snap

    Swift 1 15 0 0 Updated May 13, 2025
  • mlx-swift-examples Public Forked from ml-explore/mlx-swift-examples

    Examples using MLX Swift

    Swift 0 MIT 239 0 0 Updated May 12, 2025
