A document processing service using the Docling-MCP library and MCP (Message Control Protocol) for tool integration.
Docling MCP is a service that provides tools for document conversion, processing and generation. It uses the Docling library to convert PDF documents into structured formats and provides a caching mechanism to improve performance. The service exposes functionality through a set of tools that can be called by client applications.
- Conversion tools:
- PDF document conversion to structured JSON format (DoclingDocument)
- Generation tools:
- Document generation in DoclingDocument, which can be exported to multiple formats
- Local document caching for improved performance
- Support for local files and URLs as document sources
- Memory management for handling large documents
- Logging system for debugging and monitoring
- RAG applications with Milvus upload and retrieval
The easiest way to install Docling MCP is connect it to your client is launching it via uvx.
Depending on the transfer protocol required, specify the argument --transport
, for example
-
stdio
used e.g. in Claude for Desktop and LM Studiouvx --from docling-mcp docling-mcp-server --transport stdio
-
sse
used e.g. in Llama Stackuvx --from docling-mcp docling-mcp-server --transport sse
-
streamable-http
used e.g. in containers setupuvx --from docling-mcp docling-mcp-server --transport streamable-http
More options are available, e.g. the selection of which toolgroup to launch. Use the --help
argument to inspect all the CLI options.
For developing the MCP tools further, please refer to the docs/development.md page for instructions.
One of the easiest ways to experiment with the tools provided by Docling-MCP is to leverage Claude for Desktop. Once installed, extend Claude for Desktop so that it can read from your computer’s file system, by following the For Claude Desktop Users tutorial.
To enable Claude for Desktop with Docling MCP, simply edit the config file claude_desktop_config.json
(located at ~/Library/Application Support/Claude/claude_desktop_config.json
in MacOS) and add a new item in the mcpServers
key with the details of a Docling MCP server. You can find an example of those details here.
Example of prompt for converting PDF documents:
Convert the PDF document at <provide file-path> into DoclingDocument and return its document-key.
Example of prompt for generating new documents:
I want you to write a Docling document. To do this, you will create a document first by invoking `create_new_docling_document`. Next you can add a title (by invoking `add_title_to_docling_document`) and then iteratively add new section-headings and paragraphs. If you want to insert lists (or nested lists), you will first open a list (by invoking `open_list_in_docling_document`), next add the list_items (by invoking `add_listitem_to_list_in_docling_document`). After adding list-items, you must close the list (by invoking `close_list_in_docling_document`). Nested lists can be created in the same way, by opening and closing additional lists.
During the writing process, you can check what has been written already by calling the `export_docling_document_to_markdown` tool, which will return the currently written document. At the end of the writing, you must save the document and return me the filepath of the saved document.
The document should investigate the impact of tokenizers on the quality of LLMs.
The Docling-MCP codebase is under MIT license. For individual model usage, please refer to the model licenses found in the original packages.
Docling and Docling-MCP is hosted as a project in the LF AI & Data Foundation.
IBM ❤️ Open Source AI: The project was started by the AI for knowledge team at IBM Research Zurich.