# Data Enrichment

Docling lets you enrich the conversion pipeline with additional steps that process specific document components, e.g. code blocks, pictures, etc.

In [1]:
from docling.document_converter import DocumentConverter, PdfFormatOption
from docling.datamodel.pipeline_options import PdfPipelineOptions
from docling.datamodel.base_models import InputFormat

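# Enable the picture description enrichment step for PDF inputs
pipeline_options = PdfPipelineOptions()
pipeline_options.do_picture_description = True

# Apply these pipeline options whenever a PDF is converted
converter = DocumentConverter(format_options={
    InputFormat.PDF: PdfFormatOption(pipeline_options=pipeline_options)
})

# Download and convert the paper, then keep the resulting document
result = converter.convert("https://arxiv.org/pdf/2501.17887")
doc = result.document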

2025-09-24 17:45:44,632 - INFO - detected formats: [<InputFormat.PDF: 'pdf'>]
2025-09-24 17:45:46,845 - INFO - Going to convert document batch...
2025-09-24 17:45:46,848 - INFO - Initializing pipeline for StandardPdfPipeline with options hash bf0aba1c366a38f007ad7e7cbab835b8
2025-09-24 17:45:46,860 - INFO - Loading plugin 'docling_defaults'
2025-09-24 17:45:46,863 - INFO - Registered picture descriptions: ['vlm', 'api']
2025-09-24 17:45:47,400 - INFO - Accelerator device: 'cpu'
2025-09-24 17:45:49,305 - INFO - Loading plugin 'docling_defaults'
2025-09-24 17:45:49,313 - INFO - Registered ocr engines: ['easyocr', 'ocrmac', 'rapidocr', 'tesserocr', 'tesseract']
2025-09-24 17:45:49,957 - INFO - Accelerator device: 'cpu'
2025-09-24 17:45:51,648 - INFO - Accelerator device: 'cpu'
2025-09-24 17:45:52,780 - INFO - Accelerator device: 'cpu'
2025-09-24 17:45:53,324 - INFO - Processing document 2501.17887v1.pdf
2025-09-24 17:57:35,286 - INFO - Finished converting document 2501.17887v1.pdf in 711.

In [2]:
print(doc.export_to_markdown())

## Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion

Nikolaos Livathinos*, Christoph Auer*, Maksym Lysak, Ahmed Nassar, Michele Dolfi, Panagiotis Vagenas, Cesar Berrospi, Matteo Omenetti, Kasper Dinkla, Yusik Kim, Shubham Gupta, Rafael Teixeira de Lima, Valery Weber, Lucas Morin, Ingmar Meijer, Viktor Kuropiatnyk, Peter W. J. Staar

IBM Research, Rüschlikon, Switzerland

Please send correspondence to: deepsearch-core@zurich.ibm.com

## Abstract

We introduce Docling, an easy-to-use, self-contained, MIT-licensed, open-source toolkit for document conversion, that can parse several types of popular document formats into a unified, richly structured representation. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware in a small resource budget. Docling is released as a Python package and can be used as a Python API or as a CLI tool. D