Skip to content
Discussion options

You must be logged in to vote

Hi @Romaiz! I'm Dosu and I’m helping the docling team.

Hey @Romaiz, great question! Docling does support integrating custom/finetuned local models, and you have several options for using something like Nemotron-nano-4b.

Quick clarification: Docling's default model is actually Granite-Docling-258M (IBM's lightweight model), not Qwen — though Qwen2.5-VL-3B is available as a preset [1].

Integrating a Custom Model

Option 1: Via OpenAI-compatible API (recommended for local serving)

Run your finetuned Nemotron-nano-4b with vLLM or Ollama, then point Docling to it:

from docling.datamodel.pipeline_options import VlmPipelineOptions, VlmConvertOptions, ApiVlmEngineOptions, VlmEngineType

vlm_options 

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Romaiz
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant