More complete DMR support #103

Merged
krissetto merged 1 commit into docker:main from krissetto:better-dmr
Sep 4, 2025

Conversation

@krissetto
Contributor

With this PR the dmr provider now supports:

  • Proper context length setup using `max_tokens`
  • `temperature`, `top_p`, `frequency_penalty`, and `presence_penalty` are all mapped to the appropriate runtime flags for the engine in use (for now, only llama.cpp mappings)
  • Raw runtime flags passed through to the inference engine via `provider_opts:runtime_flags`

Configuration example supported by these changes:

```yaml
models:
  root:
    provider: dmr
    model: ai/qwen3:14B-Q6_K
    max_tokens: 32768
    temperature: 0.7
    top_p: 0.95
    frequency_penalty: 0.2
    presence_penalty: 0.1
    provider_opts:
      runtime_flags: |
        --batch-size 1024
        --ubatch-size 512
```
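To illustrate the idea, here is a minimal sketch of how such a config might be translated into engine flags. The flag names follow llama.cpp's CLI (`--ctx-size`, `--temp`, `--top-p`, `--frequency-penalty`, `--presence-penalty`), but the mapping code itself is a hypothetical illustration, not the provider's actual implementation:

```python
# Hypothetical sketch: map dmr-style config keys to llama.cpp runtime flags,
# then append any raw flags from provider_opts.runtime_flags verbatim.
# This is an illustration of the concept, not the provider's real code.

LLAMA_CPP_FLAGS = {
    "max_tokens": "--ctx-size",
    "temperature": "--temp",
    "top_p": "--top-p",
    "frequency_penalty": "--frequency-penalty",
    "presence_penalty": "--presence-penalty",
}

def build_runtime_flags(config: dict) -> list[str]:
    """Translate supported config keys into engine flags, then pass raw flags through."""
    flags: list[str] = []
    for key, flag in LLAMA_CPP_FLAGS.items():
        if key in config:
            flags += [flag, str(config[key])]
    # runtime_flags is treated as an opaque string handed to the engine as-is
    raw = config.get("provider_opts", {}).get("runtime_flags", "")
    flags += raw.split()
    return flags

config = {
    "max_tokens": 32768,
    "temperature": 0.7,
    "top_p": 0.95,
    "provider_opts": {"runtime_flags": "--batch-size 1024 --ubatch-size 512"},
}
print(build_runtime_flags(config))
```

The raw `runtime_flags` string is appended last, so engine-specific options the provider knows nothing about still reach the inference engine untouched.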

closes #71

Supports most top-level `models:` configuration options, proper context size configuration via `max_tokens`, and manually defining engine-specific runtime flags via `provider_opts:runtime_flags`

references docker#71

Signed-off-by: Christopher Petito <chrisjpetito@gmail.com>
@krissetto krissetto requested a review from rumpl September 4, 2025 16:12
@krissetto krissetto merged commit 9239963 into docker:main Sep 4, 2025
4 checks passed


Development

Successfully merging this pull request may close these issues.

[FEAT] - Better DMR integration
