Skip to content

feat(example): align server MTP support with llama.cpp#2283

Merged
abetlen merged 4 commits into
mainfrom
feat/server-mtp-llama-cpp
Jun 7, 2026
Merged

feat(example): align server MTP support with llama.cpp#2283
abetlen merged 4 commits into
mainfrom
feat/server-mtp-llama-cpp

Conversation

@abetlen

@abetlen abetlen commented Jun 7, 2026

Copy link
Copy Markdown
Owner

Align the server example MTP path with current llama.cpp.

  • Use llama.cpp nextn embeddings instead of Python-side Qwen norm handling.
  • Support ctx_other and optional assistant GGUF loading for MTP draft contexts.
  • Document the new server draft model configuration.

@abetlen abetlen merged commit fddee27 into main Jun 7, 2026
15 checks passed
@abetlen abetlen deleted the feat/server-mtp-llama-cpp branch June 7, 2026 23:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant