Skip to content

fix mtp prefix cache flag#824

Merged
valarLip merged 1 commit into
mainfrom
fpz/fix_prefix_default_on
May 18, 2026
Merged

fix mtp prefix cache flag#824
valarLip merged 1 commit into
mainfrom
fpz/fix_prefix_default_on

Conversation

@jiayyu
Copy link
Copy Markdown
Contributor

@jiayyu jiayyu commented May 18, 2026

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Copilot AI review requested due to automatic review settings May 18, 2026 09:40
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR aims to automatically disable prefix caching when using certain MTP speculative draft models that don’t support it, preventing incorrect cache behavior.

Changes:

  • Detect speculative draft model model_type for Qwen MTP variants.
  • If prefix caching is enabled for unsupported MTP draft models, emit a warning and disable prefix caching automatically.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread atom/config.py Outdated
Comment on lines +1015 to +1017
spec_model_type = getattr(
self.speculative_config.hf_config, "model_type", None
)
Copilot AI review requested due to automatic review settings May 18, 2026 10:57
@jiayyu jiayyu force-pushed the fpz/fix_prefix_default_on branch from 92d1d1e to 48c0b21 Compare May 18, 2026 10:57
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated no new comments.

@valarLip valarLip merged commit a076ab3 into main May 18, 2026
29 of 34 checks passed
@valarLip valarLip deleted the fpz/fix_prefix_default_on branch May 18, 2026 12:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants