[TRTLLM-12026][feat] Support MTP with block reuse enabled for hybrid models #12896
95923ce to 29a1d27
f3f5eba to 51ffff3
51ffff3 to ff89391
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
Signed-off-by: xiweny <13230610+VALLIS-NERIA@users.noreply.github.com>
…models (NVIDIA#12896) Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com> Signed-off-by: xiweny <13230610+VALLIS-NERIA@users.noreply.github.com>
Summary
enable_cache_reuse flag for mamba cache in executor and resource manager
Test plan
🤖 Generated with Claude Code
Summary by CodeRabbit
Release Notes
New Features
Improvements
Tests