merge main #1749
Merged
Harold-lkk merged 13 commits into agent_dev on May 2, 2026
[Feature] Add wrapper for compute_local_shape_and_global_offset to handle meta tensor errors
change dir
* add new validation on resume cases
* fix f-string error
* update priority
* add qwen3.5 case about 8nums vs 16nums
* install tilelang
* limit version
* optimizer code
* chore(build): update deps transformers -> 5.2.0
* chore(build): update cudnn to 9.15.1.9 of torch 2.9.1 in dockerfile
* chore(ci): use wider tolerance in test_qwen3_5 sp case
* refactor: add RopeParametersConfig due to transformers 5.2.0 bc
* chore(build): conditional path for lmdeploy and sglang in Dockerfile
* chore(build): update dockerfile for deepep, deep_gemm and ci proxy speed fix
* fix(ci): ep>1 clip_grad_norm fails due to pt2.9 check
* fix(ci): clean hf dynamic modules before test setup
* chore(docker): update lmdeploy deps
* refactor: Move compile config from FSDPConfig to model_cfg
* fix(engine): use field rope_parameters when save_hf
* mtp kl loss in rl
* support chunk kl
* resolve comments
* Complete MTP RL support wiring
* fix lint
fix format
Co-authored-by: wentiange <tiangewen@qq.com>
* fix batch reward
* fix lint
No description provided.