Skip to content

Conversation

@mengniwang95
Copy link
Contributor

@mengniwang95 mengniwang95 commented Nov 25, 2025

  • Update modeling files of llama4 and gptoss
  • Add moe handler in convert_hf_model
  • Add ut for llama4 and gptoss

@mengniwang95
Copy link
Contributor Author

@XuehaoSun please add llama4 model in CI machine

@yiliu30 yiliu30 self-requested a review November 26, 2025 13:46
@mengniwang95 mengniwang95 merged commit b1b60e4 into main Nov 27, 2025
26 checks passed
@mengniwang95 mengniwang95 deleted the mengni/tf_load branch November 27, 2025 05:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants