Skip to content

[pull] main from huggingface:main#299

Merged
pull[bot] merged 2 commits intoMu-L:mainfrom
huggingface:main
Aug 18, 2025
Merged

[pull] main from huggingface:main#299
pull[bot] merged 2 commits intoMu-L:mainfrom
huggingface:main

Conversation

@pull
Copy link
Copy Markdown

@pull pull bot commented Aug 18, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.3)

Can you help keep this open source service alive? 💖 Please sponsor : )

lambertwjh and others added 2 commits August 18, 2025 08:02
* CogView4: remove SiLU in final AdaLN (match Megatron); add  switch to AdaLayerNormContinuous; split temb_raw/temb_blocks

* CogView4: remove SiLU in final AdaLN (match Megatron); add  switch to AdaLayerNormContinuous; split temb_raw/temb_blocks

* CogView4: remove SiLU in final AdaLN (match Megatron); add  switch to AdaLayerNormContinuous; split temb_raw/temb_blocks

* CogView4: use local final AdaLN (no SiLU) per review; keep generic AdaLN unchanged

* re-add configs as normal files (no LFS)

* Apply suggestions from code review

* Apply style fixes

---------

Co-authored-by: 武嘉涵 <lambert@wujiahandeMacBook-Pro.local>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
* add docs.

* more docs.

* xfail full compilation for Qwen for now.

* tests

* up

* up

* up

* reviewer feedback.
@pull pull bot locked and limited conversation to collaborators Aug 18, 2025
@pull pull bot added the ⤵️ pull label Aug 18, 2025
@pull pull bot merged commit 4d9b822 into Mu-L:main Aug 18, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants