[Shardformer] Support the Qwen2 model #5699

Merged
merged 18 commits into from
May 9, 2024

wangbluo commented May 8, 2024

This PR adds Shardformer support for the Qwen2 model. The transformers version should be above 4.39.1.
The feature has already been tested locally.

wangbluo requested a review from a team as a code owner May 8, 2024 13:58

wangbluo commented May 9, 2024

I have tested this feature locally.
The transformers version used in the unit tests is 4.36.2, which is not compatible with the Qwen2 model, so the Qwen2 unit test is skipped in CI.
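
For illustration only, the version gate described above could be sketched roughly as follows. This is not the code from this PR: `parse_version` and `qwen2_supported` are hypothetical helper names, and reading "above 4.39.1" as strictly greater is an assumption.

```python
import re

# Threshold taken from the PR description.
# Assumption: "above 4.39.1" is read as strictly greater than 4.39.1.
MIN_TRANSFORMERS_FOR_QWEN2 = "4.39.1"


def parse_version(version: str) -> tuple:
    """Turn '4.36.2' or '4.40.0.dev0' into a tuple of its leading integer parts."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            # Stop at the first non-numeric component such as 'dev0'.
            break
        parts.append(int(match.group()))
    return tuple(parts)


def qwen2_supported(installed: str, minimum: str = MIN_TRANSFORMERS_FOR_QWEN2) -> bool:
    """Hypothetical helper: True if the installed version is newer than the minimum."""
    return parse_version(installed) > parse_version(minimum)


if __name__ == "__main__":
    # 4.36.2 is the CI version mentioned above; the other values are examples.
    for candidate in ("4.36.2", "4.39.1", "4.40.0"):
        print(candidate, qwen2_supported(candidate))
```

In a pytest suite, a check like this could feed `pytest.mark.skipif(not qwen2_supported(transformers.__version__), reason="transformers too old for Qwen2")`, so the Qwen2 test is skipped on the CI environment and runs wherever a newer transformers is installed.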

ver217 marked this pull request as draft May 9, 2024 09:03
ver217 marked this pull request as ready for review May 9, 2024 09:03
ver217 merged commit a3cc68c into hpcaitech:main May 9, 2024
4 checks passed
3 participants