Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce model parallelism #316

Open
cbalioglu opened this issue Feb 7, 2024 · 1 comment
Open

Introduce model parallelism #316

cbalioglu opened this issue Feb 7, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request modeling Related to modeling APIs training Related to training APIs

Comments

@cbalioglu
Copy link
Contributor

Introduce Megatron style model parallelism.

@cbalioglu cbalioglu self-assigned this Feb 7, 2024
@cbalioglu cbalioglu added enhancement New feature or request modeling Related to modeling APIs training Related to training APIs labels Feb 7, 2024
@yjzhong89
Copy link

hi @cbalioglu, when will this feature be released? By the way, can I use deepspeed to train seamless communication model?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request modeling Related to modeling APIs training Related to training APIs
Projects
None yet
Development

No branches or pull requests

2 participants