Skip to content

Issues: microsoft/Megatron-DeepSpeed

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

about the optimizer param group
#387 opened May 17, 2024 by L-hongbin
Expert deepcopy raises PickleError
#380 opened Apr 23, 2024 by sxontheway
Pipeline parallelism + CPU offload?
#369 opened Mar 21, 2024 by webber26232
Bugs in GPT2 Inference Example
#364 opened Mar 13, 2024 by JianzheXiao
Unreasonably low throughput on HGX-H100s bug Something isn't working
#357 opened Mar 1, 2024 by GuanhuaWang
2nodes, 4 gpu, tp=2,pp=2, timeout
#338 opened Jan 23, 2024 by lonelydancer
Doubts about GPU memory
#330 opened Jan 12, 2024 by 980202006
ProTip! Mix and match filters to narrow down what you’re looking for.