Skip to content

Pull requests: microsoft/DeepSpeed

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

stage_1_and_2: optimize clip calculation to use clamp
#5632 opened Jun 9, 2024 by nelyahu Loading…
Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft
fix: quantization with DeepSpeed HE
#5624 opened Jun 6, 2024 by Atry Loading…
Add support for Phi-3 small to FastGen
#5614 opened Jun 4, 2024 by adk9 Draft
fixes in _partition_param_sec function
#5613 opened Jun 4, 2024 by mmhab Loading…
[INF] Enable torch compile for inference
#5612 opened Jun 4, 2024 by oelayan7 Loading…
Upgrade HPU image to v1.16.0.
#5610 opened Jun 4, 2024 by vshekhawat-hlab Loading…
Fixed Windows inference build.
#5609 opened Jun 3, 2024 by costin-eseanu Loading…
Fix overlap communication of ZeRO stage 1 and 2
#5606 opened Jun 3, 2024 by penn513 Loading…
pipe/_exec_backward_pass: fix immediate grad update
#5605 opened Jun 3, 2024 by nelyahu Loading…
state_dict_factory: llama checkpoint - support SWIGLU
#5601 opened Jun 2, 2024 by nelyahu Loading…
Update profiler.py
#5584 opened May 29, 2024 by gameofdimension Loading…
reduce cpu host overhead when using moe
#5578 opened May 29, 2024 by ranzhejiang Loading…
Reuse KV cache of prefixes
#5572 opened May 27, 2024 by tohtana Draft
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559 opened May 21, 2024 by adk9 Loading…
ProTip! no:milestone will show everything without a milestone.