Pull requests: microsoft/DeepSpeed
Pull requests list
Monitor was always enabled causing performance degradation
#5633 opened Jun 9, 2024 by deepcharm
reduce all-to-all communication volume when both expert and non-expert are tensor-parallel
#5626 opened Jun 7, 2024 by taozhiwei
Add an argument to enable the injection of missing state during the conversion of universal checkpoints
#5608 opened Jun 3, 2024 by xylian86
[CPU] Allow deepspeed.comm.inference_all_reduce in torch.compile graph
#5604 opened Jun 3, 2024 by delock
WA for Torch-compile-Z3-act-apt accuracy issue from the Pytorch repo
#5590 opened May 30, 2024 by NirSonnenschein
FastGen H100 MoE support: Add PyTorch multi-gemm MOE implementation
#5586 opened May 29, 2024 by HeyangQin
Remove compile wrapper to simplify access to model attributes
#5581 opened May 29, 2024 by tohtana
_exec_forward_pass: place zeros(1) on the same device as the param
#5576 opened May 28, 2024 by nelyahu
[CPU] SHM based allreduce improvement for small message size
#5571 opened May 27, 2024 by delock
assumption of torch.initial_seed function accepting seed arg in DeepSpeedAccelerator abstract class is incorrect
#5569 opened May 26, 2024 by polisettyvarma
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559 opened May 21, 2024 by adk9