-
Notifications
You must be signed in to change notification settings - Fork 3.9k
Pull requests: microsoft/DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
_exec_forward_pass: place zeros(1) on the same device as the param
#5576
opened May 28, 2024 by
nelyahu
Loading…
[CPU] SHM based allreduce improvement for small message size
#5571
opened May 27, 2024 by
delock
Loading…
assumption of initial_seed function accepting seed arg in DeepSpeedAccelerator abstract class is incorrect
#5569
opened May 26, 2024 by
polisettyvarma
Loading…
Add support for Microsoft Phi-3 model to DeepSpeed-FastGen
#5559
opened May 21, 2024 by
adk9
Loading…
create mininal universal checkpoint info for client state
#5526
opened May 13, 2024 by
xylian86
Loading…
Z3: optimizations for grad norm calculation and gradient clipping
#5504
opened May 7, 2024 by
nelyahu
Loading…
Make the quantized data shape compatible with original tensor shape
#5483
opened Apr 30, 2024 by
sfc-gh-reyazda
Loading…
[XPU] support op builder from intel_extension_for_pytorch kernel path
#5425
opened Apr 17, 2024 by
YizhouZ
Loading…
Add fp16 support of Qwen1.5MoE models (A2.7B) to DeepSpeed-FastGen
#5403
opened Apr 12, 2024 by
ZonePG
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.