Pull requests: deepspeedai/DeepSpeed

Pull requests list

async tp allreduce
#7115 opened Mar 7, 2025 by inkcherry
[XPU] Support XCCL on deepspeed side
#7113 opened Mar 6, 2025 by ys950902
fix keep_module_on_host
#7112 opened Mar 6, 2025 by inkcherry
Variable batch size and LR scheduler
#7104 opened Mar 3, 2025 by bm-synth
Unpin once transformers latest is fixed
#7088 opened Feb 27, 2025 by loadams
Update Domino for Llama3
#7084 opened Feb 26, 2025 by shenzheyu
Conditionally quote env vars
#7071 opened Feb 22, 2025 by saurabhkoshatwar
Enable ZeRO set/get APIs for NVMe offload
#7046 opened Feb 17, 2025 by tjruwase
Training multiple models
#7018 opened Feb 8, 2025 by tjruwase
Enable python 3.11 and 3.12 tests
#7007 opened Feb 6, 2025 by loadams
Enable torch.autocast with ZeRO
#6993 opened Feb 3, 2025 by tohtana
Improve overflow handling in ZeRO
#6976 opened Jan 28, 2025 by tjruwase
4 of 6 tasks
Pin numpy version
#6953 opened Jan 15, 2025 by BLOrange-AMD
Set dataloader shuffle=true
#6950 opened Jan 14, 2025 by loadams Draft
1 task
Update MII tests to support transformers latest
#6686 opened Oct 29, 2024 by loadams