-
Notifications
You must be signed in to change notification settings - Fork 4.5k
Insights: deepspeedai/DeepSpeed
Overview
Could not load contribution data
Please try again later
8 Pull requests merged by 5 people
-
Fix pre-compile on cpu-only machines
#7168 merged
Aug 12, 2025 -
[TiledFusedLogitsLoss] support inference
#7477 merged
Aug 11, 2025 -
[UlyssesSPDataLoaderAdapter] fix iterator reset
#7472 merged
Aug 11, 2025 -
Modal CI
#7289 merged
Aug 11, 2025 -
fix
deepspeed --venv_script
#7469 merged
Aug 11, 2025 -
Fix cpu CI
#7481 merged
Aug 11, 2025 -
Add blog for ZenFlow
#7463 merged
Aug 10, 2025 -
add --bind_cores_to_rank to zero offload tutorial
#7474 merged
Aug 8, 2025
3 Pull requests opened by 3 people
-
Add world-size getter in Engine
#7479 opened
Aug 9, 2025 -
Enable forked PRs
#7486 opened
Aug 12, 2025 -
fix xpu device_id AttributeError issue
#7488 opened
Aug 14, 2025
1 Issue closed by 1 person
-
[REQUEST]
#7476 closed
Aug 12, 2025
8 Issues opened by 8 people
-
[BUG] Cuda failure 700 when use deepcompile with zero stage 3
#7487 opened
Aug 14, 2025 -
[BUG] UlyssesSPDataLoaderAdapter returns duplicate data
#7484 opened
Aug 12, 2025 -
[BUG] FlopsProfiler will hit error when sequence parallel enabled
#7483 opened
Aug 12, 2025 -
[BUG] GPU OOM when finetune Qwen2.5-14B with ZeRO2+offload on 4xA100 40G cards
#7482 opened
Aug 11, 2025 -
[REQUEST]
#7480 opened
Aug 9, 2025 -
[REQUEST] Auto-Tuning CPU Core Binding for DeepSpeed&ZenFlow
#7478 opened
Aug 9, 2025 -
[BUG] Abnormal loss in deepspeed v0.17.2 + ulysess training, not decreasing.
#7473 opened
Aug 8, 2025
12 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Add Zenflow code for Stage 1 & 2
#7391 commented on
Aug 12, 2025 • 15 new comments -
Support Muon Optimizer
#7454 commented on
Aug 10, 2025 • 1 new comment -
Add EXAONE 4.0 model support for DeepSpeed inference v2 @
#7456 commented on
Aug 12, 2025 • 1 new comment -
[ERROR] [launch.py:321:sigkill_handler] exits with return code = -11
#5690 commented on
Aug 8, 2025 • 0 new comments -
[REQUEST] Add support for non-__dict__ outputs such as MinkowskiEngine SparseTensor in ZeRO Stage 3 (DeepSpeed v0.9.2)
#7442 commented on
Aug 9, 2025 • 0 new comments -
nv-torch-nightly-v100 CI test failure
#7467 commented on
Aug 14, 2025 • 0 new comments -
nv-nightly CI test failure
#7140 commented on
Aug 14, 2025 • 0 new comments -
Enable python 3.11 and 3.12 tests
#7007 commented on
Aug 11, 2025 • 0 new comments -
Update Domino for Llama3
#7084 commented on
Aug 11, 2025 • 0 new comments -
Create COMMITTERS_RESPONSIBILITY.md
#7300 commented on
Aug 12, 2025 • 0 new comments -
Try to support deepspeed offload states with ZeRO1 and ZeRO2
#7421 commented on
Aug 12, 2025 • 0 new comments -
Fix invalid f-strings
#7457 commented on
Aug 12, 2025 • 0 new comments