
ZeRO++ support in Accelerate's DeepSpeed integration #2020

Closed
SumanthRH opened this issue Oct 2, 2023 · 3 comments
@SumanthRH (Contributor) commented Oct 2, 2023:

Hi,

I've been going over multi-node training strategies with Accelerate + DeepSpeed and had a question: does the Accelerate integration support ZeRO++? With ZeRO++, one can, for example, use a hybrid sharding strategy where DeepSpeed ZeRO-3 runs within each machine and data parallelism runs across machines. I couldn't find any information on whether this is supported. It's a simple config change in DeepSpeed (`zero_hpz_partition_size`), so I'm guessing it works out of the box, but I wanted to confirm. I also see that the corresponding hybrid sharding strategy is supported by the FSDP integration.

It would be great if this could be clarified in the docs as well!

cc @pacman100
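For reference, a minimal sketch of what the DeepSpeed config change might look like, based on DeepSpeed's ZeRO++ documentation (the partition size of 8 is just an example, typically set to the number of GPUs per node; the quantization flags are optional ZeRO++ features and not required for hierarchical partitioning alone):

```json
{
  "zero_optimization": {
    "stage": 3,
    "zero_hpz_partition_size": 8,
    "zero_quantized_weights": true,
    "zero_quantized_gradients": true
  }
}
```

With `zero_hpz_partition_size` set to the per-node GPU count, parameters are sharded ZeRO-3-style within each node while gradients are reduced across nodes, trading memory for reduced inter-node communication.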

@SumanthRH (Contributor, Author) commented:
Ping @pacman100 @muellerzr

@pacman100 (Contributor) commented:
Hello @SumanthRH, yes, as you suggested, it is a simple config change and should be supported by the current DeepSpeed integration. If you already have a PR in mind with the doc updates, it would be much appreciated. Thank you!
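Since the integration passes a user-supplied DeepSpeed config file through unchanged, one way to wire this up is via `deepspeed_config_file` in the `accelerate` config. A rough sketch, assuming a two-node setup with 8 GPUs each (the filename `ds_zeropp_config.json` is hypothetical, and the surrounding values are illustrative, not prescriptive):

```yaml
compute_environment: LOCAL_MACHINE
distributed_type: DEEPSPEED
deepspeed_config:
  # Path to the JSON file containing the ZeRO++ settings
  # (e.g. zero_hpz_partition_size under zero_optimization)
  deepspeed_config_file: ds_zeropp_config.json
  zero3_init_flag: true
num_machines: 2
num_processes: 16
machine_rank: 0
mixed_precision: bf16
```

Launching with `accelerate launch --config_file <this file> train.py` should then hand the ZeRO++ options straight to DeepSpeed.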

@SumanthRH (Contributor, Author) commented:
Hi @pacman100 , great! I'll put up a PR soon for the documentation!
