Open
Labels
enhancement (New feature or request)
Description
Along with the updated training/validation components in #458 for TPU support, add support for DeepSpeed/ZeRO:
- https://pytorch.org/docs/master/distributed.optim.html#torch.distributed.optim.ZeroRedundancyOptimizer
- https://github.com/microsoft/DeepSpeed
It would be fairly easy to support with the current training code; however, the changes in #458 are going to be so significant that I do not want to maintain two branches of code.
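For reference, a minimal sketch of what wrapping an optimizer in PyTorch's `ZeroRedundancyOptimizer` might look like. It is not taken from the training code in this repo or from #458: the toy `nn.Linear` model, the `train_one_step_with_zero` helper, and the single-process `gloo` group are assumptions chosen only so the snippet runs on one CPU. Real training would launch one process per device and would normally wrap the model in `DistributedDataParallel` as well.

```python
import os
import tempfile

import torch
import torch.distributed as dist
from torch import nn
from torch.distributed.optim import ZeroRedundancyOptimizer


def train_one_step_with_zero(lr: float = 0.1) -> float:
    """Run one SGD step through ZeroRedundancyOptimizer and return the loss."""
    # Assumption: a single-process "gloo" group, purely so this runs on one CPU.
    if not dist.is_initialized():
        init_path = os.path.join(tempfile.mkdtemp(), "pg_init")
        dist.init_process_group(
            "gloo", init_method=f"file://{init_path}", rank=0, world_size=1
        )

    # Hypothetical toy model standing in for a real network.
    model = nn.Linear(4, 2)

    # ZeRO shards optimizer state across ranks; the wrapped optimizer class
    # and its keyword arguments (here lr) are passed through.
    optimizer = ZeroRedundancyOptimizer(
        model.parameters(), optimizer_class=torch.optim.SGD, lr=lr
    )

    inputs = torch.randn(8, 4)
    targets = torch.randn(8, 2)
    loss = nn.functional.mse_loss(model(inputs), targets)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

With more than one rank, each process would hold only its shard of the optimizer state, which is where the memory saving comes from.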