
about training memory optimization #41

Closed
zhangvia opened this issue Dec 21, 2023 · 3 comments

Comments

@zhangvia

In the README, you mentioned that you would optimize the training code using DeepSpeed and Accelerate. However, as far as I know, the DeepSpeed functionality integrated into the Accelerate library does not support multi-model training. Do you have any suggestions?
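(Editor's note: a common workaround for this limitation is to wrap all trainable sub-networks in a single `nn.Module`, so that `accelerator.prepare` hands DeepSpeed one model. A minimal sketch; the wrapper and sub-module names are illustrative and not taken from this repo:)

```python
import torch
import torch.nn as nn

class MultiModelWrapper(nn.Module):
    """Bundle several trainable networks into one module so that
    Accelerate's DeepSpeed integration sees a single model.

    The sub-module names (denoiser, guider) are hypothetical placeholders.
    """

    def __init__(self, denoiser: nn.Module, guider: nn.Module):
        super().__init__()
        # Registering both as sub-modules places all of their parameters
        # under one DeepSpeed engine after accelerator.prepare().
        self.denoiser = denoiser
        self.guider = guider

    def forward(self, x: torch.Tensor, cond: torch.Tensor) -> torch.Tensor:
        # Route the joint forward pass through both networks so gradients
        # reach every parameter through the single wrapped model.
        return self.denoiser(x + self.guider(cond))
```

Usage would then be `model = MultiModelWrapper(denoiser, guider)` followed by `model, optimizer, loader = accelerator.prepare(model, optimizer, loader)`, so DeepSpeed only ever manages one model object.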

@zhangvia
Author

Also, from my test results, it appears that with the resolution set to 512, first-stage training cannot run on a 40GB A100 even with a batch size of 1. Is this normal?

@guoqincode
Owner

This is normal; 40GB is too small for this training.
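(Editor's note: for readers hitting the same wall, the standard memory reducers may help before scaling up hardware. A hedged sketch assuming a diffusers-style UNet; the `unet` argument, learning rate, and function name are illustrative, and whether these make 40GB sufficient for 512-resolution first-stage training here is untested:)

```python
from accelerate import Accelerator
import bitsandbytes as bnb  # optional dependency for 8-bit optimizer states

def prepare_low_memory_training(unet, lr=1e-5):
    # Gradient checkpointing trades recomputation in the backward pass
    # for a large drop in stored activations (available on diffusers UNets).
    unet.enable_gradient_checkpointing()
    # 8-bit AdamW keeps optimizer states in int8, roughly quartering
    # their footprint versus fp32 AdamW.
    optimizer = bnb.optim.AdamW8bit(unet.parameters(), lr=lr)
    # fp16 mixed precision halves most activation memory in the forward pass.
    accelerator = Accelerator(mixed_precision="fp16")
    return accelerator.prepare(unet, optimizer)
```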

@zhangvia
Author

> This is normal; 40GB is too small for this training.

What about the first question?
