Efficiency with deepspeed #11

jennydaman · 2023-06-23T23:02:00Z

This may be a naive suggestion, but could we lower VRAM usage and/or improve speed by using DeepSpeed (https://github.com/microsoft/DeepSpeed)?

daviddmc · 2023-07-01T14:06:00Z

Thanks for the suggestion. We already use some of those techniques, e.g., mixed precision training, to reduce GPU memory usage and improve efficiency. Other techniques, such as offloading, are useful for large models but are probably not necessary in our case. There may be newer techniques that I am not aware of, though, so I will keep an eye on it.
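For readers unfamiliar with the two techniques named above, here is a minimal sketch of how mixed precision and optimizer offloading are typically switched on in a DeepSpeed-style configuration. It is not this project's actual setup: the helper name `build_deepspeed_config`, the batch size, and the choice of ZeRO stage 2 are assumptions made for illustration only. The sketch just builds and prints the config dictionary, so it runs without DeepSpeed installed.

```python
import json


def build_deepspeed_config(use_offload: bool = False, batch_size: int = 8) -> dict:
    """Hypothetical helper: build a DeepSpeed-style config dictionary.

    The values are illustrative assumptions, not settings from this project.
    """
    config = {
        "train_batch_size": batch_size,
        # Mixed precision: run most of the training step in fp16.
        "fp16": {"enabled": True},
        # ZeRO stage 2 partitions optimizer state and gradients across GPUs.
        "zero_optimization": {"stage": 2},
    }
    if use_offload:
        # Offloading moves optimizer state to CPU memory. This mainly helps
        # when the model is too large to fit in GPU memory.
        config["zero_optimization"]["offload_optimizer"] = {
            "device": "cpu",
            "pin_memory": True,
        }
    return config


if __name__ == "__main__":
    print(json.dumps(build_deepspeed_config(use_offload=True), indent=2))
```

With `use_offload=False` the config enables only mixed precision and ZeRO partitioning, which is the lighter option when the model already fits in GPU memory.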
