Skip to content

DeepSpeed still gives CUDA-out-of-memory issue #2302

@buttercutter

Description

@buttercutter

May I know why this training code still gives CUDA-out-of-memory issue even after DeepSpeed is turned on ?

image

See this for historical tracking purpose.

Metadata

Metadata

Assignees

Labels

questionFurther information is requestedtraining

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions