Skip to content

FP16 with Zero and Gradient Accumulation in Configuration File #1190

Discussion options

You must be logged in to vote

Hi, Enrico
ZeRO will convert the master weight into fp16 type during training, and convert them into fp32 type during steping. So you don't need to set fp16 configuration when ZeRO is used in your training.

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@conceptofmind
Comment options

@YuliangLiu0306
Comment options

@conceptofmind
Comment options

@AntoineBlanot
Comment options

Answer selected by conceptofmind
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
3 participants