-
Notifications
You must be signed in to change notification settings - Fork 25.9k
Closed
Labels
module: docsRelated to our documentation, both in docs/ and docblocksRelated to our documentation, both in docs/ and docblocksmodule: optimizerRelated to torch.optimRelated to torch.optimtriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module
Description
📚 The doc issue
Lines 469 to 476 in 0bd7b7a
| if amsgrad: | |
| # Maintains the maximum of all 2nd moment running avg. till now | |
| torch.maximum(max_exp_avg_sqs[i], exp_avg_sq, out=max_exp_avg_sqs[i]) | |
| # Use the max. for normalizing running avg. of gradient | |
| denom = (max_exp_avg_sqs[i].sqrt() / bias_correction2_sqrt).add_(eps) | |
| else: | |
| denom = (exp_avg_sq.sqrt() / bias_correction2_sqrt).add_(eps) |
In the code, the bias correction term

Suggest a potential alternative/fix
cc @svekars @brycebortree @sekyondaMeta @AlannaBurke @vincentqb @jbschlosser @albanD @janeyx99 @crcrpar
Metadata
Metadata
Assignees
Labels
module: docsRelated to our documentation, both in docs/ and docblocksRelated to our documentation, both in docs/ and docblocksmodule: optimizerRelated to torch.optimRelated to torch.optimtriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate moduleThis issue has been looked at a team member, and triaged and prioritized into an appropriate module