Add support for FusedAdam to be mathematically equivalent to pytorch/AdamW by baijumeswani · Pull Request #10106 · microsoft/onnxruntime

baijumeswani · 2021-12-21T23:12:10Z

ORT's FusedAdam is currently mathematically equivalent to transformers/AdamW. Users wanting to work with pytorch/AdamW mathematical implementation would see convergence disparity because of the subtle differences.

This pull request introduces a way for users to select the implementation they want so that they can get the performance gains, as well as aligned convergence.

…AdamW

…o bmeswani/update_fused_adam

Add support for FusedAdam to be mathematically equivalent to pytorch/…

8de0978

…AdamW

baijumeswani requested review from SherlockNoMad, liqunfu, thiagocrepaldi, tlh20 and xadupre as code owners December 21, 2021 23:12

baijumeswani added training issues related to ONNX Runtime training; typically submitted using template component:training-frontend labels Dec 21, 2021

xadupre reviewed Jan 10, 2022

View reviewed changes

Comment thread orttraining/orttraining/python/training/optim/fused_adam.py Outdated

xadupre reviewed Jan 10, 2022

View reviewed changes

Comment thread ...ttraining/python/training/ortmodule/torch_cpp_extensions/cuda/fused_ops/multi_tensor_adam.cu Outdated

xadupre reviewed Jan 10, 2022

View reviewed changes

Comment thread orttraining/orttraining/python/training/optim/fused_adam.py

baijumeswani added 2 commits January 11, 2022 00:57

Address pull request review comments

cab4762

Merge branch 'master' of https://github.com/microsoft/onnxruntime int…

0a3113d

…o bmeswani/update_fused_adam

ytaous approved these changes Jan 20, 2022

View reviewed changes

SherlockNoMad approved these changes Jan 21, 2022

View reviewed changes

baijumeswani merged commit 1416065 into master Jan 21, 2022

baijumeswani deleted the bmeswani/update_fused_adam branch January 21, 2022 21:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for FusedAdam to be mathematically equivalent to pytorch/AdamW#10106

Add support for FusedAdam to be mathematically equivalent to pytorch/AdamW#10106
baijumeswani merged 3 commits into
masterfrom
bmeswani/update_fused_adam

baijumeswani commented Dec 21, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

baijumeswani commented Dec 21, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants