Skip to content

Conversation

iramazanli
Copy link
Contributor

@iramazanli iramazanli commented May 28, 2021

Previously in the PR: #59009 we added NAdam to Optimizers. Here in this PR we are proposing multi-tensor version of NAdam for PyTorch.

Nadam has been proposed in the paper https://openreview.net/forum?id=OM0jvwB8jIp57ZJjtNEZ and report and report : http://cs229.stanford.edu/proj2015/054_report.pdf by Timothy Dozat.

It has been one of the most used algorithm in Deep Learning community.

It worth to noting that the implementation of NAdam is inspired by the implementation for Keras :
https://github.com/tensorflow/tensorflow/blob/f9d386849581d15d72f6f1f96f12aac230a8edbe/tensorflow/python/keras/optimizer_v2/nadam.py

@facebook-github-bot
Copy link
Contributor

facebook-github-bot commented May 28, 2021

💊 CI failures summary and remediations

As of commit 6c0595d (more details on the Dr. CI page and at hud.pytorch.org/pr/59165):


  • 2/2 failures possibly* introduced in this PR
    • 1/2 non-scanned failure(s)

1 failure not recognized by patterns:

Job Step Action
CircleCI pytorch_linux_bionic_py3_8_gcc9_coverage_test2 Run tests 🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

@iramazanli iramazanli changed the title Adding multitensor nadam To add Rectified Adam for multi-tensor API May 28, 2021
@iramazanli iramazanli changed the title To add Rectified Adam for multi-tensor API To add multi-tensor API for Nesterov Adam May 28, 2021
@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch 15 times, most recently from be6e6e0 to 2f603a3 Compare May 30, 2021 20:18
@iramazanli iramazanli changed the title To add multi-tensor API for Nesterov Adam To add Nesterov Adam algorithm for multi-tensor optimizers API Jun 1, 2021
@iramazanli iramazanli requested a review from vincentqb June 1, 2021 04:26
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does this belong to this PR? looks like #59009 got added here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it belongs to #59009 :

Copy link
Contributor

@vincentqb vincentqb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's look at #59009 first

@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch 7 times, most recently from 3fdae58 to 60c0f39 Compare June 24, 2021 10:01
@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch from 51e8b50 to 284ba60 Compare June 24, 2021 19:21
@facebook-github-bot
Copy link
Contributor

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Copy link
Contributor

@vincentqb vincentqb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. please use 4e-5 for the tolerance on gpu so that the test passes.

@iramazanli iramazanli closed this Jun 25, 2021
@iramazanli iramazanli reopened this Jun 25, 2021
@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch 3 times, most recently from 5039362 to 9d423a5 Compare June 27, 2021 17:09
@facebook-github-bot
Copy link
Contributor

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch from 9d423a5 to 4093d27 Compare June 27, 2021 17:17
@facebook-github-bot
Copy link
Contributor

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch 2 times, most recently from 9a19faf to 569c777 Compare June 27, 2021 17:38
@facebook-github-bot
Copy link
Contributor

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch from 569c777 to f68a9e2 Compare June 27, 2021 18:16
@facebook-github-bot
Copy link
Contributor

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch 4 times, most recently from 5f80cc1 to cd97229 Compare June 27, 2021 21:02
@iramazanli iramazanli force-pushed the adding_multitensor_nadam branch from cd97229 to 6c0595d Compare June 27, 2021 21:07
@facebook-github-bot
Copy link
Contributor

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@iramazanli merged this pull request in f0e972a.

asuhan pushed a commit to asuhan/pytorch that referenced this pull request Jun 28, 2021
…ch#59165)

Summary:
Previously in the PR: pytorch#59009 we added NAdam to Optimizers.  Here in this PR we are proposing multi-tensor version of NAdam for PyTorch.

Nadam has been proposed in the paper   https://openreview.net/forum?id=OM0jvwB8jIp57ZJjtNEZ and report  and report : http://cs229.stanford.edu/proj2015/054_report.pdf by Timothy Dozat.

It has been one of the most used algorithm in Deep Learning community.

It worth to noting that the implementation of NAdam is inspired by the implementation for Keras :
https://github.com/tensorflow/tensorflow/blob/f9d386849581d15d72f6f1f96f12aac230a8edbe/tensorflow/python/keras/optimizer_v2/nadam.py

Pull Request resolved: pytorch#59165

Reviewed By: vincentqb

Differential Revision: D29360577

Pulled By: iramazanli

fbshipit-source-id: 0fe14016303b2df2cb8cc31912a2674acf63d1e5
asuhan pushed a commit that referenced this pull request Jun 30, 2021
Summary:
Previously in the PR: #59009 we added NAdam to Optimizers.  Here in this PR we are proposing multi-tensor version of NAdam for PyTorch.

Nadam has been proposed in the paper   https://openreview.net/forum?id=OM0jvwB8jIp57ZJjtNEZ and report  and report : http://cs229.stanford.edu/proj2015/054_report.pdf by Timothy Dozat.

It has been one of the most used algorithm in Deep Learning community.

It worth to noting that the implementation of NAdam is inspired by the implementation for Keras :
https://github.com/tensorflow/tensorflow/blob/f9d386849581d15d72f6f1f96f12aac230a8edbe/tensorflow/python/keras/optimizer_v2/nadam.py

Pull Request resolved: #59165

Reviewed By: vincentqb

Differential Revision: D29360577

Pulled By: iramazanli

fbshipit-source-id: 0fe14016303b2df2cb8cc31912a2674acf63d1e5
@crcrpar crcrpar mentioned this pull request Nov 8, 2021
28 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants