To add Nesterov Adam algorithm for multi-tensor optimizers API #59165

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Closed

iramazanli wants to merge 1 commit into pytorch:master from iramazanli:adding_multitensor_nadam

Contributor

iramazanli commented May 28, 2021 •

edited

Loading

Previously in the PR: #59009 we added NAdam to Optimizers. Here in this PR we are proposing multi-tensor version of NAdam for PyTorch.

Nadam has been proposed in the paper https://openreview.net/forum?id=OM0jvwB8jIp57ZJjtNEZ and report and report : http://cs229.stanford.edu/proj2015/054_report.pdf by Timothy Dozat.

It has been one of the most used algorithm in Deep Learning community.

It worth to noting that the implementation of NAdam is inspired by the implementation for Keras :
https://github.com/tensorflow/tensorflow/blob/f9d386849581d15d72f6f1f96f12aac230a8edbe/tensorflow/python/keras/optimizer_v2/nadam.py

facebook-github-bot added the cla signed label

Contributor

facebook-github-bot commented May 28, 2021 •

edited

Loading

💊 CI failures summary and remediations

As of commit 6c0595d (more details on the Dr. CI page and at hud.pytorch.org/pr/59165):

2/2 failures possibly* introduced in this PR
- 1/2 non-scanned failure(s)

1 failure not recognized by patterns:

Job	Step	Action
^{pytorch_linux_bionic_py3_8_gcc9_coverage_test2}	^{Run tests}	🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

iramazanli changed the title ~~Adding multitensor nadam~~ To add Rectified Adam for multi-tensor API

iramazanli changed the title ~~To add Rectified Adam for multi-tensor API~~ To add multi-tensor API for Nesterov Adam

iramazanli force-pushed the adding_multitensor_nadam branch 15 times, most recently from be6e6e0 to 2f603a3 Compare

May 30, 2021 20:18

iramazanli changed the title ~~To add multi-tensor API for Nesterov Adam~~ To add Nesterov Adam algorithm for multi-tensor optimizers API

iramazanli requested a review from vincentqb

June 1, 2021 04:26

vincentqb reviewed

View reviewed changes

torch/optim/_functional.py Outdated

Contributor

vincentqb Jun 15, 2021

does this belong to this PR? looks like #59009 got added here?

Contributor Author

iramazanli Jun 24, 2021

it belongs to #59009 :

vincentqb reviewed

View reviewed changes

Contributor

vincentqb left a comment

Let's look at #59009 first

iramazanli force-pushed the adding_multitensor_nadam branch 7 times, most recently from 3fdae58 to 60c0f39 Compare

June 24, 2021 10:01

iramazanli force-pushed the adding_multitensor_nadam branch from 51e8b50 to 284ba60 Compare

June 24, 2021 19:21

Contributor

facebook-github-bot commented Jun 25, 2021

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

vincentqb approved these changes

View reviewed changes

Contributor

vincentqb left a comment

LGTM. please use 4e-5 for the tolerance on gpu so that the test passes.

iramazanli closed this

iramazanli reopened this

iramazanli force-pushed the adding_multitensor_nadam branch 3 times, most recently from 5039362 to 9d423a5 Compare

June 27, 2021 17:09

Contributor

facebook-github-bot commented Jun 27, 2021

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

iramazanli force-pushed the adding_multitensor_nadam branch from 9d423a5 to 4093d27 Compare

June 27, 2021 17:17

Contributor

facebook-github-bot commented Jun 27, 2021

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

iramazanli force-pushed the adding_multitensor_nadam branch 2 times, most recently from 9a19faf to 569c777 Compare

June 27, 2021 17:38

Contributor

facebook-github-bot commented Jun 27, 2021

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

iramazanli force-pushed the adding_multitensor_nadam branch from 569c777 to f68a9e2 Compare

June 27, 2021 18:16

Contributor

facebook-github-bot commented Jun 27, 2021

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

iramazanli force-pushed the adding_multitensor_nadam branch 4 times, most recently from 5f80cc1 to cd97229 Compare

June 27, 2021 21:02


          To add Nesterov Adam algorithm for multi-tensor optimizers API

6c0595d

iramazanli force-pushed the adding_multitensor_nadam branch from cd97229 to 6c0595d Compare

June 27, 2021 21:07

Contributor

facebook-github-bot commented Jun 27, 2021

@iramazanli has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot closed this in

f0e972a

Contributor

facebook-github-bot commented Jun 28, 2021

@iramazanli merged this pull request in f0e972a.

facebook-github-bot added the Merged label

asuhan pushed a commit to asuhan/pytorch that referenced this pull request


          To add Nesterov Adam algorithm for multi-tensor optimizers API (pytor…

0099eea

…ch#59165)

Summary:
Previously in the PR: pytorch#59009 we added NAdam to Optimizers.  Here in this PR we are proposing multi-tensor version of NAdam for PyTorch.

Nadam has been proposed in the paper   https://openreview.net/forum?id=OM0jvwB8jIp57ZJjtNEZ and report  and report : http://cs229.stanford.edu/proj2015/054_report.pdf by Timothy Dozat.

It has been one of the most used algorithm in Deep Learning community.

It worth to noting that the implementation of NAdam is inspired by the implementation for Keras :
https://github.com/tensorflow/tensorflow/blob/f9d386849581d15d72f6f1f96f12aac230a8edbe/tensorflow/python/keras/optimizer_v2/nadam.py

Pull Request resolved: pytorch#59165

Reviewed By: vincentqb

Differential Revision: D29360577

Pulled By: iramazanli

fbshipit-source-id: 0fe14016303b2df2cb8cc31912a2674acf63d1e5

asuhan pushed a commit that referenced this pull request


          To add Nesterov Adam algorithm for multi-tensor optimizers API (#59165)

b8f02b6

Summary:
Previously in the PR: #59009 we added NAdam to Optimizers.  Here in this PR we are proposing multi-tensor version of NAdam for PyTorch.

Nadam has been proposed in the paper   https://openreview.net/forum?id=OM0jvwB8jIp57ZJjtNEZ and report  and report : http://cs229.stanford.edu/proj2015/054_report.pdf by Timothy Dozat.

It has been one of the most used algorithm in Deep Learning community.

It worth to noting that the implementation of NAdam is inspired by the implementation for Keras :
https://github.com/tensorflow/tensorflow/blob/f9d386849581d15d72f6f1f96f12aac230a8edbe/tensorflow/python/keras/optimizer_v2/nadam.py

Pull Request resolved: #59165

Reviewed By: vincentqb

Differential Revision: D29360577

Pulled By: iramazanli

fbshipit-source-id: 0fe14016303b2df2cb8cc31912a2674acf63d1e5

crcrpar mentioned this pull request

Foreach Functions Tracking Issue #58833

Open

28 tasks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla signed Merged