Add NovoGrad optimizer #385
Conversation
Add NovoGrad optimizer
The NovoGrad optimizer was presented in this paper: https://arxiv.org/pdf/1905.11286.pdf
It was used to train the ASR model named Jasper: https://arxiv.org/pdf/1904.03288.pdf
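For readers skimming the PR, here is a compact sketch of the per-layer NovoGrad update described in the paper. The function and variable names are illustrative only, not the names used in this PR's implementation, and the hyperparameter defaults are placeholders:

```python
import jax.numpy as jnp

def novograd_step(w, g, mu, v, step, lr=1e-2, b1=0.95, b2=0.5,
                  eps=1e-8, weight_decay=0.0):
  """One NovoGrad step for a single layer (illustrative names)."""
  g_norm_sq = jnp.sum(g ** 2)  # per-layer squared gradient norm
  # The second moment is a scalar per layer, initialized on the first step.
  v = jnp.where(step == 0, g_norm_sq, b2 * v + (1 - b2) * g_norm_sq)
  # Gradients are normalized by the layer-wise norm before momentum, with
  # decoupled weight decay added to the normalized gradient.
  g_hat = g / (jnp.sqrt(v) + eps) + weight_decay * w
  mu = b1 * mu + g_hat
  return w - lr * mu, mu, v
```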
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up to date status, view the checks section at the bottom of the pull request.
There are a couple of conflicts; could you sync and merge so we get this submitted?
Hello! Yes, I will try to fix it as soon as possible.
@@ -149,6 +149,7 @@ Gradient Transforms
     scale_by_adam
     scale_by_belief
     scale_by_factored_rms
+    scale_by_novograd
Could you also add `novograd` to the Common Optimizers section?
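For completeness, the alias the reviewer refers to would be used like the other common optimizers. A minimal sketch, assuming `novograd` is exported at the top level as the doc entry implies:

```python
import optax

# Assumed to combine scale_by_novograd with a learning rate,
# mirroring how optax.adam wraps scale_by_adam.
optimizer = optax.novograd(learning_rate=1e-3)
```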
In optax/_src/transform.py (outdated):
mu_dtype: optional `dtype` to be used for the first order accumulator; if
  `None` then the `dtype` is inferred from `params` and `updates`.
Returns:
  An (init_fn, update_fn) tuple.
"A GradientTransformation
object."
Thanks a lot for the contribution!
@DT6A could you please add a test for the new optimizer?
@hbq1 is there any reference for how the test should be done?
Hi! We are in the process of writing instructions on this at the moment - in your case it should be enough to just add the new optimizer to the list of optimizers under test.
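The change amounts to one more entry in a parameterized list. The sketch below is hypothetical, since the comment above truncates the file name; `_OPTIMIZERS_UNDER_TEST` and the dict layout are assumptions about how that list is organized, not quoted from the PR:

```python
# Hypothetical list-of-optimizers-under-test pattern.
_OPTIMIZERS_UNDER_TEST = (
    dict(opt_name='sgd', opt_kwargs=dict(learning_rate=1e-3, momentum=0.9)),
    dict(opt_name='adam', opt_kwargs=dict(learning_rate=1e-2)),
    # The new optimizer only needs one more entry here; the shared test then
    # checks it can fit a simple target just like the existing optimizers.
    dict(opt_name='novograd', opt_kwargs=dict(learning_rate=1e-2)),
)
```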
@mkunesch Thanks, added it
Thank you!
PiperOrigin-RevId: 482336516
Merged! Thanks again for this great contribution 🎉