D-Adaptation and Prodigy contrib implementations #651
Conversation
Thanks @adefazio for the contribution!
Thanks for the contribution!
Minor comment: if s and d had more explicit names, that could help a newcomer investigating the algorithm, but plenty of algorithms have terse parameter names like b1 or beta, so it's fine as is too.
Looks perfect, thank you! Final request: could you squash your commits into one?
I've squashed the commits, thanks for reviewing so quickly!
Implementations of D-Adaptation AdamW and the related method Prodigy, based on the official PyTorch implementations. I have verified that they give the same outputs as the PyTorch versions on an example problem. The unit tests are similar to those used for the other optimizers in contrib.
https://github.com/facebookresearch/dadaptation
https://github.com/konstmish/prodigy
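For illustration, here is a minimal sketch of a learning-rate-free training loop, assuming the optimizer is exposed as `optax.contrib.dadapt_adamw` (with `optax.contrib.prodigy` used analogously); the toy least-squares problem and the hyperparameter choices are illustrative assumptions, not taken from this PR.

```python
import jax
import jax.numpy as jnp
import optax

def loss_fn(params, x, y):
    # Simple least-squares loss for illustration.
    pred = x @ params
    return jnp.mean((pred - y) ** 2)

key = jax.random.PRNGKey(0)
x = jax.random.normal(key, (32, 4))
y = jnp.sum(x, axis=1)
params = jnp.zeros(4)

# For D-Adaptation-style methods, learning_rate acts as a multiplier on
# the internally estimated step size, so 1.0 is the natural default.
optimizer = optax.contrib.dadapt_adamw(learning_rate=1.0)  # assumed name
opt_state = optimizer.init(params)

for _ in range(100):
    grads = jax.grad(loss_fn)(params, x, y)
    updates, opt_state = optimizer.update(grads, opt_state, params)
    params = optax.apply_updates(params, updates)
```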
These two new optimizers perform learning-rate adaptation, similar to Mechanic and COCOB, two optimizers already included in contrib, but via a different mechanism, so I think they are relevant to Optax and interesting to the community. D-Adaptation won an ICML outstanding paper award and is already gaining significant traction in the ML community, particularly for fine-tuning diffusion models with the Prodigy variant.