Implement stateless wrapper. #246
Conversation
Hey! Thank you so much for this PR!
I'll do a detailed review tomorrow, but I have two quick high-level questions/comments:
- I think this code could actually be part of `base.py`. It's not really a wrapper in that it doesn't wrap a gradient transformation; instead, I think it's a really fundamental tool for creating gradient transformations, so it could go into `base.py`. What do you think?
- Could we replace `StatelessState` by `base.EmptyState`, or is there an advantage to `stateless` having its own state? (A sketch of the comparison follows below.)
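A minimal sketch of the comparison, under the assumption that `StatelessState` is (like `base.EmptyState`) just an empty `NamedTuple`:

```python
from typing import NamedTuple


class EmptyState(NamedTuple):
  """Empty state, as optax already provides in base.EmptyState."""


class StatelessState(NamedTuple):
  """Hypothetical dedicated state for the stateless wrapper.

  Structurally identical to EmptyState, so unless the distinct type is
  useful (e.g. for seeing which transformation produced the state),
  reusing base.EmptyState avoids the duplication.
  """
```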
Thanks a lot again and as I said, I'll take a detailed look tomorrow!
Thanks! I like this idea; I can imagine using this for creating quick transformations :)
I think I would be inclined to simplify this a bit and to remove the `on_leaves` part (and also, I don't think we need to check for `params` being `None`).
We could also add an explicit example to the docs of using this as a decorator:
```python
import jax
import optax


def stateless_gradient_transformation(f) -> optax.GradientTransformation:
  def init_fn(_):
    return optax.EmptyState()

  def update_fn(updates, state, params=None):
    return f(updates, params), state

  return optax.GradientTransformation(init_fn, update_fn)


@stateless_gradient_transformation
def double_grads(updates, params):
  return jax.tree_map(lambda x: 2 * x, updates)


opt = optax.chain(optax.adam(1e-4), double_grads)
state = opt.init(jax.numpy.array([]))
```
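For context, here is one way the chained optimizer could then be driven (a hedged sketch continuing the snippet above; the `params`/`grads` values are illustrative):

```python
import jax.numpy as jnp

params = jnp.array([1.0, 2.0, 3.0])
grads = jnp.array([0.1, 0.1, 0.1])

state = opt.init(params)
# The chain applies adam first, then double_grads doubles adam's output.
updates, state = opt.update(grads, state, params)
params = optax.apply_updates(params, updates)
```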
What do you think?
Note: for a parametrized example, one could write:

```python
def gradient_multiplier(factor):
  @stateless_gradient_transformation
  def transformation(updates, params):
    return jax.tree_map(lambda x: x * factor, updates)

  return transformation


opt = optax.chain(optax.adam(1e-4), gradient_multiplier(2.0))
```
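A nice property of this closure-based form is that `factor` is fixed when the transformation is constructed, so the transformation itself remains stateless; anything that has to vary from step to step would instead need to live in the optimizer state.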
(@mkunesch pointed out to me that there's already a thread on this in the issues; sorry I hadn't read that! I'll let you both figure out what to do here :) )
Thanks @rosshemsley for the feedback! I resolved a couple of your comments and left responses to others in the code review.
Hi! I think this looks great - thanks a lot! I added comments on the choice of names and documentation layout and some minor thoughts on the test.
Thanks again!
Thanks for making all the changes, this looks great! The only comment I have is on the positioning in the documentation but otherwise LGTM!
Agreed, fixed!
Would it be possible to rerun the tests? The failures were in …
Thanks! I have triggered the test workflow again now that we have fixed the broken test you mentioned.
Hi! The checks still don't pass ... could you sync the latest changes? In theory we fixed this error on Monday. I've also flagged a line which gives a wrong import order error in pylint.
Once the checks pass I think it's ready to merge! Thanks a lot again!
optax/_src/base_test.py (Outdated)

```
@@ -15,8 +15,11 @@
"""Tests for base.py."""

from absl.testing import absltest

import chex
import numpy as np
```
(Pylint will raise a `wrong-import-order` error here.)
I'll fix this, but running `test.sh` locally did not raise this issue for me (I tried running `pylint --rcfile=.pylintrc optax/_src/base_test.py` separately as well).
Thanks for letting me know! This would definitely be blocked internally as we try to merge, so I'll look into adding an import-order check to `test.sh`.
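Such a check could plausibly take a form like the following (a sketch only, not the actual `test.sh` change; which pylint messages to enable is an assumption):

```sh
# Hypothetical snippet for test.sh: run pylint with only the
# import-ordering checks enabled, so violations fail the build.
pylint --rcfile=.pylintrc \
  --disable=all \
  --enable=wrong-import-order,wrong-import-position,ungrouped-imports \
  optax
```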
Thanks a lot again for suggesting and implementing the stateless optimizer. This was such a great idea! We could think about whether to implement a few of the existing gradient transformations in optax using `stateless`. Thanks again!
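As one illustration of that idea (a sketch, assuming the merged helper is exposed as `optax.stateless` and accepts an `(updates, params) -> updates` function), `scale` could be expressed as:

```python
import jax
import optax


def scale(step_size: float) -> optax.GradientTransformation:
  # Stateless reimplementation sketch of optax.scale: multiply every
  # leaf of the updates pytree by step_size; no state is carried.
  return optax.stateless(
      lambda updates, params: jax.tree_map(lambda u: step_size * u, updates))
```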
Closes #104.
Not sure if this belongs in `wrappers.py`; please let me know if I should move it elsewhere.