Add a module to apply updates every k steps (and accumulate them othe… #2350

perolat · 2020-03-04T16:30:56Z

…rwise)

mtthss · 2020-03-04T16:40:36Z

Looks great!

tomhennigan · 2020-03-05T09:26:48Z

jax/experimental/optix.py

+    reset = state.count % k
+    emit = reset == (k - 1)
+    grad_acc = tree_multimap(
+        lambda g, ga: (reset == 0) * ga + g, updates, state.grad_acc)


The name reset doesn't actually reflect what's in the variable and I think the guard is wrong (afaik reset == 0 should actually be reset != 0).

I'd suggest:

    c = state.count % k
    acc = c != 0
    grad_acc = tree_multimap(lambda g, ga: acc * ga + g, updates, state.grad_acc)
    emit = c == (k - 1)
    updates = tree_multimap(lambda ga: emit * ga, grad_acc)

It would also be great to have a test for this to avoid regressions.
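
To make the guard discussion concrete, here is a minimal framework-free sketch of the accumulate-then-emit logic, with plain Python lists standing in for the gradient pytrees and list comprehensions standing in for tree_multimap. The names apply_every_step, run, and keep_sum are hypothetical, introduced only for illustration; this is not the code merged into optix.

```python
def apply_every_step(count, grad_acc, update, k, keep_sum):
    """One step: returns (new_count, new_grad_acc, emitted_update)."""
    c = count % k
    # keep_sum(c) decides whether the running sum survives this step.
    keep = float(keep_sum(c))
    new_acc = [keep * ga + g for g, ga in zip(update, grad_acc)]
    # Release the accumulated sum only on the last step of each window.
    emit = float(c == k - 1)
    emitted = [emit * ga for ga in new_acc]
    return count + 1, new_acc, emitted


def run(k, updates, keep_sum):
    """Feed a sequence of updates through the accumulator and record outputs."""
    count, grad_acc, history = 0, [0.0] * len(updates[0]), []
    for update in updates:
        count, grad_acc, emitted = apply_every_step(
            count, grad_acc, update, k, keep_sum)
        history.append(emitted)
    return history


updates = [[1.0, 2.0]] * 6
# Guard from the review suggestion: restart the sum when c == 0.
suggested = run(3, updates, keep_sum=lambda c: c != 0)
# Guard from the diff under review: keep the sum only when c == 0.
original = run(3, updates, keep_sum=lambda c: c == 0)
```

With k = 3 and a constant update of [1.0, 2.0], the suggested guard emits the window sum [3.0, 6.0] on every third step, whereas the guard from the diff emits only the most recent update [1.0, 2.0], because the running sum is discarded on every step where c != 0.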

Thanks, Tom, for spotting that problem. I'd be happy to add some tests, but where is optix tested?

https://github.com/google/jax/blob/master/tests/optix_test.py


perolat · 2020-03-05T13:58:50Z

Tests added!

The tolerance is quite high: the absolute tolerance is 1e-6 and the relative tolerance is 100.
I tried lower tolerances (absolute tolerance 1e-10 and relative tolerance 1e-5), but the test would fail on TPU.

I am not sure this solution is good though. What is recommended to handle the lower numerical precision on TPU?
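
For a sense of how loose a relative tolerance of 100 is, here is a small sketch assuming the test compares values with the NumPy-style criterion |actual - desired| <= atol + rtol * |desired| (the thread does not say which comparison helper the test uses, so that is an assumption). The helper within_tolerance is hypothetical and written in plain Python so it runs without any dependencies.

```python
def within_tolerance(actual, desired, atol, rtol):
    """NumPy-style closeness check: |actual - desired| <= atol + rtol * |desired|."""
    return abs(actual - desired) <= atol + rtol * abs(desired)


# With rtol=100, a value 50x larger than expected still passes.
loose_accepts_large_error = within_tolerance(50.0, 1.0, atol=1e-6, rtol=100.0)

# With the tighter settings, the same error is rejected...
tight_accepts_large_error = within_tolerance(50.0, 1.0, atol=1e-10, rtol=1e-5)

# ...while a discrepancy of about 1e-6 on a value near 1.0 is accepted.
tight_accepts_small_error = within_tolerance(1.000001, 1.0, atol=1e-10, rtol=1e-5)
```

Under that assumption, the loose setting mostly guards against errors in values close to zero (where the 1e-6 absolute term dominates) and says little about values of order one.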

mattjj · 2020-03-10T13:35:25Z

We can just skip the test on the TPU; look for the jtu.skip_on_devices helper.
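
The general pattern behind such a helper can be sketched with the standard library alone. This is not jtu.skip_on_devices or its implementation: skip_on, current_backend, and ApplyEveryTest are hypothetical names, and the backend is hard-coded to "tpu" purely so the skip path is exercised.

```python
import functools
import unittest


def current_backend():
    # Hypothetical stand-in for a real device query; hard-coded for illustration.
    return "tpu"


def skip_on(*backends):
    """Decorator that skips a test method when running on one of `backends`."""
    def decorator(test_method):
        @functools.wraps(test_method)
        def wrapper(self, *args, **kwargs):
            backend = current_backend()
            if backend in backends:
                raise unittest.SkipTest(f"test not supported on {backend}")
            return test_method(self, *args, **kwargs)
        return wrapper
    return decorator


class ApplyEveryTest(unittest.TestCase):

    @skip_on("tpu")
    def test_tight_tolerance(self):
        # Placeholder body standing in for a precision-sensitive comparison.
        self.assertAlmostEqual(0.1 + 0.2, 0.3, places=12)


suite = unittest.defaultTestLoader.loadTestsFromTestCase(ApplyEveryTest)
result = unittest.TestResult()
suite.run(result)
```

Skipping keeps the tight tolerances on the backends that can meet them, rather than loosening the check everywhere to accommodate the least precise device.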

mattjj · 2020-03-10T13:40:34Z

I'll merge, then mark this test as skipped on TPU.

mattjj · 2020-03-10T14:35:13Z

I confirmed internal tests pass after merging this (and after cc53aa9).


googlebot added the cla: yes label Mar 4, 2020

tomhennigan reviewed Mar 5, 2020


Add a module to apply updates every k steps (and accumulate them othe…

3b34ab3

…rwise)

perolat force-pushed the changelist/298700363 branch from 6321a78 to 3b34ab3 Compare March 5, 2020 13:52

mattjj merged commit 5c3b478 into google:master Mar 10, 2020

mattjj added a commit that referenced this pull request Mar 10, 2020

skip new optix test on tpu (cf. #2350)

cc53aa9

srvasude pushed a commit to srvasude/jax that referenced this pull request May 5, 2020

Add a module to apply updates every k steps (and accumulate them othe…

9b3d2c4

…rwise) (google#2350)

srvasude pushed a commit to srvasude/jax that referenced this pull request May 5, 2020

skip new optix test on tpu (cf. google#2350)

1db8d08
