Make solver.update jittable and ensure output states are consistent. #106

mblondel · 2021-12-01T17:23:16Z

No description provided.

jaxopt/_src/test_util.py

froystig · 2021-12-02T21:24:44Z

jaxopt/_src/anderson.py

-    return AndersonState(iter_num=0,
-                         error=jnp.inf,
+    return AndersonState(iter_num=jnp.asarray(0),
+                         error=jnp.asarray(jnp.inf),


It seems worthwhile to document—in a comment or even in this PR's description—the reason for sending these scalars through jnp.asarray. My understanding is that we would like to return a state struct that is the same whether or not the update took place on device (via jit). Is that correct?

In some situations, we might avoid writing expressions like jnp.asarray(1.) whenever possible. If we're not under a jit, the expression allocates a scalar on the default device, which could be an accelerator. Considering this, what is this degree of consistency buying us, and is it worth this cost?

The goal was to avoid this:

jitted_update = jax.jit(solver.update) state = solver.init_state(params) # here state contains floats params, state = jitted_update(params, state) # here state contains arrays due to jit params, state = jitted_update(params, state) # recompilation occurs

I agree it could be nice to add a comment but not sure if we should repeat the same comment in all solvers.

I believe that if we use onp.asarray instead of jnp.asarray, where onp is plain numpy, we will on the one hand keep these scalars in host memory, and on the other hand be consistent for jit (in the sense of shape/dtype), such that it won't recompile.

@fllinares and I would like to understand this better. Since this PR is already an improvement, let's merge and explore onp.asarray separately (potentially we could make measurements).

I agree. Sounds good!

fllinares

On my side, everything LGTM! Thanks a lot Mathieu!

google-cla bot added the cla: yes label Dec 1, 2021

fllinares reviewed Dec 1, 2021

View reviewed changes

jaxopt/_src/test_util.py Outdated Show resolved Hide resolved

mblondel requested a review from froystig December 1, 2021 21:41

froystig reviewed Dec 2, 2021

View reviewed changes

Make solver.update jittable and ensure output states are consistent.

333f0a2

fllinares approved these changes Dec 3, 2021

View reviewed changes

mblondel force-pushed the jit_methods branch from 564b222 to 333f0a2 Compare December 3, 2021 12:10

mblondel added the pull ready label Dec 3, 2021

froystig approved these changes Dec 3, 2021

View reviewed changes

copybara-service bot merged commit ce485d6 into google:main Dec 3, 2021

mblondel mentioned this pull request Dec 3, 2021

Explore onp.asarray instead of jnp.asarray for scalar values in the state struct #110

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make solver.update jittable and ensure output states are consistent. #106

Make solver.update jittable and ensure output states are consistent. #106

mblondel commented Dec 1, 2021

froystig Dec 2, 2021

mblondel Dec 2, 2021

froystig Dec 2, 2021 •

edited

mblondel Dec 3, 2021

froystig Dec 3, 2021

fllinares left a comment

Make solver.update jittable and ensure output states are consistent. #106

Make solver.update jittable and ensure output states are consistent. #106

Conversation

mblondel commented Dec 1, 2021

froystig Dec 2, 2021

Choose a reason for hiding this comment

mblondel Dec 2, 2021

Choose a reason for hiding this comment

froystig Dec 2, 2021 • edited

Choose a reason for hiding this comment

mblondel Dec 3, 2021

Choose a reason for hiding this comment

froystig Dec 3, 2021

Choose a reason for hiding this comment

fllinares left a comment

Choose a reason for hiding this comment

froystig Dec 2, 2021 •

edited