
[remat] Change remat lowering to XLA::Conditional #2391

Merged
mattjj merged 2 commits into google:master from ckpt_cse on Mar 11, 2020

Conversation

trevorcai (Contributor)

`jax.remat` creates rematerializing passes that don't have data dependencies on
the actual loss-computing forward pass. This means that the XLA scheduler was
free to schedule the remat forward pass before the loss-computing pass,
defeating the goal of saving accelerator memory with `jax.remat`.

In practice, it sometimes did for my workloads.

This change expresses the lowering of `remat_call(f)` as
`Conditional(true, inputs, f, inputs, dummy_f)`.
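
A rough sketch of the same idea at the JAX level (illustration only: the actual change is in the XLA lowering rule for `remat_call`; the names `wrap_in_cond` and `dummy_f` here are hypothetical, and XLA's conditional simplifier could in principle fold a constant-predicate conditional):

```python
import jax
import jax.numpy as jnp
from jax import lax

def wrap_in_cond(f):
    """Hypothetical sketch: route f through a conditional so its body
    cannot start until all of its operands (e.g. cotangents) exist."""
    def dummy_f(*args):
        # Cheap false branch: same output shapes/dtypes as f, built
        # without running f's actual math.
        out = jax.eval_shape(f, *args)
        return jax.tree_util.tree_map(
            lambda s: jnp.zeros(s.shape, s.dtype), out)
    def wrapped(*args):
        # The predicate is always True, so f is the branch taken.
        return lax.cond(jnp.array(True), f, dummy_f, *args)
    return wrapped
```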

In the common case of `jax.grad(jax.remat(f))`, the contents of the
lowered `remat_call` are both the forward and backward passes; that is,
the incoming cotangents are part of the args.
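
For reference, that common case looks like this (a minimal usage sketch; the loss function is made up):

```python
import jax
import jax.numpy as jnp

def loss(params, x):
    # A made-up, memory-hungry forward pass: each layer's activations
    # would normally be stored for the backward pass.
    for w in params:
        x = jnp.tanh(x @ w)
    return jnp.sum(x ** 2)

# jax.remat recomputes the forward activations during the backward
# pass; with this change, that recomputation is wrapped in an XLA
# Conditional so the scheduler cannot hoist it before the loss pass.
grad_fn = jax.jit(jax.grad(jax.remat(loss)))

params = [jnp.ones((128, 128)) for _ in range(4)]
grads = grad_fn(params, jnp.ones((32, 128)))
```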

Additionally, Conditional (AFAIK) is un-inlineable in the sense that it
doesn't execute until all its inputs (e.g. cotangents!) are available.

Downsides:

- AFAICT, we can no longer interleave computation in/outside the
  rematerialized block.
- Potentially, lower performance. I do not observe this in my tests.
mattjj (Member) commented Mar 10, 2020

This looks good, but IIUC @trevorcai suggested we wait to merge until he runs more tests. Let us know!

trevorcai (Contributor, Author)

Had to include a one-line change to work around an upstream XLA bug that somehow got the parameter replication on the Conditionals wrong. I think this is ready to merge.

mattjj (Member) left a review comment

LGTM, thanks!

mattjj merged commit 620bf43 into google:master on Mar 11, 2020
srvasude pushed a commit to srvasude/jax that referenced this pull request May 5, 2020
* [remat] Change remat lowering to XLA::Conditional

* provide no replication info for subcomputation params
trevorcai deleted the ckpt_cse branch on December 1, 2020