IDEA: Proxy block for adjoint #2979

connorjward · 2023-06-12T09:14:02Z

One of the key performance problems with the adjoint is the cost of setting up fresh solvers as the tape is traversed. Assuming that the adjoint problem involves a time loop, many of these solvers are repeating work done by other blocks (example). I think that this problem stems from the fact that we unroll time loops on the tape and information is lost about the equivalence of solve blocks.

My suggestion is as follows:

1. Add a ProxyBlock class to pyadjoint that points to some original block.
2. Add a new adjoint kwarg to decorated functions so the functions can know if they are repeated operations or not, and hence whether or not to store themselves as proxy blocks. Something like:
```
while t < T:
    # Proposed kwarg: repeated calls sharing an id would be taped as proxy blocks.
    solve(..., ad_block_id="mysolve")
    t += dt
```
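To make the proposal concrete, here is a minimal, self-contained sketch of how a proxy block could avoid repeated solver setup. It is a toy model only: `Block`, `ProxyBlock`, `Tape`, `add_solve`, `record_time_loop` and `replay_adjoint` are hypothetical names invented for illustration and are not pyadjoint's API. It also ignores the per-step input data that a real block would have to carry.

```python
class Block:
    """Toy tape block that owns an expensive-to-build adjoint solver."""

    def __init__(self, name):
        self.name = name
        self.setup_count = 0  # how many times this block built a solver
        self._solver = None

    def evaluate_adj(self):
        # Build the solver lazily, the first time the adjoint is evaluated.
        if self._solver is None:
            self.setup_count += 1
            self._solver = f"solver-for-{self.name}"
        return self._solver


class ProxyBlock:
    """Hypothetical block that forwards all work to an original block."""

    def __init__(self, original):
        self.original = original

    def evaluate_adj(self):
        return self.original.evaluate_adj()


class Tape:
    """Toy tape that records proxies for repeated ad_block_id values."""

    def __init__(self):
        self.blocks = []
        self._by_id = {}

    def add_solve(self, ad_block_id=None):
        original = self._by_id.get(ad_block_id)
        if ad_block_id is not None and original is not None:
            # Repeated operation: point at the block recorded earlier.
            block = ProxyBlock(original)
        else:
            block = Block(ad_block_id or f"anon{len(self.blocks)}")
            if ad_block_id is not None:
                self._by_id[ad_block_id] = block
        self.blocks.append(block)
        return block


def record_time_loop(n_steps, ad_block_id=None):
    """Record one solve per time step, as an unrolled loop would."""
    tape = Tape()
    for _ in range(n_steps):
        tape.add_solve(ad_block_id=ad_block_id)
    return tape


def replay_adjoint(tape):
    """Traverse the tape in reverse and return the number of solver setups."""
    for block in reversed(tape.blocks):
        block.evaluate_adj()
    return sum(b.setup_count for b in tape.blocks if isinstance(b, Block))
```

In this toy, five steps recorded with `ad_block_id="mysolve"` need one solver setup on the reverse traversal, whereas five anonymous solves need five.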


dham · 2023-06-12T14:26:44Z

I think this is roughly where we ought to go, but I'm not sure whether a proxy block is the right way to do it.

The way we share state between forward solves right now is by having all the related solve blocks share some state information. This could be expanded to more shared state (particularly the adjoint solves).

I think it would be worth fleshing out which of these approaches is preferable. Maybe put it on the meeting agenda.

colinjcotter · 2023-06-12T14:33:05Z

I like this idea of leaving it up to the coder to decide

dham · 2023-06-12T14:36:50Z

> I like this idea of leaving it up to the coder to decide

I don't think either of these options does that. This is still the same taping process.

What is proposed is that if, e.g., a NonlinearVariationalSolver has its solve method called twice, you get either:

1. A Solve block the first time and then a Proxy block pointing at the Solve block the second time.
2. Two Solve blocks that both have an (e.g.) ._ad_block_shared_state member containing the data that is shared between the two blocks (the forward and adjoint solvers, for example).
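The second option can be sketched in plain Python as follows. This is an illustration under assumptions, not the existing pyadjoint or Firedrake implementation: `SharedSolverState`, `SolveBlock` and `ToySolver` are made-up names, and only the `_ad_block_shared_state` attribute name comes from the comment above.

```python
class SharedSolverState:
    """Hypothetical container for data shared between related solve blocks."""

    def __init__(self):
        self.adjoint_solver = None
        self.adjoint_setups = 0  # how many times the adjoint solver was built

    def get_adjoint_solver(self):
        # Build the (stand-in) adjoint solver once and reuse it afterwards.
        if self.adjoint_solver is None:
            self.adjoint_setups += 1
            self.adjoint_solver = object()
        return self.adjoint_solver


class SolveBlock:
    """Toy solve block: one per solve call, all sharing one state object."""

    def __init__(self, step, shared_state):
        self.step = step  # per-block data stays on the block itself
        self._ad_block_shared_state = shared_state

    def evaluate_adj(self):
        return self._ad_block_shared_state.get_adjoint_solver()


class ToySolver:
    """Stand-in for a solver object whose solve() is called repeatedly."""

    def __init__(self):
        self._shared = SharedSolverState()

    def solve(self, tape, step):
        # Every call records a full SolveBlock, but hands it the same state.
        tape.append(SolveBlock(step, self._shared))


tape = []
solver = ToySolver()
for step in range(4):
    solver.solve(tape, step)

# Reverse traversal: every block finds the same adjoint solver.
adjoint_solvers = [block.evaluate_adj() for block in reversed(tape)]
```

Unlike the proxy variant, every entry on the tape remains an ordinary solve block with its own per-step data; only the expensive objects are shared.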

connorjward · 2023-06-12T15:21:56Z

A related pipe dream of mine is for us to employ enough smart caching that calling the solve function performs nearly as well as creating and reusing solvers.
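As a rough illustration of what such caching might look like, the sketch below keeps solvers in a dictionary keyed on a description of the problem, so repeated calls to a free `solve` function reuse the same solver. Everything here is hypothetical (`ToySolver`, `SolverCache`, and the use of a single float coefficient as the cache key). In practice the hard part would be choosing a key that reliably identifies equivalent problems, which this toy sidesteps.

```python
class ToySolver:
    """Stand-in for an expensive-to-build solver of coefficient * u = rhs."""

    def __init__(self, coefficient):
        self.coefficient = coefficient

    def solve(self, rhs):
        return rhs / self.coefficient


class SolverCache:
    """Hypothetical cache that lets a free solve() reuse equivalent solvers."""

    def __init__(self):
        self._solvers = {}
        self.setups = 0  # number of solvers actually constructed

    def solve(self, coefficient, rhs):
        # The coefficient plays the role of a problem signature here.
        solver = self._solvers.get(coefficient)
        if solver is None:
            self.setups += 1
            solver = self._solvers[coefficient] = ToySolver(coefficient)
        return solver.solve(rhs)


cache = SolverCache()
# A time loop that calls solve() afresh on every step.
results = [cache.solve(2.0, float(t)) for t in range(4)]
```

Only the first call builds a solver; the remaining steps hit the cache. A different coefficient would trigger a second setup.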

This would all be interesting to discuss in this week's meeting.

connorjward · 2023-06-14T19:30:15Z

The conclusion from this week's meeting is that a proxy block like this is practically equivalent to creating and reusing a solver object, since reusing the solver naturally connects the solve blocks.

I still want to find ways to optimise solver instantiation such that these strategies aren't required, but this specific ProxyBlock idea isn't the answer.

connorjward added the enhancement, performance, and firedrake-adjoint labels on Jun 12, 2023

connorjward closed this as completed Jun 14, 2023
