Add adjoint hessian called tfq.math.inner_product_hessian #530

Open · wants to merge 13 commits into master
Conversation

@jaeyoo (Member) commented Apr 3, 2021

This PR adds tfq.math.inner_product_hessian(), based on an adjoint reverse-mode Hessian calculation. It is independent of TensorFlow's Jacobian routine, so you can get the Hessian directly, without tf.GradientTape.

Note: because second-order finite differencing on cirq.PhasedXPowGate incurs large numerical error, the op raises an error if any input circuit contains that gate.

Instead of taking upstream gradient values, it accepts float weight values for programs[i] and other_programs[i][j], which can be used to form any linear combination of the Hessian terms; pass plain tf.ones() for the bare values.
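For concreteness, a hedged usage sketch; the argument order and names for the weights are inferred from the description above and may differ from the op as finally merged:

import cirq
import sympy
import tensorflow as tf
import tensorflow_quantum as tfq

# Hypothetical example inputs: one parameterized program and one
# fixed "other" program to take the inner product against.
q = cirq.GridQubit(0, 0)
alpha = sympy.Symbol('alpha')
programs = tfq.convert_to_tensor([cirq.Circuit(cirq.rx(alpha).on(q))])
other_programs = tfq.convert_to_tensor([[cirq.Circuit(cirq.H(q))]])
symbol_names = tf.constant(['alpha'])
symbol_values = tf.constant([[0.25]])

# tf.ones() weights recover the bare (unweighted) Hessian terms.
hess = tfq.math.inner_product_hessian(
    programs, symbol_names, symbol_values, other_programs,
    tf.ones([1]),     # weight on programs[0]
    tf.ones([1, 1]))  # weights on other_programs[0][j]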

@jaeyoo force-pushed the inner_prod_hessian_wo_phased_x_pow_gate branch from dec4520 to 0141262 on April 3, 2021 01:06
@jaeyoo (Member, Author) commented Apr 4, 2021

FYI: this op is 20x~100x faster than the Cirq-based Hessian calculation used in the unit test file.
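For context, the Cirq baseline is essentially a second-order central finite difference over state-vector inner products. A rough sketch of that kind of reference, not the actual test code:

import cirq
import numpy as np

# Rough sketch of a finite-difference reference Hessian of
# <psi(values)|phi>, taken over every pair of symbols.
def fd_inner_product_hessian(circuit, symbols, values, other_circuit, eps=1e-3):
    phi = cirq.final_state_vector(other_circuit)

    def ip(vals):
        resolver = cirq.ParamResolver(dict(zip(symbols, vals)))
        psi = cirq.final_state_vector(circuit, param_resolver=resolver)
        return np.vdot(psi, phi)  # <psi|phi>

    n = len(symbols)
    hess = np.zeros((n, n), dtype=np.complex128)
    for i in range(n):
        for j in range(n):
            def shifted(di, dj):
                v = np.array(values, dtype=float)
                v[i] += di * eps
                v[j] += dj * eps
                return ip(v)
            hess[i, j] = (shifted(1, 1) - shifted(1, -1)
                          - shifted(-1, 1) + shifted(-1, -1)) / (4 * eps ** 2)
    return hess

Each entry costs four simulations here (4*n^2 in total), and the 4*eps**2 denominator amplifies floating-point noise, the same effect behind the PhasedXPowGate caveat above.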

@MichaelBroughton (Collaborator) commented Apr 5, 2021

Hi Jae, thanks for writing all of this! I have a few high-level questions/comments:

1. > It is independent of TensorFlow's Jacobian routine, so you can get the Hessian directly, without tf.GradientTape.

   I think we might want to have it be a part of tf.GradientTape, so that users who already know how to do things like:

   with tf.GradientTape() as outer_tape:
       with tf.GradientTape() as inner_tape:
           loss = ...  # second-order stuff
       grads = inner_tape.gradient(loss, symbol_values)
   hessian = outer_tape.jacobian(grads, symbol_values)

   or https://www.tensorflow.org/api_docs/python/tf/hessians could just use those with this op and have it work. I think this means you might need to register another custom_gradient for the gradient op of the forward-pass op (see the sketch after this list).

2. It might make sense to add tests for adj_hessian_util.* as well. In fact, this PR might be easier to review if we first opened a PR with just adj_hessian_util.* and its tests, and then moved on to this op PR. Smaller PRs tend to be better.

3. > because second-order finite differencing on cirq.PhasedXPowGate incurs large numerical error, the op raises an error if any input circuit contains that gate.

   Did you try using the analytic form of its gradient gate instead of finite differences?

4. > this op is 20x~100x faster than the Cirq-based Hessian calculation used in the unit test file.

   Nice!
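Regarding point 1, a minimal sketch of that nesting (toy names, not the TFQ kernels): the forward op registers a custom gradient, and the gradient op registers its own custom gradient, so two stacked tapes or tf.hessians can reach a hand-written second-order kernel. Here cube() stands in for the inner-product op, with the analytic 3x**2 and 6x playing the roles of the adjoint gradient and adjoint Hessian ops:

import tensorflow as tf

@tf.custom_gradient
def cube(x):
    def first_order(dy):
        @tf.custom_gradient
        def grad_op(x_inner):
            def second_order(ddy):
                # Stand-in for the adjoint Hessian kernel.
                return ddy * 6.0 * x_inner
            # Stand-in for the adjoint gradient kernel.
            return 3.0 * x_inner ** 2, second_order
        return dy * grad_op(x)
    return x ** 3, first_order

x = tf.constant(2.0)
with tf.GradientTape() as outer:
    outer.watch(x)
    with tf.GradientTape() as inner:
        inner.watch(x)
        y = cube(x)
    g = inner.gradient(y, x)  # 3 * 2.0**2 = 12.0
h = outer.gradient(g, x)      # 6 * 2.0 = 12.0, via second_order()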

jaeyoo pushed a commit to jaeyoo/quantum that referenced this pull request Mar 30, 2023