Differentiable batch execute using TensorFlow #1542

josh146 · 2021-08-17T07:56:27Z

Context: This PR adds support for differentiable batch execution of circuits using TensorFlow following #1501 and #1508.

Description of the change:

This PR adds the following:

TensorFlow dispatch to the top-level qml.interfaces.batch.execute() function.
qml.interfaces.batch.tensorflow - a module containing a TensorFlow custom gradient function for dev.batch_execute.

Benefits:

Execute tapes in a batch, with the output remaining differentiable.
Supports gradient transforms and device execution methods
Since gradient transforms are differentiable, nth order higher derivatives are supported. Compared to PL master, this allows nth order derivatives of everything, including expval, var, probs, tensor products, non-two-term shifts, etc.

Example:

import tensorflow as tf

import pennylane as qml
from pennylane.interfaces.batch import execute

params = tf.Variable([0.1, 0.2, 0.3], dtype=tf.float64)
x = tf.Variable([0.5], dtype=tf.float64)

with tf.GradientTape() as t1:
    with tf.GradientTape() as t2:

        with qml.tape.JacobianTape() as tape1:
            qml.RX(params[0], wires=0)
            qml.RY(params[1], wires=0)
            qml.expval(qml.PauliZ(0))

        with qml.tape.JacobianTape() as tape2:
            qml.RX(params[2], wires=0)
            qml.CRY(x[0], wires=[0, 1])
            qml.CNOT(wires=[0, 1])
            qml.probs(wires=[1])

        tapes = [tape1, tape2]

        # execute both tapes in a batch on the given device
        dev = qml.device("lightning.qubit", wires=2)
        res = execute(tapes, dev, gradient_fn=qml.gradients.param_shift, interface="tf")

        loss = res[0][0] + res[1][0, 0] - res[1][0, 1]

    grad = t2.gradient(loss, [params, x])

hess = t1.jacobian(grad[0], params)

print("Loss:", loss)
print("Gradient:", grad)
print("Hessian:", hess)

gives

Loss: tf.Tensor(1.9332406126165342, shape=(), dtype=float64)

Gradient: [<tf.Tensor: shape=(3,), dtype=float64, numpy=array([-0.0978434 , -0.19767681, -0.27743179])>, <tf.Tensor: shape=(1,), dtype=float64, numpy=array([0.01070641])>]

Hessian: tf.Tensor(
[[-0.97517033  0.01983384  0.        ]
 [ 0.01983384 -0.97517033  0.        ]
 [ 0.          0.         -0.89686157]], shape=(3, 3), dtype=float64)

Potential drawbacks:

In Add a simple API for transforms that generate multiple tapes #1493, work is being done to create a standardized API for gradient transforms. Until then, we simply assume that any gradient_fn within the pennylane.gradients module is a transform.
All gradient transforms, and all device gradients (e.g., adjoint) are supported. The reversible method, however, is not currently supported, since it is not a transform nor a device method.

'Jacobians of Jacobians' (or Hessians or vector-valued cost functions) are not supported out-of-the box, because TensorFlow attempts to autograph/JIT the Jacobian computation in order to parallelize it. However, you can't JIT a function that converts Tensors -> NumPy! E.g.,

import tensorflow as tf

import pennylane as qml
from pennylane.interfaces.batch import execute

params = tf.Variable([0.1, 0.2], dtype=tf.float64)

with tf.GradientTape() as t1:
    with tf.GradientTape() as t2:

        with qml.tape.JacobianTape() as tape:
            qml.RX(params[0], wires=0)
            qml.CRY(params[1], wires=[0, 1])
            qml.CNOT(wires=[0, 1])
            qml.probs(wires=[1])

        # execute both tapes in a batch on the given device
        dev = qml.device("lightning.qubit", wires=2)
        res = execute([tape], dev, gradient_fn=qml.gradients.param_shift, interface="tf")
        res = tf.stack(res)

    grad = t2.jacobian(res, params)

hess = t1.jacobian(grad, params)

/home/josh/xanadu/pennylane/pennylane/interfaces/batch/tensorflow.py:62 execute
    [i.numpy() if isinstance(i, (tf.Variable, tf.Tensor)) else i for i in params]
/home/josh/xanadu/pennylane/pennylane/interfaces/batch/tensorflow.py:62 <listcomp>
    [i.numpy() if isinstance(i, (tf.Variable, tf.Tensor)) else i for i in params]
/home/josh/miniconda3/lib/python3.8/site-packages/tensorflow/python/framework/ops.py:401 __getattr__
    self.__getattribute__(name)

This is resolved by specifying experimental_use_pfor=False when computing the Jacobian.

Issues: n/a

Co-authored-by: Nathan Killoran <co9olguy@users.noreply.github.com>

Co-authored-by: Tom Bromley <49409390+trbromley@users.noreply.github.com>

glassnotes

Just a very quick review with a couple comments for now, but I tried executing one of the tests using the GPU and everything looks okay, the logger showed that all the operations were being run on it 🎉

pennylane/tape/unwrap.py

tests/interfaces/test_batch_tensorflow.py

Co-authored-by: Olivia Di Matteo <2068515+glassnotes@users.noreply.github.com>

glassnotes

@josh146 good to go, just caught a couple copy-paste errors in the docstrings.

glassnotes · 2021-08-20T13:55:40Z

pennylane/gradients/vjp.py

+            # corresponding element of the VJP will be zero,
+            # and we can avoid a quantum computation.
+            return [], lambda _: math.convert_like(np.zeros([num_params]), dy)
+    except AttributeError:


In what situations would allclose cause an attribute error?

Oh, so this is really annoying. Newer versions of TF will attempt to vectorize Jacobian computations by default, and as part of this vectorization process, they trace the cost function. The issue is:

you can't vectorize a quantum execution, even though TF tries 😆 So it's a very pointless step that does nothing but adds overhead.

During tracing, it sends proxy variables that have no value; e.g., you can't call dy.numpy() (since the value doesn't exist yet).

The second bullet point is the cause of the attribute error --- math.allclose is calling dy.numpy(), which doesn't exist in vectorized mode.

I attempted to rewrite math.allclose() to directly implement tf.abs(a-b) <= atol + b * rtol as per the definition, but then ran into another error - TF was complaining that a proxy variable cannot be used in a Python conditional 🙁

Basically: we need the vectorization to 'work' from TF's perspective, even though it has no effect.

tests/interfaces/test_batch_tensorflow.py

Co-authored-by: Olivia Di Matteo <2068515+glassnotes@users.noreply.github.com>

pennylane/interfaces/batch/tensorflow.py

anthayes92 · 2021-08-25T14:12:15Z

pennylane/interfaces/batch/tensorflow.py

+        tapes (Sequence[.QuantumTape]): batch of tapes to execute
+        device (.Device): Device to use to execute the batch of tapes.
+            If the device does not provide a ``batch_execute`` method,
+            by default the tapes will be executed in serial.


is the user notified of somehow when a device executes in series?

It depends on the device 🙂 Currently, no - serial is always a fallback to parallel.

pennylane/interfaces/batch/tensorflow.py

pennylane/tape/unwrap.py

anthayes92 · 2021-08-25T14:53:13Z

This looks really cool @josh146 . I have an objective to benchmark and provide user feedback on new features. This looks like a great candidate for that. Would be good to discuss some key test cases

josh146 and others added 30 commits August 4, 2021 22:38

Added differentiable VJP transform

0c57919

linting

674604b

more tests

688f4a2

linting

9a8476b

add tests

0413307

add comment

6b44284

fix

35e1848

more

67e216a

typos

d0e40f8

Apply suggestions from code review

89bdd8d

Co-authored-by: Nathan Killoran <co9olguy@users.noreply.github.com>

fixes

f415d9f

Merge branch 'master' into vjp-transform

a4592da

merge

9153c45

add tests

6c5dc72

more tests

11f20b3

renamed

122194c

typo

e98c835

Add caching to the autograd backend

5956967

more

8e3159f

Merge branch 'master' into vjp-transform

3cbfc22

more

b36ec30

more

3bd36bf

more

d644228

caching

81bd371

fix

9a19ce2

fix

44ca01d

fix tests

b4bb9d2

final

102d551

update changelog

55be8f2

update

49412da

josh146 and others added 10 commits August 18, 2021 01:26

Merge branch 'autograd-caching' into batch-tensorflow

62e4ddd

changelog

cee61e8

linting

eadfb21

comments

ec3f42c

streamlining

5cca6f2

less

a99f832

Apply suggestions from code review

77e5df1

Co-authored-by: Tom Bromley <49409390+trbromley@users.noreply.github.com>

Update pennylane/interfaces/batch/__init__.py

3d2b9b6

Merge branch 'master' into autograd-caching

34b379b

Merge branch 'autograd-caching' into batch-tensorflow

62741d3

glassnotes reviewed Aug 19, 2021

View reviewed changes

pennylane/tape/unwrap.py Outdated Show resolved Hide resolved

tests/interfaces/test_batch_tensorflow.py Outdated Show resolved Hide resolved

josh146 and others added 2 commits August 20, 2021 01:41

Update pennylane/tape/unwrap.py

a25810e

Co-authored-by: Olivia Di Matteo <2068515+glassnotes@users.noreply.github.com>

Update tests/interfaces/test_batch_tensorflow.py

0790f4e

Co-authored-by: Olivia Di Matteo <2068515+glassnotes@users.noreply.github.com>

Base automatically changed from autograd-caching to master August 20, 2021 05:47

glassnotes approved these changes Aug 20, 2021

View reviewed changes

josh146 and others added 3 commits August 20, 2021 23:58

merge master

38efba1

fix

f89f3f7

Apply suggestions from code review

5697708

Co-authored-by: Olivia Di Matteo <2068515+glassnotes@users.noreply.github.com>

josh146 requested a review from mariaschuld August 20, 2021 16:06

Merge branch 'master' into batch-tensorflow

68c0f12

josh146 mentioned this pull request Aug 25, 2021

Add a simple API for transforms that generate multiple tapes #1493

Merged

josh146 added 2 commits August 25, 2021 20:19

merge master

1141df1

Merge branch 'master' into batch-tensorflow

8f4f259

josh146 requested a review from anthayes92 August 25, 2021 14:38

anthayes92 approved these changes Aug 25, 2021

View reviewed changes

josh146 added 2 commits August 25, 2021 22:53

Apply suggestions from code review

c964056

Merge branch 'master' into batch-tensorflow

6e61771

josh146 merged commit 97a4b97 into master Aug 25, 2021

josh146 deleted the batch-tensorflow branch August 25, 2021 17:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Differentiable batch execute using TensorFlow #1542

Differentiable batch execute using TensorFlow #1542

josh146 commented Aug 17, 2021 •

edited

Loading

glassnotes left a comment

glassnotes left a comment

glassnotes Aug 20, 2021

josh146 Aug 20, 2021

josh146 Aug 20, 2021

anthayes92 Aug 25, 2021

josh146 Aug 25, 2021

anthayes92 commented Aug 25, 2021

Differentiable batch execute using TensorFlow #1542

Differentiable batch execute using TensorFlow #1542

Conversation

josh146 commented Aug 17, 2021 • edited Loading

glassnotes left a comment

Choose a reason for hiding this comment

glassnotes left a comment

Choose a reason for hiding this comment

glassnotes Aug 20, 2021

Choose a reason for hiding this comment

josh146 Aug 20, 2021

Choose a reason for hiding this comment

josh146 Aug 20, 2021

Choose a reason for hiding this comment

anthayes92 Aug 25, 2021

Choose a reason for hiding this comment

josh146 Aug 25, 2021

Choose a reason for hiding this comment

anthayes92 commented Aug 25, 2021

josh146 commented Aug 17, 2021 •

edited

Loading