Add device and gradient expansions to the new batch-execution pipeline #1651

josh146 · 2021-09-14T12:11:51Z

Context: The beta QNode, introduced in #1642, uses the new batch-execution pipeline internally, but does not yet support decompositions of circuits.

This PR adds circuit decomposition support, but takes a different approach from the existing QNode.

The existing QNode, during construction, queries the device to see what operations it supports. It then expands the tape so as to only use operations native to the device. However, this leads to significant drawbacks, since this occurs prior to gradient rules being applied. In many cases, it is more efficient to apply the gradient rules first, and decompose down to device-supported gates at execution time.

For example, consider the DoubleExcitation operation. This operation decomposes down into eight parametrized RY gates, so in the existing pipeline it would require 16 circuit evaluations to compute the gradient:

DoubleExcitation(theta) -> 8 RY(±theta) + 13 CNOT + 6 H -> [parameter-shift] -> 16 parameter-shift circuits

However, the DoubleExcitation operation supports a 4-term parameter-shift rule natively. Performing the device expansion later is therefore preferable:

DoubleExcitation(theta) -> [parameter-shift] -> 4 parameter-shift circuits -> decompose each down to RY, CNOT, H

Thus, in this PR, we add gradient-specific decomposition to the QNode construction step, and move device-specific expansions to the device.
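The counting argument above can be checked with a toy model. This is a minimal sketch and not PennyLane code: the tables `SHIFT_TERMS` and `DECOMPOSITION` are hypothetical stand-ins, with the term counts taken from the description (a 4-term rule for DoubleExcitation, two shifted circuits per RY gate).

```python
# Toy bookkeeping only; no PennyLane calls are made.

# Number of shifted circuits each gate needs under its own parameter-shift rule.
SHIFT_TERMS = {"DoubleExcitation": 4, "RY": 2}

# Parametrized gates produced by decomposing each gate. The non-parametrized
# CNOT and H gates are omitted because they contribute no shifted circuits.
DECOMPOSITION = {"DoubleExcitation": ["RY"] * 8}


def circuits_expand_first(gate):
    """Decompose to device gates first, then shift every resulting gate."""
    return sum(SHIFT_TERMS[g] for g in DECOMPOSITION[gate])


def circuits_shift_first(gate):
    """Apply the gate's native shift rule; decompose each circuit afterwards."""
    return SHIFT_TERMS[gate]


print(circuits_expand_first("DoubleExcitation"))  # 16
print(circuits_shift_first("DoubleExcitation"))   # 4
```

The 4x reduction applies per trainable DoubleExcitation parameter, so it compounds in circuits that contain many such gates.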

Description of the Change:

Two new methods were added to the Device API:
- Device.expand_fn(tape) -> tape: expands a tape such that it is supported by the device. By default, performs the standard device-specific gate set decomposition done in the default QNode. Can be overwritten by the device. Note that the output is 1-1; the expanded tape returns exactly the same value as the original tape, no post-processing required.
- Device.batch_transform(tape) -> tapes, processing_fn: pre-processes the tape in the case where the device needs to generate multiple circuits to execute from the input circuit. The requirement of a post-processing function makes this distinct from the expand_fn method above. By default, applies the transform
```
expval(\sum_i ci hi) -> \sum_i ci expval(hi)
```
  for devices that do not natively support Hamiltonians with non-commuting terms.
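To illustrate the `tapes, processing_fn` contract, here is a minimal sketch of the Hamiltonian split in plain Python. It assumes nothing about PennyLane internals: `split_hamiltonian`, `fake_execute`, the tuple "tapes", and the lookup values are all hypothetical.

```python
def split_hamiltonian(coeffs, observables):
    """Return one toy 'tape' per term plus a function that recombines results."""
    tapes = [("expval", obs) for obs in observables]

    def processing_fn(results):
        # Weighted sum of the per-term expectation values.
        return sum(c * r for c, r in zip(coeffs, results))

    return tapes, processing_fn


def fake_execute(tapes):
    """Hypothetical device: returns a fixed expectation value per observable."""
    lookup = {"Z0": 1.0, "X1": 0.0, "Z0 Z1": -1.0}
    return [lookup[obs] for _, obs in tapes]


tapes, processing_fn = split_hamiltonian([0.5, 2.0, 0.25], ["Z0", "X1", "Z0 Z1"])
energy = processing_fn(fake_execute(tapes))
print(len(tapes), energy)  # 3 0.25
```

The post-processing step is what separates this from `expand_fn`: one input circuit becomes several, and the results have to be recombined before they are returned.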

- At the end of QNode.construct(), we call gradient_fn.expand_fn(tape) to expand the circuit so that all operations present have gradient rules defined. This only applies if a gradient transform is being used; the gradient transform specifies the expansion logic. E.g., DoubleExcitation defines a gradient recipe for parameter-shift and so will be left as is, but StronglyEntanglingLayers doesn't, and so will be expanded.
- Within QNode.__call__, prior to execute(tapes) being called, we apply the device's batch_transform.
- qml.execute() is modified to ensure that device.expand_fn(tape) is called before passing a tape on to a device for execution.
- All templates have grad_method=None added, to specify that they do not have a gradient method. This will trigger a decomposition into operations that do have a gradient method. This change is required because grad_method="F" by default!! Which is a silly default :(
- Unit tests have been added to tests/beta/test_beta_qnode.py, and integration tests added to tests/interfaces/batch/test_batch_interface_qnode.py.
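The two expansion stages described above can be sketched with a toy model. Everything here is an assumption made for illustration: the gate tables, the decompositions (not the real ones), and the helper names `gradient_expand` and `device_expand` are hypothetical and do not mirror the actual implementation.

```python
# Hypothetical tables; the decompositions are simplified for illustration.
HAS_GRAD_RULE = {"DoubleExcitation", "Rot", "RY", "RZ"}
NON_PARAMETRIZED = {"CNOT", "H"}
DEVICE_GATES = {"RY", "RZ", "CNOT", "H"}
DECOMPOSITIONS = {
    "StronglyEntanglingLayers": ["Rot", "CNOT"],
    "DoubleExcitation": ["RY", "CNOT", "H"],
    "Rot": ["RZ", "RY", "RZ"],
}


def gradient_expand(ops):
    """Stage 1 (QNode construction): expand until every op is differentiable."""
    out = []
    for name in ops:
        if name in HAS_GRAD_RULE or name in NON_PARAMETRIZED:
            out.append(name)
        else:
            out.extend(gradient_expand(DECOMPOSITIONS[name]))
    return out


def device_expand(ops):
    """Stage 2 (execution time): expand until every op is device-native."""
    out = []
    for name in ops:
        if name in DEVICE_GATES:
            out.append(name)
        else:
            out.extend(device_expand(DECOMPOSITIONS[name]))
    return out


after_gradient = gradient_expand(["DoubleExcitation", "StronglyEntanglingLayers"])
print(after_gradient)                 # ['DoubleExcitation', 'Rot', 'CNOT']
print(device_expand(after_gradient))  # ['RY', 'CNOT', 'H', 'RZ', 'RY', 'RZ', 'CNOT']
```

In this toy, DoubleExcitation survives the first stage because it has its own gradient rule, and only gets broken down once the shifted circuits reach the device.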

Benefits:

- The new QNode will now automatically decompose templates/operations not supported by the device.
- If a template/operation has a gradient rule, but is not supported by the device, the gradient logic will be applied prior to decomposition, leading to a significant reduction in circuit evaluations. In particular, AllSinglesDoubles will result in far fewer circuit executions.
- Devices are now in control of tape expansion, and device developers can overwrite Device.expand_fn and Device.batch_transform for full control of circuit manipulation.

Possible Drawbacks:

- I originally wanted to call the new method Device.expand(), but this is already taken by an existing device 🤦
- Previously, circuit decomposition for the device gate set was done only once in QNode.construct(). However, now that expansion is moved to the device, it happens with every execution. While this has quantum savings with respect to gradient executions, it results in a classical overhead: the decompositions now happen every time a QNode is called.
- The Operation.grad_method attribute is really old, and predates a lot of quantum gradient research. As a result, it is not flexible enough for our needs. The following options are allowed:
  - grad_method=None: this operation does not support gradients, attempt to decompose it
  - grad_method="F": this operation only supports finite-diff, using parameter-shift will raise an error
  - grad_method="A": this operation supports both parameter-shift and finite-diff

  However, we are missing an option for:
  - This operation supports finite-diff but not parameter-shift. Decompose it for parameter-shift support.

  This latter behaviour is needed for operations such as ApproxTimeEvolution(H, t, L). This operation can be differentiated using finite differences to get a gradient with only 2 evaluations. For the parameter-shift, it can be decomposed, requiring O(2NL) (I think?) evaluations to get the gradient. However, there is no value of grad_method we can set that will 'unlock' this behaviour currently.
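The three grad_method options listed above amount to a small dispatch table. The following is a paraphrase of those listed semantics as a sketch, not the actual PennyLane logic; `plan_gradient` and its return strings are hypothetical.

```python
def plan_gradient(grad_method, diff_method):
    """Return the action implied by an operation's grad_method.

    diff_method is either "parameter-shift" or "finite-diff".
    """
    if grad_method is None:
        # No gradient support: try to decompose the operation.
        return "decompose"
    if grad_method == "A":
        # Supports both parameter-shift and finite-diff.
        return "differentiate"
    if grad_method == "F":
        if diff_method == "finite-diff":
            return "differentiate"
        raise ValueError("operation only supports finite-diff")
    raise ValueError(f"unknown grad_method: {grad_method!r}")


for method in (None, "A", "F"):
    try:
        print(method, plan_gradient(method, "parameter-shift"))
    except ValueError as exc:
        print(method, "error:", exc)
```

No input makes this function return "differentiate" for finite-diff and "decompose" for parameter-shift at the same time, which is exactly the gap described for ApproxTimeEvolution.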

Related GitHub Issues: n/a

doc/releases/changelog-dev.md

pennylane/beta/qnode.py

anthayes92

I still need to go through the tests for interfaces from autograd onward.

anthayes92 · 2021-09-21T19:42:20Z

doc/releases/changelog-dev.md

@@ -106,21 +107,52 @@
    significant performance improvement when executing the QNode on remote
    quantum hardware.

+  - When decomposing the circuit, the default decomposition strategy will prioritize


When do decompositions generally occur? E.g., if I was running a simple optimisation with a PL circuit, at what level does this happen on hardware/simulator?

expansion_strategy="device": decomposition happens in the QNode, during construction, by querying the device for its supported gate set. This is beneficial in terms of overhead (since the decomposition only happens once), but results in future quantum transforms/compilations working with a potentially very big/deep circuit.

expansion_strategy="gradient": decomposition happens in the QNode, during construction, by querying the gradient transform. Typically, the decomposed circuit will not be as deep as the device-decomposed one, since a lot of complex unitaries have gradient rules defined. Later on, further decompositions may be required on the device to get the circuit down to native gate sets.

This is beneficial in terms of a reduction in quantum resources, at the expense of moving the device decomposition down to every evaluation of the device (so additional classical overhead).

> This is beneficial in terms of a reduction in quantum resources, at the expense of moving the device decomposition down to every evaluation of the device (so additional classical overhead).

Seems to be yet another benefit for caching parametric circuits to reuse device translations.

Yep, precisely 💯 I would even argue, this is only fully solved by parametric compilation.
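The caching idea raised here could look roughly like the following sketch. It is purely illustrative and assumes the expansion depends only on the circuit structure (gate names), not on parameter values; `expand_structure`, `execute`, and the `DECOMPOSITIONS` table are hypothetical names, not existing PennyLane functionality.

```python
from functools import lru_cache

DECOMPOSITIONS = {"Rot": ("RZ", "RY", "RZ")}  # hypothetical device rule
expansion_count = 0


@lru_cache(maxsize=None)
def expand_structure(gate_names):
    """Expand a tuple of gate names; cached per distinct circuit structure."""
    global expansion_count
    expansion_count += 1
    expanded = []
    for name in gate_names:
        expanded.extend(DECOMPOSITIONS.get(name, (name,)))
    return tuple(expanded)


def execute(gate_names, theta):
    # Parameter values are deliberately kept out of the cache key, so every
    # call with the same structure reuses the same translation.
    return expand_structure(tuple(gate_names)), theta


for theta in (0.1, 0.2, 0.3):
    execute(["Rot", "CNOT"], theta)

print(expansion_count)  # 1
```

Three executions with different parameter values trigger a single expansion, which is the classical overhead a parametric cache would save.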

Followup question: suppose I do something like

    @qml.compile()
    @qml.qnode(dev, expansion_strategy="device")
    def some_qnode():
        # stuff

(or alternatively the gradient strategy). When does decomposition happen currently vs. in this new PR w.r.t. the compilation transform? As you suggest @josh146 we would want compilation to happen before either the device or gradient strategy, so that compilation is acting on a smaller circuit rather than the full-depth expanded one, and consequently leading to a smaller circuit that gets expanded / gradient transformed. (That said, it's possible that a decomposition leads to optimizations in the compilation pipeline that might not otherwise be recognized...)

Thanks for the detailed explanation @josh146, that makes things very clear! The gradient transform continues to impress me!

@glassnotes, correct me if wrong, but compile() is a qfunc transform, not a QNode transform? So the following order is needed:

    @qml.qnode(dev, expansion_strategy="device")
    @qml.compile()
    def some_qnode():
        # stuff

and just based on the ordering, the compile transform would always occur prior to the QNode's expansions.

Yes you're 100% correct, my bad 🤦‍♀️
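The ordering argument in this thread follows from how Python stacks decorators: the one closest to the function is applied first. Here is a small sketch using stand-ins; `compile_like` and `QNodeLike` are hypothetical and only mimic the shape of a qfunc transform (function in, function out) and a QNode (function in, callable object out).

```python
log = []


def compile_like(fn):
    """Stand-in for a qfunc transform: takes a function, returns a function."""
    def wrapper(*args, **kwargs):
        log.append("compile")
        return fn(*args, **kwargs)
    return wrapper


class QNodeLike:
    """Stand-in for a QNode: wraps a function in a callable object."""

    def __init__(self, fn):
        self.fn = fn

    def __call__(self, *args, **kwargs):
        log.append("construct")
        result = self.fn(*args, **kwargs)  # runs the already-transformed qfunc
        log.append("expand")               # expansion happens on the result
        return result


@QNodeLike
@compile_like
def circuit(x):
    log.append("qfunc")
    return x


circuit(0.5)
print(log)  # ['construct', 'compile', 'qfunc', 'expand']
```

Because `compile_like` wraps the plain function before `QNodeLike` ever sees it, the compile step always runs before the expansion step in this toy, whichever strategy is chosen.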

anthayes92 · 2021-09-21T19:45:46Z

doc/releases/changelog-dev.md

+
+  - `Device.expand_fn(tape) -> tape`: expands a tape such that it is supported by the device. By
+    default, performs the standard device-specific gate set decomposition done in the default
+    QNode. Devices may overwrite this method in order to define their own decomposition logic.


can a user overwrite this logic?

Oh, this is a good point 🤔

At the moment yes, but it looks a bit hacky. You could do something like this:

    >>> def my_custom_expand_fn(tape, **kwargs):
    ...     print("hello")
    ...     return tape
    >>> qnode.device.expand_fn = my_custom_expand_fn
    >>> qnode(0.5)
    hello
    tensor(0.87758256, requires_grad=True)

Hmmm 🤔 Do you think this will be useful?

@glassnotes I think @anthayes92 might be on to something re: custom decompositions....

I mean, we could even support something like how you currently 'register' QNode execution wrappers while writing a batch transform

    dev = qml.device("default.qubit", wires=2)

    @dev.custom_expand
    def my_expansion_function(tape, **kwargs):
        ...
        return tape

    # from now on, the custom expansion is called whenever
    # the device is executed.

This is more powerful (too powerful?) compared to a dictionary of gates to custom decompositions. But I still have some question marks:

Should this replace the device expansion?

If it doesn't, does it come before the device expansion? This way unsupported gates are finally decomposed down to device native gates. Or should it come after the device expansion? Execution would then fail if the custom decomp results in an unsupported gate.

Rather than changing the device, does it make more sense to pass a custom decomposition to the QNode? E.g.,

    @qml.qnode(dev, expansion_strategy=my_custom_expansion)

    # or
    @existing_qnode.register_expansion
    def my_custom_expansion(...):

> This is more powerful (too powerful?) compared to a dictionary of gates to custom decompositions.

How would custom decompositions be specified in these cases?

What if we did something like this, which combines a few of the ideas floating around:

    custom_decomps = {qml.Hadamard : h_func, qml.CNOT : cnot_func}

    def custom_expand(tape, custom_decomps):
        # applies custom decompositions

    qnode.device.set_expand_fn(custom_expand)

but where the device itself does some sort of internal validation of the decompositions?

    def set_expand_fn(custom_decomps):
        for op, decomp in custom_decomps.items():
            # Ensure all the operations in the decomposition are valid for the device
            ...
            # Register the new decompositions to the operators
            if decomp_is_valid:
                op.register_new_decomposition(decomp)

If we do this kind of validation, it ensures that we can apply the expansion after the gradient tapes have already been constructed, but with the guarantee that they'll still run on the device.
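A self-contained toy version of this validate-then-register idea is sketched below. It is not a proposal for the real API: `ToyDevice`, `set_custom_decomps`, and the gate-name strings are hypothetical, and gates are represented by names only.

```python
class ToyDevice:
    """Hypothetical device that validates custom decompositions up front."""

    def __init__(self, supported):
        self.supported = set(supported)
        self.custom_decomps = {}

    def set_custom_decomps(self, custom_decomps):
        # Validate everything before registering anything, so a bad entry
        # cannot leave the device half-configured.
        for op, decomp in custom_decomps.items():
            unsupported = [g for g in decomp if g not in self.supported]
            if unsupported:
                raise ValueError(f"{op} decomposes into unsupported gates: {unsupported}")
        self.custom_decomps.update(custom_decomps)

    def expand_fn(self, ops):
        """Replace each op by its registered decomposition, if it has one."""
        out = []
        for op in ops:
            out.extend(self.custom_decomps.get(op, [op]))
        return out


dev = ToyDevice({"RZ", "RX", "CZ"})
dev.set_custom_decomps({"Hadamard": ["RZ", "RX", "RZ"]})
print(dev.expand_fn(["Hadamard", "CZ"]))  # ['RZ', 'RX', 'RZ', 'CZ']
```

Since every registered decomposition only contains supported gates, applying `expand_fn` after the gradient tapes are built cannot introduce a gate the device rejects.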

I really like @glassnotes's suggestion of having expansion after the gradient tapes have already been constructed! So would the logic here look like: if custom gates are unsupported then decompose to device native gates, so that this is where the guarantee they'll still run on the device comes from?

pennylane/_device.py

pennylane/beta/qnode.py

anthayes92 · 2021-09-21T20:24:24Z

pennylane/gradients/gradient_transform.py

+        params = new_tape.get_parameters(trainable_only=False)
+        new_tape.trainable_params = qml.math.get_trainable_indices(params)


is this related to the Unwrap issue?

tests/beta/test_beta_qnode.py

glassnotes

Left some initial questions, will give things some time to sink in and come back to it later 🙂

glassnotes · 2021-09-22T12:31:22Z

doc/releases/changelog-dev.md

pennylane/_device.py

pennylane/beta/qnode.py

glassnotes · 2021-09-22T12:40:06Z

pennylane/gradients/gradient_transform.py

+        params = new_tape.get_parameters(trainable_only=False)
+        new_tape.trainable_params = qml.math.get_trainable_indices(params)


So a lot of the current logic is simply guided by 'this causes the tests to pass, for all interfaces, for all QNode variations, for all order derivatives, for all differentiation methods'.

This is how I feel any time I have to write interface tests 😓

glassnotes · 2021-09-22T12:44:04Z

pennylane/templates/layers/strongly_entangling.py

@@ -67,6 +67,7 @@ class StronglyEntanglingLayers(Operation):
    num_params = 1
    num_wires = AnyWires
    par_domain = "A"
+    grad_method = None


Re. your comment in the PR description about grad methods,

> This operation supports finite-diff but not parameter-shift. Decompose it for parameter-shift support.

Could we make this parameter an ordered list of grad methods, or tuples with grad methods and additional info? For example, (grad_method, requires_decomposition_to_do_grad_method)?

Yes definitely! The grad_method and grad_recipe attributes are long overdue for an overhaul. I believe they're on the agenda as part of the operator refactor
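One way to read the tuple suggestion in this thread is sketched below. It is only an interpretation: the `GRAD_SUPPORT` table, its entries, and `plan_for` are hypothetical, and nothing like this exists on Operation today.

```python
# Hypothetical per-operation table: ordered (grad_method, needs_decomposition).
GRAD_SUPPORT = {
    "ApproxTimeEvolution": [("finite-diff", False), ("parameter-shift", True)],
    "DoubleExcitation": [("parameter-shift", False), ("finite-diff", False)],
}


def plan_for(op_name, diff_method):
    """Look up what to do for the requested differentiation method."""
    for method, needs_decomposition in GRAD_SUPPORT[op_name]:
        if method == diff_method:
            return "decompose" if needs_decomposition else "differentiate"
    raise ValueError(f"{op_name} does not support {diff_method}")


print(plan_for("ApproxTimeEvolution", "finite-diff"))      # differentiate
print(plan_for("ApproxTimeEvolution", "parameter-shift"))  # decompose
```

With per-method entries, the ApproxTimeEvolution case from the PR description becomes expressible: differentiate directly under finite-diff, decompose first under parameter-shift.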

glassnotes · 2021-09-22T12:47:01Z

tests/beta/test_beta_qnode.py

+            num_params = 1
+            par_domain = "R"
+
+            def expand(self):


Unrelated to this PR, but should all operations eventually have their decomposition method replaced by an expand like this?

To be decided 🤔 Another task for the Operator refactor story 😆

anthayes92

A few small suggestions, otherwise looking great!

tests/interfaces/test_batch_autograd_qnode.py

tests/interfaces/test_batch_tensorflow_qnode.py

tests/interfaces/test_batch_autograd_qnode.py

tests/interfaces/test_batch_torch_qnode.py

anthayes92 · 2021-09-22T14:31:19Z

pennylane/_device.py

glassnotes · 2021-09-23T14:38:37Z

josh146 added 30 commits September 10, 2021 15:41

Bug fixes for batch execution

b2c4baf

more tests

317514b

more tests

a937d4d

more tests

c6dc629

more tests

0a5756a

more tests

490f106

changelog

fa10491

Add metric tensor

d70f0c8

fixes

e0b45f5

add test

523b765

Merge branch 'master' into batch-bug-fixes

f8af2d2

Integrate batch execution into a QNode

754bbe7

update

bb6626c

update

377d8f3

remove xfail

7c400cc

Merge branch 'master' into batch-bug-fixes

57f83a0

Merge branch 'master' into batch-bug-fixes

0852693

more tests

abc543a

Merge branch 'master' into batch-bug-fixes

852f2fe

rever

2a3c67b

Add more tests

4e54a26

revert

0129cb7

Merge branch 'batch-qnode' into batch-qnode-interfaces

5d1f0b3

fix

a79c827

Merge branch 'batch-bug-fixes' into batch-metric-tensor

63f2990

Merge branch 'batch-metric-tensor' into batch-qnode

74a8df0

Merge branch 'batch-qnode' into batch-qnode-interfaces

1d35677

fix

59b9d7d

Merge branch 'batch-qnode' into batch-qnode-interfaces

31b0ff3

fix

23f8433

josh146 requested a review from glassnotes September 21, 2021 13:20

Base automatically changed from batch-qnode-interfaces to master September 21, 2021 14:59

josh146 added 6 commits September 21, 2021 23:06

merge master

19c3fe6

update changelog

4d54374

linting

0da517f

changelog

8d0880d

another test

141671a

Merge branch 'master' into batch-qnode-expand

d46fb3a

josh146 commented Sep 21, 2021

doc/releases/changelog-dev.md

josh146 commented Sep 21, 2021

doc/releases/changelog-dev.md

josh146 commented Sep 21, 2021

pennylane/beta/qnode.py

Apply suggestions from code review

a5f6424

anthayes92 reviewed Sep 21, 2021

josh146 commented Sep 22, 2021

tests/beta/test_beta_qnode.py

Apply suggestions from code review

af9c4c4

glassnotes reviewed Sep 22, 2021

anthayes92 approved these changes Sep 22, 2021

josh146 commented Sep 22, 2021

pennylane/_device.py

josh146 and others added 5 commits September 22, 2021 22:41

Apply suggestions from code review

94de146

Co-authored-by: Olivia Di Matteo <2068515+glassnotes@users.noreply.github.com>

Apply suggestions from code review

61383be

Co-authored-by: anthayes92 <34694788+anthayes92@users.noreply.github.com>

Merge branch 'master' into batch-qnode-expand

4478944

Update tests/interfaces/test_batch_torch_qnode.py

d78f5a4

Co-authored-by: anthayes92 <34694788+anthayes92@users.noreply.github.com>

Merge branch 'master' into batch-qnode-expand

8b40dfa

glassnotes approved these changes Sep 23, 2021

josh146 added 4 commits September 23, 2021 23:59

Merge branch 'master' into batch-qnode-expand

46d01bb

add grad_method=None to all templates

6bdcd21

fix

320b468

fix

bb640fe

josh146 merged commit 324192d into master Sep 23, 2021

josh146 deleted the batch-qnode-expand branch September 23, 2021 17:08
