[MLIR] Enzyme-driven gradients #244
Conversation
…adient architecture
- WIP conversion to DPS of all generated functions
- Change tape type of custom grad to null ptr vs struct (was causing Enzyme to crash)
- Read values from param vec within the withparams modified QNode (was causing segfaults from dereferencing poison vals)
…ng the per-QNode lowering patterns
… a device is present
… tests with method=defer
Amazing job 🥇 Happy to approve the PR but I would like @dime10 to review it as well before merging
This is great work @pengmai! Super excited to have it in 🎉 A quick question: will we also need to update https://docs.pennylane.ai/projects/catalyst/en/latest/dev/quick_start.html#calculating-quantum-gradients?
Thanks Josh!
This PR should be totally backwards compatible, so the quick start shouldn't require any changes to work, unless you wanted to highlight the new features.
This is amazing work 💯
A few comments and questions from my side, but nothing major!
Because this PR is so beefy though, I think it might be really helpful if the PR description contained a bullet-point list of what was changed in existing code, what was added, and what some quirks or compromises of the implementation are. It doesn't need a lot of explanation, just enough that we have a quick overview of everything that was undertaken.
…e-gradient-architecture
- Also update comment around BackpropOp copying cotangents
Co-authored-by: David Ittah <dime10@users.noreply.github.com>
…neAI/catalyst into jmp/enzyme-gradient-architecture
Co-authored-by: David Ittah <dime10@users.noreply.github.com>
Thanks for the overview in the description, very helpful 💯
🧬
**Context:** This PR implements some documentation changes that follow up on #244, particularly [this comment](#244 (comment)).

**Description of the Change:** Update the docstrings for `grad` and `jacobian`, and rename `"defer"` to `"auto"`.

Co-authored-by: David Ittah <dime10@users.noreply.github.com>
**Context:** The current gradient architecture does not support exact gradient computation for hybrid programs with classical postprocessing or multiple QNodes.

**Description of the Change:** A reworking of Catalyst's gradient architecture to be driven by Enzyme. Quantum functions are split into their purely classical preprocessing and quantum parts as before, but the differentiation of the end-to-end circuit is now done by Enzyme, with the registered quantum gradients acting as custom gradients.
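To picture the idea, here is a minimal pure-Python sketch (the function names, the toy cosine-expectation "circuit", and the use of the parameter-shift rule are all illustrative assumptions, not Catalyst's actual API): the classical pre- and postprocessing are differentiated by ordinary chain-rule AD (Enzyme's job in the real architecture), while the quantum part contributes a registered custom gradient.

```python
import math

# Toy "quantum" part: pretend the expectation value is cos(theta).
# In the real architecture this would be the split-out quantum function.
def quantum(theta):
    return math.cos(theta)

# Custom gradient for the quantum part via the parameter-shift rule
# (illustrative choice; Catalyst registers its quantum gradient methods
# with Enzyme rather than using a Python mechanism like this).
def quantum_grad(theta):
    shift = math.pi / 2
    return (quantum(theta + shift) - quantum(theta - shift)) / 2

# Classical pre- and postprocessing around the quantum call.
def preprocess(x):
    return 2.0 * x          # d(theta)/dx = 2

def postprocess(q):
    return q ** 2           # dy/dq = 2q

def hybrid(x):
    return postprocess(quantum(preprocess(x)))

def hybrid_grad(x):
    # End-to-end chain rule: the classical derivatives are written out by
    # hand here (Enzyme would derive them automatically), and the quantum
    # derivative comes from the registered custom gradient.
    theta = preprocess(x)
    q = quantum(theta)
    return 2 * q * quantum_grad(theta) * 2.0

# Check against the analytic derivative: d/dx cos(2x)^2 = -2*sin(4x).
assert abs(hybrid_grad(0.3) - (-2 * math.sin(4 * 0.3))) < 1e-9
```

This is exactly the kind of program (classical code wrapped around a quantum call) that the previous architecture could not differentiate exactly.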
Overview of changes
.preprocess
function that contains the classical preprocessing (just like the argmap function) and the.quantum
function that contains the actual quantum computation. The.preprocess
function ends in a call to the.quantum
function, meaning it can replace QNodes and leave the hybrid graph connected.GradOp
s are lowered to one or moreBackpropOp
s of the entire hybrid computation (one for each result entry).quantum
split out QNodes.EinsumLinalgGeneric
to support both memref args (in addition to tensor args) and dynamic shapes. This is used in the custom gradient of quantum functions.GradOp
s..qgrad
,.adjoint
) to the custom gradients across both thelower-gradients
andconvert-gradient-to-llvm
passes.catalyst.grad
functions while also preserving the same verification as before (return value of a QNode withdiff_method="adjoint"
must be an expval, etc)The important implementation quirks are documented on Notion.
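The `.preprocess`/`.quantum` split described above can be sketched in pure Python (the names here are hypothetical stand-ins; the real split happens on MLIR functions, not Python callables):

```python
# Stand-ins for the split-out functions. The key structural property:
# the preprocess function ends in a call to the quantum function, so
# the pair can replace the original QNode at its call sites and the
# hybrid call graph stays connected.
def qnode_quantum(params):
    # stand-in for the actual quantum computation
    return sum(params)

def qnode_preprocess(x):
    # classical preprocessing of the QNode arguments (playing the role
    # of the old argmap function)...
    params = [2.0 * x, x * x]
    # ...ending in a tail call to the quantum part
    return qnode_quantum(params)

# Callers that previously invoked the QNode call the preprocess
# function instead and get the same result.
result = qnode_preprocess(3.0)  # 2*3 + 3*3 = 15.0
```

Because the quantum part is now an isolated function with a plain argument list, a custom gradient can be attached to it while the surrounding classical code remains visible to Enzyme.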
**Benefits:** This new architecture supports differentiation of circuits with classical postprocessing, hybrid programs with multiple QNodes, and purely classical programs.
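For intuition on where the einsum mentioned above shows up, here is a hedged sketch (shapes and the contraction pattern are illustrative assumptions; Catalyst performs this contraction with `EinsumLinalgGeneric` on memrefs rather than in Python): the custom gradient of a quantum function contracts the quantum Jacobian with the cotangents arriving from the classical postprocessing.

```python
# vjp[j] = sum_i cotangent[i] * jacobian[i][j], i.e. einsum("i,ij->j").
def vjp(cotangent, jacobian):
    n_params = len(jacobian[0])
    return [
        sum(cotangent[i] * jacobian[i][j] for i in range(len(cotangent)))
        for j in range(n_params)
    ]

jac = [[1.0, 2.0],   # d(result_0)/d(param_j)
       [3.0, 4.0]]   # d(result_1)/d(param_j)
cot = [1.0, 0.5]     # cotangents from the classical postprocessing

print(vjp(cot, jac))  # -> [2.5, 4.0]
```

Supporting dynamic shapes in this contraction matters because the number of gate parameters is generally not known until runtime.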
[sc-41364]
[sc-41375]
[sc-42856]