Efficient Gradients with the Adjoint Method #90

Closed
wants to merge 58 commits

Conversation

@ebutler414 (Contributor) commented Mar 16, 2023

Efficient Gradients with the Adjoint Method

As per email correspondence, the end UI could be improved and the explanations contained in the tutorial are in a pretty raw state, but the code is hopefully in less of a raw state ;)

I have left a lot of these tasks unchecked until the final form of the end UI has been decided.

Best
Eoin

  • The contribution has been discussed and agreed on in the Issue section.
  • Code contributions do their best to follow the zen of Python.
  • The automated tests are all positive:
    • tox -e py36 (to run pytest) the code tests.
    • tox -e style (to run pylint) the code style tests.
    • tox -e docs (to run sphinx) generate the documentation.
  • Added tests for changed/added code.
  • API code contributions include input checks (defensive code).
  • API code contributions include helpful error messages.
  • The documentation has been updated:
    • docstring for all new functions/methods/classes/modules.
    • consistent style of all docstrings.
    • for new modules: /docs/pages/modules.rst has been updated.
    • for api contributions: /docs/pages/api.rst has been updated.
    • for api contributions: tutorials and examples have been updated.

Basically didn't change anything except adding the target state and reversing
the counting index.
However, the legs are not pointing in the correct direction: TODO

Also added the lists that store the forwardprop and backprop, as well as
the lines that append the nodes to the lists in the correct places.
Each tensor in the backprop now has its legs swapped (either through
numpy swapaxes in _get_pt_mpos_backprop in the for loop, or
alternatively transposed in the case of the system operators).
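To illustrate, here is a minimal sketch of that leg swap, assuming a `(bond_in, bond_out, sys_in, sys_out)` leg ordering (the helper names and the axis convention are illustrative, not the actual `_get_pt_mpos_backprop` internals):

```python
import numpy as np

def swap_mpo_legs(mpo_tensor: np.ndarray) -> np.ndarray:
    """Reverse a 4-leg process-tensor MPO tensor for the backprop direction."""
    tensor = np.swapaxes(mpo_tensor, 0, 1)   # swap the two bond legs
    tensor = np.swapaxes(tensor, 2, 3)       # swap the two system legs
    return tensor

def reverse_system_propagator(propagator: np.ndarray) -> np.ndarray:
    """For a plain system operator, a transpose plays the same role."""
    return propagator.T
```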

The network is propagated upwards as described in the visual diagrams, but
because of the axes swaps, time now runs downwards in those diagrams.
Created the high-level functions that the user will use, and a class to contain the gradient results.

Added the loop into the backprop that combines the forwardprop and backprop tensors. This deviates from my private version of the code, where this loop was entirely separate from the forwardprop and backprop. The reason for this change is to allow the forwardprop tensors to be deleted as the backprop is generated, which reduces the amount of memory needed for the adjoint method by 50%.
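A minimal sketch of this memory-saving pattern, using plain vectors and matrices in place of the actual network tensors (all names are illustrative, not the backend API):

```python
import numpy as np

def combined_backprop(forwardprop_states, propagators, target_state):
    """Combine forwardprop and backprop in one loop, freeing forwardprop
    entries as soon as they have been used.

    forwardprop_states[k] is assumed to hold the state just *before*
    propagators[k] is applied.
    """
    derivatives = []
    back_state = target_state
    for step in reversed(range(len(propagators))):
        # derivative of the cost w.r.t. propagators[step] (chain rule term)
        derivatives.append(np.outer(back_state, forwardprop_states[step]))
        # move the adjoint state one step further back
        back_state = propagators[step].T @ back_state
        # the forwardprop tensor for this step is no longer needed: free it
        forwardprop_states[step] = None
    derivatives.reverse()
    return derivatives
```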
I haven't debugged it, but it has all of the methods it needs for now.
Did not debug this; it probably doesn't work, or even compile.
Only imports the PT and is very barebones right now.
Now gets the index for the chain rule using interp1d, which makes way more sense as it removes all of the floating point error issues.
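A minimal sketch of the interp1d trick, assuming a uniform time grid (the grid and helper name are illustrative):

```python
import numpy as np
from scipy.interpolate import interp1d

times = np.linspace(0.0, 5.0, 51)                       # assumed time grid
time_to_index = interp1d(times, np.arange(len(times)))  # linear map: time -> index

def index_of(t: float) -> int:
    """Nearest grid index for time t, robust to small floating point error."""
    return int(round(float(time_to_index(t))))

assert index_of(2.4999999999) == 25
```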

A lot of the old method of searching sorted lists was deleted in this commit, so if you're missing something and want to see how the previous method searched through a sorted list, look at the previous commit.
Fixed the lines that store the backpropagated tensors and delete them accordingly.

Also fixed a bug where, in the backpropagation, the indices were one ahead of what
they should have been: since we are propagating backwards, the propagators
we are actually interested in are the ones associated with the previous step
in the forwardprop.
Forgot that I need N(MPOs)+1 dprop lists, not N.
I had a z and an x mixed up; gradients still aren't correct, however they're
currently considerably better.

They're wrong because I'm taking the derivative across two half timesteps,
but I'm not reintroducing the second when I'm combining the derivatives with
dprop dsys; next update...
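For reference, the missing piece is just the product rule over the two half steps; a minimal sketch (placeholder matrices, not the actual code):

```python
import numpy as np

def dstep(u_half1, u_half2, du_half1, du_half2):
    """Derivative of a full step U = u_half2 @ u_half1 w.r.t. a parameter.

    Both half-step contributions must be kept:
        d(U2 @ U1)/dtheta = dU2 @ U1 + U2 @ dU1
    Dropping the second term is the bug described above.
    """
    return du_half2 @ u_half1 + u_half2 @ du_half1
```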
Finds two numbers in an ascending list that differ by one, and checks if the first one is even.

This is the condition necessary for adjacent operators in the algorithm.
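A minimal sketch of that check (illustrative helper, not the committed code):

```python
def adjacent_operator_pairs(indices):
    """Return pairs from an ascending list that differ by one and whose first
    index is even, i.e. two operators on adjacent halves of the same time step."""
    return [(a, b) for a, b in zip(indices, indices[1:])
            if b - a == 1 and a % 2 == 0]

assert adjacent_operator_pairs([0, 1, 3, 6, 7]) == [(0, 1), (6, 7)]
```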
Added a brief sketch of what the code should look like when I add the ability to
use two propagators. This code is currently broken, debugging...
Basically made the previous algorithm I introduced defunct, as I'm not convinced it
adds any time saving that would warrant the increase in complexity and modes of
failure of the code.

Instead added the extra node necessary during the chain rule, which I previously
forgot to add.

The whole thing is still buggy; one bug is because I haven't treated the first and last
half propagators separately, as they're a special case, but there are more bugs
that I don't know about.
I felt like what I was doing was too long-winded for a single tutorial, so I
decided it'd be better to split it up into a very introductory tutorial and
then a separate one to talk about optimisation and how that would be done.

This is in addition to a separate tutorial that will talk about the chain rule
and finite differencing / autograd.
where forward and backprop lists are both specified. This wouldn't have
affected the other case, where I delete the forward and backprop lists
and just give total derivatives (which are still bugged).
But I don't like it; I'm probably going to change some stuff.
I think it's currently bugged; this will be converted into a Jupyter
notebook when it's done. It's currently pretty hardcoded.
Improved dprop accessibility and understandability, and tested it to make sure
it works properly.
Deleted a script that had no purpose other than a proof of concept.
Basically reverted the previous change and made the gradient function return the
dynamics object, so the adjoint calculation can be reused for different dprop lists.
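A minimal sketch of the reuse pattern this enables (class and method names are hypothetical, not the frontend API):

```python
import numpy as np

class AdjointResult:
    """Holds the adjoint tensors from one (expensive) contraction so that
    several dprop lists can be applied to them cheaply."""

    def __init__(self, adjoint_tensors):
        self.adjoint_tensors = adjoint_tensors

    def total_derivatives(self, dprop_list):
        # chain rule: contract dL/dU_k with dU_k/dtheta for each step
        return [np.tensordot(adj, dprop, axes=2)
                for adj, dprop in zip(self.adjoint_tensors, dprop_list)]

# toy usage: one contraction, two different dprop lists
result = AdjointResult([np.random.rand(2, 2) for _ in range(5)])
grads_a = result.total_derivatives([np.eye(2)] * 5)
grads_b = result.total_derivatives([np.ones((2, 2))] * 5)
```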
Updated the frontend comments / documentation

Added the retain forward-prop and back-prop user flag to the frontend
(previously it was just in the backend)
get_hamiltonian now makes way more sense.....
It's at the start of the tutorial.

As piper suggested
Deleted redundant tutorials
@piperfw changed the title from "Effecient Gradients with the Adjoint Method" to "Efficient Gradients with the Adjoint Method" on Mar 21, 2023
Added ParametrizedSystem, as well as an example of how this script
should work. Most of this was originally written by Gerald.
This approach will break my previous code, but retains all of its power
and is more succinct.

This should work perfectly as-is with Gerald's example.
The plan is for it to instead inherit from a propagator-based class, separate for the
gradient code, that just supplies a list of propagators instead of a Hamiltonian.
The PropagatorSystem should be pretty functional; the other one is very much not.
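A minimal sketch of how that split might look (class names follow the commit messages, but the signatures and the unitary-propagator construction are illustrative, not the final API):

```python
import numpy as np
from scipy.linalg import expm

class PropagatorSystem:
    """Supplies the gradient code with a plain list of propagators."""
    def __init__(self, propagators):
        self.propagators = list(propagators)

    def get_propagators(self):
        return self.propagators

class ParametrizedSystem(PropagatorSystem):
    """Builds the propagator list from a parameter-dependent Hamiltonian."""
    def __init__(self, hamiltonian_func, parameters, dt):
        propagators = [expm(-1j * hamiltonian_func(p) * dt) for p in parameters]
        super().__init__(propagators)

# toy usage: a driven two-level system with one parameter per time step
sigma_x = np.array([[0.0, 1.0], [1.0, 0.0]])
system = ParametrizedSystem(lambda p: p * sigma_x, parameters=[0.1, 0.2, 0.3], dt=0.05)
```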
@gefux (Member) commented May 4, 2024

This PR is a previous draft of #127.

@gefux closed this May 4, 2024