Feature germsel enhancements #270

coreyostrove · 2022-11-08T02:27:35Z

First, and primary, PR for a bunch of branches that revamp the germ selection code. This update adds a number of additional performance enhancements:

Better deduping routines for decreasing the size of the search space.
A new hybrid caching scheme that leverages the low-rank nature of the Jacobians for individual germs.
An alternative implementation of the greedy search that uses this hybrid cache to very quickly evaluate the cost function for candidate germ set additions using low-rank psuedoinverse updates.

Because I practiced terrible branch hygiene (but also in part because some changes to the germ sel code were in service of other algorithms), there are a number of additional updates in the pipeline, specifically on the branches 'feature-fpr-algorithm-performance' and 'feature-fidsel-performance'. This includes better docstrings, logging/exception messages and the removal of a bunch of debugging print statements.

This commit implements a rudimentary form of caching for the effective states and preps to avoid having to repeatedly calculate the transfer matrix for the fiducial circuits.

…e available fiducials This commit adds basic caching support to avoid re-calculating the PTMs for the available fiducial list when generating the prep cache and measurement cache, which typically use the same set of available circuits and as such the same PTMs. Also add a function for removing effective identities and circuits with duplicate PTMs from the available fiducial list to decrease the search space.

Updated the scoring routine to inline the calculation to enable avoiding duplicate work. Modest performance boost.

Updated caching to avoid calling to_vector of POVMs and state preps for key generation too often which was proving expensive.

This commit is a one-line change of how the number of non-zero eigenvalues is counted which works out to be a few times faster than the old way I implemented.

Biggest changes are to the deduping routine which is now more efficient because we now winnow the list of circuits to search through for duplicates on the fly, reducing the number of pairwise checks along the way, big speedboost from this. Also switched to using the frobenious norm to identify duplicate PTMs as this was much faster in practice than np.allclose. Also updates the cache to use the string reps of circuits as keys instead of the circuits themselves to reduce the cost of repeatedly hashing circuits which can be more expensive than straight strings, modest performance enhancement from this

Add a basic check through the list of candidate germs to remove any with duplicate PTMs. Modest reduction in the size of the search space/performance.

…formance gains Adds support for user specified numpy floating point data types. This advanced option can enable a factor of 2-4 savings on the memory footprint of germ selection. Also exposes at the top level the ability to switch between a fully cached and on-the-fly calculation of the Jacobians for each germ.

1. Hybrid caching feature built on using the compact SVD representation of each of the Jacobians of each of the germs. 2. low-rank update based optimization loop using a hybrid approach based on results from M. Brand and N. Minamide. 3. Force-rank-update mode which allows you to force a short circuit of an update calculation if it doesn't increase the rank of the overall jacobian. (Still work in progress) 4. Functionality for dropping a user specified random fraction of candidate germs. 5. Minor change to the projection onto the kite code which uses a faster kronecker product implementation.

Adds progress logging to the cache generation stage and also modifies the way the inner loop progression is reported (by changing the progress logging level context).

The gauge parameter calculation is kind of slow so print out the result of this so we can re-use it later to circumvent the need to calculate this again.

One more minor printing change.

Update the code to allow circumventing the calculation of the number of gauge parameters entirely when both the number of gauge and non gauge parameters are manually specified.

I was originally returning both the components of the rank decomposition and the eigenvalues for the cache, but I never actually used the eigenvalues so I am dropping them from this structure. If I think of an application where I need to know what these are I can always reverse these changes very easily.

…disk This adds support for saving and loading compact EVD caches to disk to accelerate future calls to the germ selection algorithm. In the future I'll look into a more robust solution that can, for example, allow for incomplete coverage of the current candidate germ list etc...

When the SVD calculation fails to converge using numpy and it's default lapack driver of gesdd, fall back to scipy's implementation using the non-default gesvd driver option. Supposedly gesvd is more stable at the expense of being a bit slower.

…d toggle on and off prep and measurement selection Add option to return the deduped candidate list as well as the ability to manually specify the candidate list as a kwarg to avoid redoing the dedupe/ckt gen procedure. Also adds toggles to enable the generation of preps and measurements separately.

Adds an additional penalty term to the objective function where in a penalty is added to the score for each instance of a particular gate. This is passed in by the user in the form of a dictionary where the keys are gates and the values are the penalty weights.

Add a fallback behavior that tries instead to use a more stable implementation of the pinv based on scipy's SVD using a different lapack driver.

minor typo fix

…acobians onto the commutant Updates the implementation of the construction of the super-duper operator for projecting Jacobians onto the commutant to avoid doing double work when forming the projector onto degenenerate eigenspaces. Modest performance improvement for 2-qubits. Should return an effectively identical twirling super-duper operator otherwise.

One of the Cholesky decompositions is failing for mysterious reasons. Catch those errors and print out some diagnostic info.

At least one of the Cholesky decompositions was failing because a matrix was not PD, catch this, print some diagnostics, and fall back to calculation that doesn't use Cholesky (this was put in place for performance and is not strictly necessary).

Adds additional handles for setting and piping in a use specified value of the minimum eigenvalue threshold tolerance. I think the Cholesky decompositions we were seeing fail were effectively due to the Jacobian whose psuedoinverse we were updating being poorly conditioned due to an overly generous (i.e. too small) minimum eigenvalue tolerance. This was technically a variable that could be set but didn't have a way to do so at the top level and the hardcoded value resulted in issues.

This updates the final germ set score calculation to use a more memory efficient construction of the Jacobian.

…ure-germsel-enhancements

Adds unit tests to the germ selection test package to ensure the algorithm runs through to completion. Currently doesn't check for a particular set to be returned (none of the germ selection test do at the moment though).

fix linting issue

Add additional unit test for low-rank update germ sel

enielse

Looks great - if possible, and before you forget, perhaps throw into the docstrings a few more references to the low-rank-update algorithms/formula, i.e., in the literature or on the web?

pygsti/algorithms/germselection.py

Removes the argument num_gauge_params from find_germs and child functions since this is extraneous if we are already specifying the number of non-gauge parameters.

Corey Ostrove added 30 commits May 29, 2022 23:07

Implement Rudimentary Caching for Performance

898f200

This commit implements a rudimentary form of caching for the effective states and preps to avoid having to repeatedly calculate the transfer matrix for the fiducial circuits.

Updated the scoring routine

694e07a

Updated the scoring routine to inline the calculation to enable avoiding duplicate work. Modest performance boost.

Additional Caching Improvements

d519312

Updated caching to avoid calling to_vector of POVMs and state preps for key generation too often which was proving expensive.

Switch the way nonzero eigenvalues are counted

ef36728

This commit is a one-line change of how the number of non-zero eigenvalues is counted which works out to be a few times faster than the old way I implemented.

Add basic deduping support

671d5f2

Add a basic check through the list of candidate germs to remove any with duplicate PTMs. Modest reduction in the size of the search space/performance.

Adds additional progress logging

83ccf0d

Adds progress logging to the cache generation stage and also modifies the way the inner loop progression is reported (by changing the progress logging level context).

Print out the number of gauge and non-gauge parameters

1044eaa

The gauge parameter calculation is kind of slow so print out the result of this so we can re-use it later to circumvent the need to calculate this again.

One more minor printing change

9b22760

One more minor printing change.

Update specification of non-gauge and gauge parameter counts

33199c7

Update the code to allow circumventing the calculation of the number of gauge parameters entirely when both the number of gauge and non gauge parameters are manually specified.

Minor logging update

2e064dd

Add fall back behavior for failed SVDs

ce24505

When the SVD calculation fails to converge using numpy and it's default lapack driver of gesdd, fall back to scipy's implementation using the non-default gesvd driver option. Supposedly gesvd is more stable at the expense of being a bit slower.

Fallback behavior for psuedoinverse calculation

243f0d7

Add a fallback behavior that tries instead to use a more stable implementation of the pinv based on scipy's SVD using a different lapack driver.

Minor typo fix

de96935

minor typo fix

Print some diagnostic info to track down a wonky cholesky decomposition

40b2503

One of the Cholesky decompositions is failing for mysterious reasons. Catch those errors and print out some diagnostic info.

Minor typo fix

39194e1

Implement memory efficient version of final germ set score verification

5704e3b

This updates the final germ set score calculation to use a more memory efficient construction of the Jacobian.

Merge remote-tracking branch 'origin/bugfix-fpr-pgl-params' into feat…

d27d2f4

…ure-germsel-enhancements

Fix parameter counting for minimum number of fiducial pairs needed.

6d02f10

Add unit tests for low-rank update code

2e60cf0

Adds unit tests to the germ selection test package to ensure the algorithm runs through to completion. Currently doesn't check for a particular set to be returned (none of the germ selection test do at the moment though).

Merge branch 'develop' into feature-germsel-enhancements

3fcd107

coreyostrove requested review from enielse and sserita November 8, 2022 02:27

Corey Ostrove added 4 commits November 7, 2022 19:29

Fix linting issue

1b71ef6

fix linting issue

Add additional unit test for low-rank update germ sel

fd87b96

Add additional unit test for low-rank update germ sel

Fix linting error

f529c57

Another minor linting fix

bac74ac

coreyostrove marked this pull request as ready for review November 8, 2022 15:39

enielse approved these changes Nov 11, 2022

View reviewed changes

pygsti/algorithms/germselection.py Outdated Show resolved Hide resolved

pygsti/algorithms/germselection.py Show resolved Hide resolved

pygsti/algorithms/germselection.py Outdated Show resolved Hide resolved

Remove extraneous function argument

5e48194

Removes the argument num_gauge_params from find_germs and child functions since this is extraneous if we are already specifying the number of non-gauge parameters.

enielse approved these changes Nov 11, 2022

View reviewed changes

coreyostrove merged commit becc31c into develop Nov 14, 2022

coreyostrove deleted the feature-germsel-enhancements branch November 14, 2022 05:38

sserita mentioned this pull request Nov 18, 2022

Generating 3Q Models breaks fiducial selection #245

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature germsel enhancements #270

Feature germsel enhancements #270

coreyostrove commented Nov 8, 2022 •

edited

Loading

enielse left a comment

Feature germsel enhancements #270

Feature germsel enhancements #270

Conversation

coreyostrove commented Nov 8, 2022 • edited Loading

enielse left a comment

Choose a reason for hiding this comment

coreyostrove commented Nov 8, 2022 •

edited

Loading