Feature fpr algorithm performance #274

coreyostrove · 2022-11-14T23:09:34Z

This branch introduces a new greedy search algorithm for performing per-germ FPR. This algorithm leverages the same low-rank update machinery used to generate speedups for germ selection. Due to the nature of the low-rank updates used, this utilizes an objective function based on the trace of the psuedo-inverse of a candidate fiducial set's Jacobian instead of the minimum eigenvalue (these should largely track with each other though). In the course of developing this a number of edge cases were revealed in the low-rank update code which didn't arise when testing germ selection. These have been patched, and as a nice side-effect the fixes actually result in a modest performance boost for germ selection too.

Smaller changes included in the branch include:

bugfixes for numpy data type handling
bugfixes for leaky RNG seed plumbing
implementation of a simple trick for SVDs of gramians that speeds up the construction of EVD caches
Mass expunging of debugging and print statements that are no longer needed

Switch numpy calls to use faster eigenvalue calculation for symmetric matrices.

…m-performance

This adds a check for what was previously an edge case, but now is a common occurrence when applying this to FPR, where the update doesn't change the rank of the base matrix because that matrix is full rank already. In this case we fall back to the standard Woodbury inverse identity.

Now greedy search algorithm for per-germ FPR using a new objective function and low-rank update based performance enhancements.

The seed wasn't be plumbed in properly for the new greedy FPR algorithm.

Fixes a seeding issue in candidate circuit selection wherein the wrong seed was being passed into a function. Also modifies default behavior when the main seed is specified but a separate seed for the candidates is not.

I was getting some strange behavior when using complex dtypes with matrices we're assuming to be real. For now enforce a requirement that you use a real-valued data type when assume real is true and vice versa.

Speed up the construction of the compact EVD cache by using a connection between the left and right singular vectors.

Removes a bunch of unneeded debugging code and cleans up the logging.

Remove a bunch of unneeded debugging lines and print statements and clean up logging as needed.

Fixes a typo where an incorrect variable name was being updated/used.

sserita

It looks good to me, although I haven't checked the math in detail. As Erik suggested, we can leave this open until we figure out the other FPR issue.

The exit criterion used in the greedy search algorithm didn't take into account the rank of the candidate set's jacobian and so it was possible to have a satisfactory psuedoinverse trace while not being sufficiently high rank. This adds an explicit check for this.

Adds more logging to the randomized search indicating the maximum rank seen at each iteration.

coreyostrove · 2022-11-17T05:47:04Z

I've spent some time looking into @enielse's concerns about the size of the fiducial sets returned for the randomized fiducial pair search routine. I've more or less convinced myself that this isn't a bug. I added additional logging to the randomized search routine so that the maximum rank seen for any Jacobian at each iteration (iteration = candidate fiducial set size) was printed. It looks like (for the particular test case using XYCNOT) that selecting fiducials at random just does a pretty poor job at giving us a full-rank Jacobian (this wouldn't have been my intuition, but it looks to be the case here). The objective function here can't be satisfied until the Jacobian is full-rank and so this is the limiting factor.

Looking into this did help me identify a mistake in the greedy algorithm, by the way. So this was all worthwhile. I had mistakenly failed to include in the exit criterion a check to ensure that the Jacobian was full rank and, unlike the minimum eigenvalue criterion, the psuedoinverse trace condition can be satisfied without being full rank because of how the psuedoinverse is defined. That is fixed now. We now get fiducial sets that are a bit larger, but the greedy search really does seem to be outperforming the random search by a significant amount here. The greedy score function does favor updates that increase the rank of the fiducial set the most up until the point where the Jacobian becomes full-rank, so maybe this isn't too surprising.

enielse · 2022-11-17T14:29:47Z

I did some digging too - and agree with @coreyostrove: I don't think there's necessarily a bug in the current FPR algorithm. The large number of pairs the old method is finding differs from earlier pyGSTi versions but from what I can tell this is due to bug fixes rather than introductions. Down the line, we should be able to trim down the list of needed pairs by tracking the parameter directions a germ was intended to amplify, but that's for another PR.

enielse

Looks great! This represents some really nice added functionality. Adding some comments referencing where the math comes from would be great if we have the time.

enielse · 2022-11-17T14:38:47Z

pygsti/algorithms/germselection.py

+            print('central_mat shape: ', central_mat.shape)
+        #now calculate the diagonal elements of pinv_E_beta@central_mat@pinv_E_beta.T
+
+        #Take the cholesky decomposition of the central matrix:


Maybe add a comment here explaining the different execution paths. If the Cholesky decomp fails, it seems that you can fall back on just an eisum (line 3144) that I assume is slower than the route using the Cholesky decomp, is that right? If this is correct, I think just noting in a comment that the einsum on 3144 is what we're trying to accomplish and this can be optionally sped up via a Cholesky decomposition execution path (if it works).

Thanks for the feedback, @enielse. I added some comments on this in my latest commit.

sserita

All looks good to me! @coreyostrove, ball is in your court for whether you want to add the comments Erik is suggesting, or just merge this in now.

Adds comment detailing the theory used for the low-rank update version of the greedy search algorithm. Also renames the greedy search algorithms function name more appropriately.

Fixes a broken unit test by giving it a slightly larger memory limit (I must've messed with the memory limit calculation at some point).

coreyostrove · 2022-11-20T23:21:21Z

I finally forced myself to sit down and write up a description of the theory behind the low-rank update code. I eventually plan to convert this into an expanded and more detailed write-up in latex (hence why it reads the way it does), so let me know if anything is unclear or confusing and I'll try to clarify that once I get around to writing that document.

Edit: Also I fixed that germ selection unit test in algorithmsb I accidentally broke. Just needed a slightly larger memory limit (I vaguely recall changing how the memory estimate was calculated, so that must've been why).

Corey Ostrove added 12 commits September 21, 2022 22:28

Switch to symmetric eigenvalue calculation

6b130a7

Switch numpy calls to use faster eigenvalue calculation for symmetric matrices.

Merge branch 'feature-germsel-enhancements' into feature-fpr-algorith…

c0b0c67

…m-performance

New greedy search algorithm for per-germ FPR

b7d17ab

Now greedy search algorithm for per-germ FPR using a new objective function and low-rank update based performance enhancements.

Fix RNG seeding

e60cb07

The seed wasn't be plumbed in properly for the new greedy FPR algorithm.

Fix seeding issue in circuit selection

35e59b3

Fixes a seeding issue in candidate circuit selection wherein the wrong seed was being passed into a function. Also modifies default behavior when the main seed is specified but a separate seed for the candidates is not.

Numpy dtype patch

3877b7c

I was getting some strange behavior when using complex dtypes with matrices we're assuming to be real. For now enforce a requirement that you use a real-valued data type when assume real is true and vice versa.

Use properties of SVD to speed up caching

b303397

Speed up the construction of the compact EVD cache by using a connection between the left and right singular vectors.

Clean up debugging lines and logging

48b0d4b

Removes a bunch of unneeded debugging code and cleans up the logging.

Merge branch 'develop' into feature-fpr-algorithm-performance

a9fc9ec

Clean up debugging code and print statements

3f73a11

Remove a bunch of unneeded debugging lines and print statements and clean up logging as needed.

Typo fix

560f36f

Fixes a typo where an incorrect variable name was being updated/used.

coreyostrove requested review from enielse and sserita November 14, 2022 23:09

sserita reviewed Nov 15, 2022

View reviewed changes

Corey Ostrove added 2 commits November 16, 2022 20:09

Additional logging information

707fc3e

Adds more logging to the randomized search indicating the maximum rank seen at each iteration.

enielse approved these changes Nov 17, 2022

View reviewed changes

sserita approved these changes Nov 18, 2022

View reviewed changes

Corey Ostrove added 2 commits November 19, 2022 22:37

Add low-rank update theory section

5c46890

Adds comment detailing the theory used for the low-rank update version of the greedy search algorithm. Also renames the greedy search algorithms function name more appropriately.

Fix unit test

f1b35a3

Fixes a broken unit test by giving it a slightly larger memory limit (I must've messed with the memory limit calculation at some point).

coreyostrove merged commit cd0a5ed into develop Nov 20, 2022

coreyostrove deleted the feature-fpr-algorithm-performance branch November 20, 2022 23:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature fpr algorithm performance #274

Feature fpr algorithm performance #274

coreyostrove commented Nov 14, 2022

sserita left a comment

coreyostrove commented Nov 17, 2022

enielse commented Nov 17, 2022

enielse left a comment

enielse Nov 17, 2022

coreyostrove Nov 20, 2022

sserita left a comment •

edited

coreyostrove commented Nov 20, 2022 •

edited

Feature fpr algorithm performance #274

Feature fpr algorithm performance #274

Conversation

coreyostrove commented Nov 14, 2022

sserita left a comment

Choose a reason for hiding this comment

coreyostrove commented Nov 17, 2022

enielse commented Nov 17, 2022

enielse left a comment

Choose a reason for hiding this comment

enielse Nov 17, 2022

Choose a reason for hiding this comment

coreyostrove Nov 20, 2022

Choose a reason for hiding this comment

sserita left a comment • edited

Choose a reason for hiding this comment

coreyostrove commented Nov 20, 2022 • edited

sserita left a comment •

edited

coreyostrove commented Nov 20, 2022 •

edited