added jvp rule for eigh, tests #358

levskaya · 2019-02-12T07:36:45Z

This adds the jvp rule for the symmetric eigendecomposition eigh operator. I followed Matthew's approach with the cholesky op of symmetrizing the input argument. I'd be happy to change that up if it's not how we want to deal with symmetry-bound operators.

levskaya · 2019-02-12T08:41:11Z

😠sorry, all these tests pass on our internal testing suite, will investigate what's going on

hawkinsp · 2019-02-12T12:19:06Z

A couple of things to watch out for that might explain different results on the CI builder:

we randomly sample tests to run, since we have so many. Different environments might have different RNG behavior and run a different subset. You can run a larger sample by setting the environment variable JAX_NUM_GENERATED_CASES=1000 (default is 10), and it's usually worth doing this when adding new tests.
the random initialization of tensors will also likely differ across machines/numpy versions.

levskaya · 2019-02-12T19:40:06Z

Yeah the tests probably weren't being run with enough examples initially. There's definitely a real problem with a subset of complex matrices here, though I'm a little baffled by what's going on - will try to hunt down the root cause soon.

levskaya · 2019-02-14T05:03:42Z

After trying to solve it again I finally decided to think.

The gradient of the complex eigenvector coordinates is ill-posed - every column of the eigenvector matrix has a complex degree of freedom (2*n real degrees of freedom for a nxn hermitian problem). e.g. onp.eig/onp.eigh will produce radically different coordinates given the differences in phase conventions in the underlying algorithms. Unless you were to backprop through the actual solver algorithms you're not going to match the particular coordinate gradients accurately due to all the irrelevant degrees of freedom of the problem. This is fine, it just means a simple numerical gradient check will fail.

What is true and can be tested is that (v+dv) are close to being true eigenvectors of the new eigenvalues of (A+dA). I suppose I should write a different sort of unit test for this case.

levskaya · 2019-02-14T10:57:45Z

Ok, there's a first version of eigh grad. Just let me know if anything is lacking. (Sometime later I should review the degenerate theory to see how hard it would be to make the grad safer for degeneracy, at least for the common case where the perturbation naturally splits the degeneracy...)

mattjj · 2019-02-14T16:43:14Z

Great thinking! That makes a lot of sense; why didn't we realize that before!

These are some pretty deep cuts into numerical linear algebra, and fantastic contributions. I'll take a look over the code now.

mattjj

This is beautiful. Seriously, amazing comments with references. I learned some things about the mathematics of differentiating eigh (and eigh algorithms).

googlebot added the cla: yes label Feb 12, 2019

added jvp rule for eigh, tests

8a84ae8

levskaya force-pushed the master branch from 1671861 to 8a84ae8 Compare February 14, 2019 05:51

levskaya added 2 commits February 13, 2019 23:23

fix testing of eigh jvp rule

ed437b4

fix missing symmetrize_input arg

8cd3f44

levskaya force-pushed the master branch from 9a5b4c1 to 8cd3f44 Compare February 14, 2019 08:41

actually test relative error

cd22050

mattjj self-requested a review February 14, 2019 16:43

mattjj approved these changes Feb 14, 2019

View reviewed changes

mattjj merged commit fc4c8bd into google:master Feb 14, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added jvp rule for eigh, tests #358

added jvp rule for eigh, tests #358

levskaya commented Feb 12, 2019

levskaya commented Feb 12, 2019 •

edited

hawkinsp commented Feb 12, 2019

levskaya commented Feb 12, 2019

levskaya commented Feb 14, 2019

levskaya commented Feb 14, 2019

mattjj commented Feb 14, 2019

mattjj left a comment

added jvp rule for eigh, tests #358

added jvp rule for eigh, tests #358

Conversation

levskaya commented Feb 12, 2019

levskaya commented Feb 12, 2019 • edited

hawkinsp commented Feb 12, 2019

levskaya commented Feb 12, 2019

levskaya commented Feb 14, 2019

levskaya commented Feb 14, 2019

mattjj commented Feb 14, 2019

mattjj left a comment

Choose a reason for hiding this comment

levskaya commented Feb 12, 2019 •

edited