
Debug positive definite constraints #68720

Conversation

@nonconvexopt (Contributor) commented Nov 22, 2021

While implementing #68644, during testing of 'torch.distributions.constraints.positive_definite', I found an error in the following code (location):

class _PositiveDefinite(Constraint):
    """
    Constrain to positive-definite matrices.
    """
    event_dim = 2

    def check(self, value):
        # Assumes that the matrix or batch of matrices in value are symmetric
        # info == 0 means no error, that is, it's SPD
        return torch.linalg.cholesky_ex(value).info.eq(0).unsqueeze(0)

The error occurs when I check the positive definiteness of
torch.cuda.DoubleTensor([[2., 0], [2., 2]])
but it does not occur for
torch.DoubleTensor([[2., 0], [2., 2]])

You can easily reproduce the error with the following code:

Python 3.9.7 (default, Sep 16 2021, 13:09:58)
[GCC 7.5.0] :: Anaconda, Inc. on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>> const = torch.distributions.constraints.positive_definite
>>> const.check(torch.cuda.DoubleTensor([[2., 0], [2., 2]]))
tensor([False], device='cuda:0')
>>> const.check(torch.DoubleTensor([[2., 0], [2., 2]]))
tensor([True])

The cause of the error can be analyzed further by passing 'check_errors=True' as an additional argument to 'torch.linalg.cholesky_ex'. It seems to be caused by recent changes in 'torch.linalg'.
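
For instance, a minimal debugging sketch (added here for illustration, not part of the original report):

import torch

A = torch.tensor([[2., 0.], [2., 2.]], dtype=torch.double)
# With check_errors=True, cholesky_ex raises an informative error instead of
# only recording the failure in the returned `info` tensor.
try:
    torch.linalg.cholesky_ex(A, check_errors=True)
except RuntimeError as e:
    # On CPU this reports the failing leading minor; per the report above,
    # the CUDA path at the time did not flag any error.
    print(e)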
Given that, I suggest modifying the '_PositiveDefinite' class to use the 'torch.linalg.eig' function, like below:

class _PositiveDefinite(Constraint):
    """
    Constrain to positive-definite matrices.
    """
    event_dim = 2

    def check(self, value):
        return (torch.linalg.eig(value)[0].real > 0).all(dim=-1)

With the above implementation, I get the following result:

Python 3.9.7 (default, Sep 16 2021, 13:09:58) 
[GCC 7.5.0] :: Anaconda, Inc. on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>> const = torch.distributions.constraints.positive_definite
>>> const.check(torch.cuda.DoubleTensor([[2., 0.], [2., 2.]]))
tensor(True, device='cuda:0')
>>> const.check(torch.DoubleTensor([[2., 0.], [2., 2.]]))
tensor(True)

FYI, I do not know which algorithms are used in 'torch.linalg.eig' and 'torch.linalg.cholesky_ex'. As far as I know, they generally have the same time complexity, O(n^3). With special algorithms or finer parallelization, the time complexity of Cholesky decomposition may be reduced to approximately O(n^2.5). If there is a reason that 'torch.distributions.constraints.positive_definite' previously used 'torch.linalg.cholesky_ex' rather than 'torch.linalg.eig', I would like to know it.
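
For reference, a rough micro-benchmark of the two candidate checks (a sketch assuming a single well-conditioned SPD input; exact numbers depend on backend and hardware):

import time
import torch

n = 512
# Build a well-conditioned symmetric positive-definite test matrix.
A = torch.randn(n, n, dtype=torch.double)
A = A @ A.T + n * torch.eye(n, dtype=torch.double)

def bench(fn, reps=20):
    fn()  # warm-up
    start = time.perf_counter()
    for _ in range(reps):
        fn()
    return (time.perf_counter() - start) / reps

print('cholesky_ex:', bench(lambda: torch.linalg.cholesky_ex(A).info.eq(0)))
print('eigvals:    ', bench(lambda: (torch.linalg.eigvals(A).real > 0).all()))

In practice both are O(n^3), but the Cholesky factorization has a considerably smaller constant than a general eigendecomposition, which is likely part of why the Cholesky-based check was chosen.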

@pytorch-probot bot commented Nov 22, 2021

⚛️ CI Flow Status: ruleset v1; PR ciflow labels: ciflow/default. All workflows carrying the ciflow/default label were triggered (Linux CPU/CUDA builds, mobile, ASAN, ONNX, bazel, Android, and Windows jobs); workflows without it (libtorch, iOS, macOS, periodic, and slow jobs) were skipped. To rerun CI, comment on the PR and tag @pytorchbot:

# ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun

# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and triggering the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow

For more information, please take a look at the CI Flow Wiki.

@facebook-github-bot (Contributor) commented Nov 22, 2021

💊 CI failures summary: as of commit 67b3448 (more details on the Dr. CI page), 💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI.

@jbschlosser added the triaged label Nov 22, 2021

@lezcano (Collaborator) left a comment

So, this code assumes that the matrix you're working with is symmetric. Note that the matrix you tried this with is not symmetric; see the definition of [definite matrix], and see also the comment in the check function. When given a non-symmetric matrix, cholesky reads only the lower half, in effect copying it onto the upper half. In your case, you'd get the matrix [[2, 2], [2, 2]], which is symmetric positive semidefinite but singular. Due to numerical errors, the CPU backend correctly classifies it as non-SPD, while the CUDA backend fails and reports that it is SPD.

Regardless of this, about your solution:

  1. It's not correct, as a matrix with complex eigenvalues may be considered "positive definite" (see the counterexample sketch below).
  2. To get just the eigenvalues, consider using linalg.eigvals in the future.
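
A concrete counterexample for point 1 (a sketch added for illustration, not from the original thread): the matrix below has complex eigenvalues whose real parts are positive, so the proposed eig-based check accepts it, yet its quadratic form takes negative values.

import torch

A = torch.tensor([[1., 5.], [-1., 1.]], dtype=torch.double)

# The proposed check passes: the eigenvalues are 1 ± sqrt(5)i.
print((torch.linalg.eigvals(A).real > 0).all())  # tensor(True)

# But A is not positive definite: x^T A x < 0 for x = (1, -1).
x = torch.tensor([1., -1.], dtype=torch.double)
print(x @ A @ x)                                 # tensor(-2.)

# The symmetric part (A + A^T) / 2 has a negative eigenvalue.
print(torch.linalg.eigvalsh((A + A.T) / 2))      # tensor([-1., 3.])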

@nonconvexopt (Contributor, Author) commented Nov 23, 2021

> So, this code assumes that the matrix you're working with is symmetric. […]

Thank you for the detailed explanation! I will check linalg.eigvals. I forgot the point about the symmetry of the tensor.

In fact, I have a suggestion for the implementation of constraints for matrices: inheritable_torch.distributions.constraints_for_matrices
@lezcano How about this design (sketched below)? It differs from the previous implementation style in 'torch.distributions.constraints' and requires more computation, but I think it is a more logical approach.
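
For concreteness, a hedged sketch of what such an inheritable design could look like (class names and details here are illustrative, not the final API):

import torch
from torch.distributions.constraints import Constraint

class _Symmetric(Constraint):
    """Constrain to symmetric square matrices."""
    event_dim = 2

    def check(self, value):
        return torch.isclose(value, value.transpose(-2, -1)).all(dim=-1).all(dim=-1)

class _PositiveDefinite(_Symmetric):
    """Constrain to symmetric positive-definite matrices."""

    def check(self, value):
        # Inherit the symmetry check, then verify positive definiteness.
        # cholesky_ex reads only the lower triangle, so its result is
        # meaningful only once symmetry has been established.
        sym_ok = super().check(value)
        return sym_ok & torch.linalg.cholesky_ex(value).info.eq(0)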

@lezcano (Collaborator) left a comment

I think that makes sense. Now, I do not maintain this part of PyTorch, so I would not be able to accept such a change in good faith; someone else would need to review it (cc @jbschlosser to find someone who could?).

I have proposed below how to implement that approach.

Review thread on torch/distributions/constraints.py (outdated, resolved)
Co-authored-by: Lezcano <Lezcano@users.noreply.github.com>
@nonconvexopt (Contributor, Author) commented

> I think that makes sense. […] I have proposed below how to implement that approach.

Thanks for your feedback. I will open a separate PR for inheritable_torch.distributions.constraints_for_matrices if that is clearer.

@mruberry (Collaborator) commented

I think @fritzo is our distributions maintainer?

@fritzo (Collaborator) commented Nov 23, 2021

Hi @mruberry I'm pretty busy over the next month, any chance @neerajprad has time to review these PRs?

@neerajprad (Contributor) commented

> Hi @mruberry I'm pretty busy over the next month […]

Thanks for tagging, I missed this. I'll take a look.

@neerajprad (Contributor) commented

> Thanks for your feedback. I will open a separate PR […]

There are definitely constraints that assume that the matrix corresponding to the event dims is square, so it is good to explicitly check for that (see the sketch below). Feel free to update this PR with your proposed change.
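
A minimal sketch of such an explicit squareness check (illustrative only; the shape test is a plain Python bool expanded over the batch dims so that check() returns the usual boolean tensor):

import torch
from torch.distributions.constraints import Constraint

class _Square(Constraint):
    """Constrain to square matrices."""
    event_dim = 2

    def check(self, value):
        return torch.full(
            value.shape[:-2],
            value.shape[-2] == value.shape[-1],
            dtype=torch.bool,
            device=value.device,
        )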

@neerajprad (Contributor) left a comment

Looks great overall! I just had one comment about the test. Thanks for contributing to PyTorch. It will be good to also get a go-ahead from @lezcano.

Review thread on test/distributions/test_constraints.py (resolved)
@neerajprad (Contributor) left a comment

I just had one small comment. Looks good to me if unit tests pass!

@lezcano (Collaborator) left a comment

Just left a small comment. Besides that, LGTM.

    pytest.param(True, marks=pytest.mark.skipif(not TEST_CUDA,
                                                reason='CUDA not found.'))])
def test_constraint(constraint_fn, result, value, is_cuda):
    t = torch.cuda.DoubleTensor if is_cuda else torch.DoubleTensor
@lezcano (Collaborator) left a comment

I believe that these constructors are deprecated. The correct way of writing this code would be to pass a device to this function and to do torch.tensor(data, dtype=torch.double, device=device).
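
For example, the test could be parametrized over a device string instead of an is_cuda flag (a sketch of the suggested style, not the actual contents of test_constraints.py):

import pytest
import torch

TEST_CUDA = torch.cuda.is_available()

@pytest.mark.parametrize('device', [
    'cpu',
    pytest.param('cuda', marks=pytest.mark.skipif(not TEST_CUDA,
                                                  reason='CUDA not found.')),
])
def test_positive_definite_constraint(device):
    # torch.tensor with explicit dtype/device replaces the deprecated
    # torch.DoubleTensor / torch.cuda.DoubleTensor constructors.
    value = torch.tensor([[2., 0.], [0., 2.]], dtype=torch.double, device=device)
    const = torch.distributions.constraints.positive_definite
    assert const.check(value).all()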

@nonconvexopt (Contributor, Author) replied

I will try to correct it in a new PR, since we would have to modify all the functions in test/distributions/test_constraints.py. Thanks for your feedback.

@facebook-github-bot (Contributor) commented

@neerajprad has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@nonconvexopt nonconvexopt deleted the debug_positive_definite_constraints branch December 4, 2021 13:18
facebook-github-bot pushed a commit that referenced this pull request Dec 9, 2021
…puts. (#69069)

Summary:
While implementing #68720, we found out empirically that `torch.cholesky_inverse` supports batched inputs, but this is not explained in the docs: [link](#68720 (review))
`torch.cholesky_inverse` is implemented in #50269 and the doc was updated in #31275 but not merged.
neerajprad

Pull Request resolved: #69069

Reviewed By: mrshenli

Differential Revision: D32979362

Pulled By: neerajprad

fbshipit-source-id: 0967c969434ce6e0ab15889c240149c23c0bce44
PaliC added a commit that referenced this pull request Dec 10, 2021
…puts. (#69069) (same summary and trailers as above) [ghstack-poisoned]

PaliC added a commit that referenced this pull request Dec 10, 2021
…puts. (#69069) (same summary and trailers as above)

desertfire pushed a commit that referenced this pull request Dec 13, 2021
…puts. (#69069) (same summary and trailers as above)

desertfire pushed a commit that referenced this pull request Dec 14, 2021
…puts. (#69069) (same summary and trailers as above)
facebook-github-bot pushed a commit that referenced this pull request Dec 21, 2021
Summary:
Fixes #68050

TODO:
- [x] Unit Test
- [x] Documentation
- [x] Change the constraints of matrix variables to 'torch.distributions.constraints.symmetric' once #68720 is reviewed and merged.

Pull Request resolved: #68588

Reviewed By: bdhirsh

Differential Revision: D33246843

Pulled By: neerajprad

fbshipit-source-id: 825fcddf478555235e7a66de0c18368c41e935cd
facebook-github-bot pushed a commit that referenced this pull request Dec 30, 2021
Summary:
Implement #68050
Reopened the previously merged and reverted PR #68588, worked on with neerajprad
cc neerajprad

Sorry for the confusion.

TODO:

- [x] Unit Test
- [x] Documentation
- [x] Change the constraints of matrix variables to 'torch.distributions.constraints.symmetric' once Debug positive definite constraints #68720 is reviewed and merged.

Pull Request resolved: #70377

Reviewed By: mikaylagawarecki

Differential Revision: D33355132

Pulled By: neerajprad

fbshipit-source-id: e968c0d9a3061fb2855564b96074235e46a57b6c
wconstab pushed a commit that referenced this pull request Jan 5, 2022
(same summary and trailers as the Dec 30 commit above)
Labels
cla signed, open source, triaged

8 participants