
Singular value decomposition of complex valued matrices does not match mathematical definition #45821

Closed
IvanYashchuk opened this issue Oct 4, 2020 · 12 comments


IvanYashchuk commented Oct 4, 2020

📚 Documentation

Now that torch.svd supports complex input on both CPU and CUDA, let's discuss the docs.
Currently, the SVD documentation fully agrees with the implementation: the function returns (U, S, V) such that input = U @ diag(S) @ V.T.
However, the mathematical definition of SVD is the decomposition of a matrix M such that M = U Σ V^*, where V^* is the conjugate transpose of V.

What is the opinion on this? Should the documentation and implementation be updated to match the math, so that A = U @ diag(S) @ V.T.conj()?

Link to torch.svd documentation.
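
For concreteness, a minimal sketch contrasting the two conventions (which of the two checks passes for torch.svd depends on the version's behavior, which is exactly the question here):

```python
import torch

# Sketch only: contrasts the two reconstruction conventions discussed above.
A = torch.randn(3, 3, dtype=torch.complex128)
U, S, V = torch.svd(A)
D = torch.diag(S).to(A.dtype)  # S is real; promote it for the complex matmul

print(torch.allclose(A, U @ D @ V.conj().t()))  # math convention: A = U @ diag(S) @ V^H
print(torch.allclose(A, U @ D @ V.t()))         # documented convention: A = U @ diag(S) @ V^T
```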

cc @ezyang @gchanan @zou3519 @bdhirsh @heitorschueroff @anjali411 @dylanbespalko @mruberry @jlin27 @vishwakftw @jianyuh @nikitaved @pearu @vincentqb

IvanYashchuk added the module: complex and module: linear algebra labels on Oct 4, 2020

pearu commented Oct 4, 2020

xref: #45063 - conjugate transpose operator feature request


pearu commented Oct 4, 2020

IIUC, the current SVD documentation uses the plain transpose because previously only real input was supported (and hence the conjugate transpose would have been unnecessary). With the new complex input support in SVD, I would suggest updating the SVD documentation to use the conjugate transpose, matching the mathematical definition of the general SVD. While doing so, also mention that when M is real, V will be real as well, and the conjugate transpose reduces to the plain transpose; a quick check is sketched below.
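
A minimal check of that last point (assuming a real dtype):

```python
import torch

# For real input, conj() is a no-op, so both conventions coincide.
M = torch.randn(4, 4)
print(torch.equal(M.t().conj(), M.t()))  # True: conjugation leaves real tensors unchanged
```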


ezyang commented Oct 5, 2020

Yes, please just generalize the docs; in this case it seems straightforward and the right thing to do.


nikitaved commented Oct 5, 2020

The current SVD implementation returns V that is silently conjugated. I guess it makes sense to fix that too, is that correct?

VitalyFedyunin added the module: docs and triaged labels on Oct 5, 2020

mruberry commented Oct 6, 2020

> The current SVD implementation returns V that is silently conjugated. I guess it makes sense to fix that too, is that correct?

As far as I understand what you're saying: yes.


antocuni commented Oct 12, 2020

I'd like to add my 2 cents to this discussion. Looking at the source code, it seems to me that the current behavior of torch.svd was not designed but is simply the byproduct of a bug. In particular, look at _create_U_S_VT:

static inline std::tuple<Tensor, Tensor, Tensor> _create_U_S_VT(const Tensor& input, bool some, bool compute_uv) {

As the name suggests, it should create a VT tensor, which is done here:

// VT should be a row-major or a batch of row-major matrices
Tensor VT_empty;

However, the comment is wrong: LAPACK's _gesdd function returns VT in column-major order.

The rest of the code keeps calling it VT; e.g., this is _svd_helper_cpu returning a tuple (U, S, VT):

return std::make_tuple(U_working_copy, S_working_copy, VT_working_copy);

However, since we are creating the tensor with the wrong strides, we are effectively returning VT.transpose(), which is V. To summarize:

  • numpy.linalg.svd returns VT
  • LAPACK returns VT
  • our implementation returns something called VT (but it isn't)

So it looks to me like the intention was to return VT as well, but because of the bug in _create_U_S_VT we started returning V, and instead of fixing the bug we simply "fixed" the documentation. The sketch below illustrates the strides mix-up.
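
A Python-level illustration (a sketch only, not the actual C++ code): the same buffer viewed with swapped strides is the transpose, which is how a column-major VT silently becomes V.

```python
import torch

# Reading a buffer with swapped strides silently yields the transpose.
buf = torch.arange(6.)
row_major = buf.as_strided((2, 3), (3, 1))  # shape (2, 3), row-major strides
col_major = buf.as_strided((3, 2), (1, 3))  # same data, column-major strides

print(torch.equal(col_major, row_major.t()))  # True: swapped strides == transpose
```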

There isn't much that we can do at this point without breaking compatibility, but this confusing/unexpected/nonstandard behavior of svd might indicate that we should deprecate it in favor of torch.linalg.svd, once it lands.

Note: in my WIP PR #45562 I fixed _create_U_S_VT by creating VT with the correct strides and calling transpose_() on it before returning, to keep backward compatibility.
In the same PR I am also going to fix the docs for torch.svd.

@mruberry

Sounds great, @antocuni! I agree we should consider deprecating torch.svd once torch.linalg.svd lands. It will also be confusing to have a method torch.Tensor.svd when both functions are in because users may not know which svd it refers to.

@antocuni

> It will also be confusing to have a method torch.Tensor.svd when both functions are in because users may not know which svd it refers to.

This is something which I didn't think about. What should Tensor.svd do if/when we remove torch.svd? Ideally it would do the same as torch.linalg.svd, but this will break compatibility.

> In the same PR I am also going to fix the docs for torch.svd.

FWIW, this is the commit which aims to fix the documentation:
97662da

@mruberry

> > It will also be confusing to have a method torch.Tensor.svd when both functions are in because users may not know which svd it refers to.
>
> This is something which I didn't think about. What should Tensor.svd do if/when we remove torch.svd? Ideally it would do the same as torch.linalg.svd, but this will break compatibility.

We should warn/deprecate for a release and then remove both functions for a release. Then we can consider adding torch.Tensor.svd back and mapping it to torch.linalg.svd.

antocuni reopened this on Oct 14, 2020
@antocuni

(sorry guys, I closed the issue by mistake, reopening it)


pulkin commented Nov 5, 2020

You probably know that numpy returns u, s, vh = svd(A) such that A = u @ np.diag(s) @ vh (s is a 1-D array of singular values). I would prefer to have this convention in pytorch, because many people use numpy for prototyping.
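
For reference, that convention in code:

```python
import numpy as np

# NumPy returns vh (= V^H) directly, so no conjugation is needed to reconstruct A.
A = np.random.randn(3, 3) + 1j * np.random.randn(3, 3)
u, s, vh = np.linalg.svd(A)
print(np.allclose(A, u @ np.diag(s) @ vh))  # True
```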

@antocuni

While working on PR #45562 we realized that we can't fix torch.svd without breaking existing code. OTOH, the upcoming torch.linalg.svd is completely compatible with numpy.
As discussed on the PR, the current plan is to document the current behavior of torch.svd and tell people to use torch.linalg.svd for new code. This is the commit which tentatively tries to improve the docs, but feel free to suggest a better wording:
abf9baf
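
For readers landing here later, a sketch of how the two interfaces ended up differing (written against later releases where both functions exist; the exact behavior is as documented there, not guaranteed by this thread):

```python
import torch

# torch.svd returns V; torch.linalg.svd returns Vh = V^H, matching NumPy.
A = torch.randn(4, 3, dtype=torch.complex64)

U1, S1, V1 = torch.svd(A)                               # A = U1 @ diag(S1) @ V1^H
U2, S2, Vh2 = torch.linalg.svd(A, full_matrices=False)  # A = U2 @ diag(S2) @ Vh2

D1 = torch.diag(S1).to(A.dtype)  # S is real; promote for the complex matmul
D2 = torch.diag(S2).to(A.dtype)
print(torch.allclose(A, U1 @ D1 @ V1.conj().t(), atol=1e-5))  # True
print(torch.allclose(A, U2 @ D2 @ Vh2, atol=1e-5))            # True
```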
