
Add Cholesky Decompostion #8202

Merged — 21 commits, merged into chainer:master on Nov 20, 2019

Conversation

UmashankarTriforce
Contributor

@UmashankarTriforce UmashankarTriforce commented Sep 30, 2019

This PR aims to add Cholesky decomposition, as per issue #1448.

Route taken to complete this feature:

  • Implement function
  • Write unit test cases
  • Document the function

@UmashankarTriforce UmashankarTriforce changed the title from "[WIP ]Add Cholesky Decompostion" to "[WIP] Add Cholesky Decompostion" on Sep 30, 2019
@toslunar
Member

toslunar commented Oct 1, 2019

Why don't you use numpy.linalg.cholesky?

@UmashankarTriforce
Contributor Author

UmashankarTriforce commented Oct 1, 2019

How is cupy.linalg.cholesky implemented? I'm using cupy.linalg.cholesky for backward computation. I checked the difference between the NumPy and SciPy implementations of cholesky.

scipy.linalg.cholesky is giving you the upper-triangular decomposition by default, whereas np.linalg.cholesky is giving you the lower-triangular version.

https://stackoverflow.com/questions/16699163/what-is-the-difference-between-cholesky-in-numpy-and-scipy
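The convention difference can be seen directly with a small NumPy-only sketch (the SciPy call is only described in a comment to keep the example dependency-free):

```python
import numpy as np

# A small symmetric positive-definite matrix.
A = np.array([[4.0, 2.0],
              [2.0, 3.0]])

# numpy.linalg.cholesky returns the *lower*-triangular factor L with A = L @ L.T.
L = np.linalg.cholesky(A)
assert np.allclose(L, np.tril(L))   # L is lower triangular
assert np.allclose(L @ L.T, A)      # L @ L.T reconstructs A

# scipy.linalg.cholesky(A) would instead return the *upper*-triangular factor
# (U = L.T) by default; pass lower=True to match NumPy's convention.
```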

@toslunar
Member

toslunar commented Oct 3, 2019

How is cupy.linalg.cholesky implemented?

cupy.linalg.cholesky decomposes a matrix into L @ L.H. The code https://github.com/cupy/cupy/blob/c9c7c4eedcca54e6f53b5dcfc205d4ad94a2b595/cupy/linalg/decomposition.py#L49-L54 has cublas.CUBLAS_FILL_MODE_UPPER, which looks inconsistent, but it isn't, because of the difference between row-major and column-major layouts.

A SciPy-compatible GPU implementation, if one exists, is available in the cupyx.scipy namespace.

I'm using cupy.linalg.cholesky for backward computation.

Inputs to FunctionNode.backward are Variables.

  • Use Chainer functions in backward, or
  • define class CholeskyGrad(FunctionNode) and leave double-backward unimplemented.

@UmashankarTriforce
Contributor Author

UmashankarTriforce commented Oct 7, 2019

Sorry for the late response!
There is no SciPy-compatible GPU implementation for cholesky. I plan to use cupy.linalg.cholesky in forward_gpu, and for backward I plan to take the dot product of L and L.T.conj(). Since chainer.dot is not implemented, I plan to use cupy.dot and wrap the array back into a chainer.Variable. What do you suggest?

Edit: I also found that the results of cupy.linalg.cholesky are consistent with NumPy's implementation, so I will switch to numpy for forward_cpu.

chainer/functions/math/cholesky.py (outdated review thread, resolved)

    def backward(self, indexes, gy):
        a, = self.get_retained_inputs().array
        return cuda.cupy.dot(a, a.T) * gy[0],
Member

Variable-level equivalent code would be

    a, = self.get_retained_inputs()
    return chainer.functions.matmul(a, a, transb=True),

@UmashankarTriforce
Contributor Author

Sorry for the late response! I have made the changes and also added test patterns.
I'm encountering an error when I run the test:
AttributeError: 'Cholesky' object has no attribute 'get_retrained_inputs'

How do I solve this?

@UmashankarTriforce
Contributor Author

When I run the tests, in some cases the gradient computed by backward is absurdly larger than the numeric gradient. For example:

>           raise AssertionError(f.getvalue())
E           chainer.testing.function_link.FunctionTestError: Parameterized test failed.
E           
E           Base test method: TestCholesky_use_chainerx_false__chainerx_device_None__use_cuda_false__cuda_device_None__use_cudnn_never__cudnn_deterministic_false__autotune_false__cudnn_fast_batch_normalization_false__use_ideep_never.test_backward
E           Test parameters:
E             dtype: <class 'numpy.float32'>
E             shape: (2, 2)
E           
E           
E           (caused by)
E           FunctionTestError: backward is not implemented correctly
E           
E           (caused by)
E           AssertionError: check_backward failed (eps=0.001 atol=0.001 rtol=0.001)
E           inputs[0]:
E           [[7670.438  1473.2218]
E            [1473.2218 6982.2075]]
E           grad_outputs[0]:
E           [[-0.70425826  0.97613996]
E            [ 0.5500437  -0.09768751]]
E           directions[0]:
E           [[-0.7262568  -0.0971007 ]
E            [-0.21805311  0.64465135]]
E           gradients (numeric):  0.0015698603367818734
E           gradients (backward): 23360998.461244226
E           
E           x: numeric gradient, y: backward gradient
E           Not equal to tolerance rtol=0.001, atol=0.001
E           
E           Mismatch: 100%
E           Max absolute difference: 23360998.45967437
E           Max relative difference: 1.
E            x: array(0.00157)
E            y: array(23360998.461244)
E           
E           assert_allclose failed: 
E             shape: () ()
E             dtype: float64 float64
E             i: (0,)
E             x[i]: 0.0015698603367818734
E             y[i]: 23360998.461244226
E             relative error[i]: 0.9999999999328
E             absolute error[i]: 23360998.459674366
E             relative tolerance * |y[i]|: 23360.998461244228
E             absolute tolerance: 0.001
E             total tolerance: 23360.999461244228
E           x: 0.00156986
E           y: 23360998.46124423
How do you suggest I fix this issue?

@toslunar
Member

The scale in the CuPy test is large because integer types are tested. To test backprop, scale=(1e-2, 2.0) looks good to me.

@UmashankarTriforce
Contributor Author

Yes, I have made the necessary changes, but the mismatch still exists (it's of a lower magnitude now, but still above the tolerance).

chainer/functions/math/cholesky.py (outdated review thread, resolved)

    def backward(self, indexes, gy):
        a, = self.get_retained_inputs()
        return functions.matmul(a, a, transb=True) * gy[0],
Member

My fix is available at UmashankarTriforce#1. Could you merge it?
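The fix itself lives in the linked branch and is not shown inline. For context, `matmul(a, a, transb=True) * gy[0]` does not match the numeric gradient because the reverse-mode rule for the Cholesky factor is more involved; the standard formula (see e.g. Iain Murray, "Differentiation of the Cholesky decomposition", 2016) can be sketched in plain NumPy and checked against finite differences. The names below are illustrative, not the PR's code:

```python
import numpy as np

def cholesky_backward(L, L_bar):
    """Given L = cholesky(A) and L_bar = dloss/dL (lower triangular),
    return A_bar = dloss/dA for symmetric positive-definite A = L @ L.T."""
    # P = Phi(L^T @ L_bar): take the lower triangle and halve the diagonal.
    P = np.tril(L.T @ L_bar)
    P -= 0.5 * np.diag(np.diag(P))
    # A_bar = L^{-T} @ P @ L^{-1} via two triangular solves,
    # then symmetrized because A is constrained to be symmetric.
    S = np.linalg.solve(L.T, P)            # L^{-T} @ P
    A_bar = np.linalg.solve(L.T, S.T).T    # (L^{-T} @ P) @ L^{-1}
    return 0.5 * (A_bar + A_bar.T)

# Finite-difference check on a random SPD matrix.
rng = np.random.RandomState(0)
n = 3
B = rng.randn(n, n)
A = B @ B.T + n * np.eye(n)                # symmetric positive definite
G = np.tril(rng.randn(n, n))               # upstream gradient w.r.t. L
L = np.linalg.cholesky(A)
A_bar = cholesky_backward(L, G)

eps = 1e-6
E = rng.randn(n, n)
E = 0.5 * (E + E.T)                        # symmetric perturbation direction
num = (np.sum(G * np.linalg.cholesky(A + eps * E)) -
       np.sum(G * np.linalg.cholesky(A - eps * E))) / (2 * eps)
assert np.isclose(num, np.sum(A_bar * E), rtol=1e-4, atol=1e-6)
```

The final assertion is exactly the check that was failing above: the directional derivative from central differences must agree with the analytic gradient contracted against the same symmetric direction.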

Contributor Author

Yes, it's done.

chainer/functions/math/cholesky.py (outdated review thread, resolved)
chainer/functions/math/cholesky.py (outdated review thread, resolved)
UmashankarTriforce and others added 7 commits October 18, 2019 09:50
Co-Authored-By: Toshiki Kataoka <tos.lunar@gmail.com>
Co-Authored-By: Toshiki Kataoka <tos.lunar@gmail.com>
Co-Authored-By: Toshiki Kataoka <tos.lunar@gmail.com>
Member

@toslunar toslunar left a comment

    def forward_chainerx(self, inputs):
        return chainerx.linalg.cholesky(*inputs),

@UmashankarTriforce
Contributor Author

UmashankarTriforce commented Oct 24, 2019

I have addressed points 1 and 3 from your review. I will wait for #8312 and make changes accordingly.

Member

@toslunar toslunar left a comment

#8312 is merged. Could you align F.cholesky with chainerx.linalg.cholesky?

chainer/functions/math/cholesky.py (outdated review thread, resolved)
        return y_expect.astype(self.dtype),

    def forward(self, inputs, device):
        a, = inputs
Member

Insert a = 0.5 * (a + a.T) to make the perturbation symmetric, like the test code for chainerx.linalg.cholesky:

    def forward_xp(self, inputs, xp):
        a, = inputs
        if _numpy_does_not_support_0d_input113 and a.size == 0:
            pytest.skip('Older NumPy versions do not work with empty arrays')
        # Input has to be symmetrized for backward test to work
        a = (a + a.T) / 2. + 1e-3 * xp.eye(*self.shape)
        L = xp.linalg.cholesky(a)
        return L,

It can also be tested that the returned gradient is symmetric. (Not mandatory in this PR.)
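The symmetrize-and-jitter trick can be seen in isolation with a NumPy-only sketch (illustrative values, not the PR's test parameters): starting from an SPD base, an asymmetric finite-difference perturbation is folded back into a symmetric, still positive-definite matrix, so the Cholesky factorization in the numeric-gradient loop never fails.

```python
import numpy as np

rng = np.random.RandomState(0)
n = 3
B = rng.randn(n, n)
base = B @ B.T + n * np.eye(n)     # symmetric positive-definite base input
noise = 1e-3 * rng.randn(n, n)     # an *asymmetric* perturbation, as in check_backward
a = base + noise                   # raw perturbed input is no longer symmetric

# Symmetrize and add a small diagonal jitter before factorizing.
a_sym = 0.5 * (a + a.T) + 1e-3 * np.eye(n)

assert np.allclose(a_sym, a_sym.T)             # symmetric by construction
assert np.all(np.linalg.eigvalsh(a_sym) > 0)   # still positive definite
L = np.linalg.cholesky(a_sym)                  # so the factorization succeeds
assert np.allclose(L @ L.T, a_sym)
```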

Member

Could you also fix TestCholesky.forward? xp means numpy or cupy (or chainerx) in Chainer.

Contributor Author

Yes, I have added this fix.

@toslunar toslunar added the st:awaiting-author State indicating that response is needed from contributors, often authors of pull request. label Nov 8, 2019
Co-Authored-By: Toshiki Kataoka <tos.lunar@gmail.com>
@UmashankarTriforce
Contributor Author

I'm extremely sorry for the late response 🙁 I had my exams in college and hence wasn't able to work on this. Have a look and let me know if any more modifications are necessary 🙂

chainer/functions/math/cholesky.py (outdated review thread, resolved)
Co-Authored-By: Toshiki Kataoka <tos.lunar@gmail.com>
@toslunar toslunar added cat:feature Implementation that introduces new interfaces. and removed st:awaiting-author State indicating that response is needed from contributors, often authors of pull request. labels Nov 14, 2019
Member

@toslunar toslunar left a comment

LGTM

@toslunar
Member

Jenkins & flexCI, test this please.

@chainer-ci
Member

Jenkins CI test (for commit a496ded, target branch master) failed with status FAILURE.

@UmashankarTriforce
Contributor Author

The error seems to be from ChainerX

@toslunar toslunar changed the title from "[WIP] Add Cholesky Decompostion" to "Add Cholesky Decompostion" on Nov 20, 2019
@toslunar toslunar added this to the v7.0.0 milestone Nov 20, 2019
@toslunar toslunar merged commit a2c63c8 into chainer:master Nov 20, 2019
@UmashankarTriforce UmashankarTriforce deleted the cholesky branch November 20, 2019 11:55