Generalize `aesara.tensor.linalg.cholesky` beyond 2D arrays #1012

purna135 · 2022-06-24T09:11:11Z

Thank you for opening a PR!

Here are a few important guidelines and requirements to check before your PR can be merged:

There is an informative high-level description of the changes.
The description and/or commit message(s) references the relevant GitHub issue(s).
pre-commit is installed and set up.
The commit messages follow these guidelines.
The commits correspond to relevant logical changes, and there are no commits that fix changes introduced by other commits in the same branch/BR.
There are tests covering the changes introduced in the PR.

Don't worry, your PR doesn't need to be in perfect order to submit it. As development progresses and/or reviewers request changes, you can always rewrite the history of your feature/PR branches.

If your PR is an ongoing effort and you would like to involve us in the process, simply make it a draft PR.

ricardoV94 · 2022-06-25T04:32:42Z

How does the numpy implementation perform compared to scipy (including the lower version that is being removed)?

If it's vastly different we could keep using it for 2D cases. For higher cases is numpy faster than a Python loop calling the scipy function?

ricardoV94 · 2022-06-25T04:36:29Z

Besides performance concerns, the other thing you need to check is if there are any rewrites in Aesara that rely on the assumption that this Op is always 2-dimensional.

If that's the case you might need to add a new check in the rewrites or generalize them to be higher-dimensional if that's already possible (and open a Aesara issue if not)

ricardoV94 · 2022-06-25T04:38:37Z

tests/tensor/test_slinalg.py

@@ -451,23 +421,6 @@ def test_solve_dtype(self):
            assert x.dtype == x_result.dtype


-def test_cho_solve():


Why is this test removed?

ricardoV94 · 2022-06-25T04:40:12Z

aesara/tensor/slinalg.py



 cho_solve = CholeskySolve()


-def cho_solve(c_and_lower, b, check_finite=True):
+def cho_solve(a, b):


This needs some care, as it will simply break people's code.

ricardoV94 · 2022-06-25T04:43:15Z

aesara/tensor/slinalg.py

-            check_finite=self.check_finite,
-        )
+        out_dtype = aes.upcast(a.dtype, b.dtype)
+        broadcastable = [


We can do better than broadcastable. Have a look at https://aesara.readthedocs.io/en/latest/extending/type.html

Sayam753 · 2022-06-26T05:08:07Z

aesara/tensor/slinalg.py

@@ -39,10 +40,9 @@ class Cholesky(Op):
    # TODO: for specific dtypes
    # TODO: LAPACK wrapper with in-place behavior, for solve also

-    __props__ = ("lower", "destructive", "on_error")
+    __props__ = ("destructive", "on_error")


Do we need destructive attribute here?

According to the Op docs, __props__ lists the attributes which influence the computation performed. And I am not sure how destructive attribute is used in perform method.

My guess is that this was added for some functionality that was never implemented (e.g., a faster method that alters the input variables in place). Looking around the library I don't think this is being used anywhere so maybe we can remove it.

ricardoV94 · 2022-06-26T05:35:30Z

By the way, the rewrites involving linalg Ops are found here: https://github.com/aesara-devs/aesara/blob/main/aesara/sandbox/linalg/ops.py

We should probably bring them over to this module (CC @brandonwillard).

Importantly, some rewrites may need to be updated with the new Op's signatures... and the same performance concerns should be at least briefly investigated.

brandonwillard · 2022-07-08T00:35:08Z

By the way, the rewrites involving linalg Ops are found here: https://github.com/aesara-devs/aesara/blob/main/aesara/sandbox/linalg/ops.py

We should probably bring them over to this module (CC @brandonwillard).

Importantly, some rewrites may need to be updated with the new Op's signatures... and the same performance concerns should be at least briefly investigated.

See #499.

refactored CholeskySolve

ecb65df

purna135 marked this pull request as draft June 24, 2022 09:11

ricardoV94 reviewed Jun 25, 2022

View reviewed changes

Sayam753 reviewed Jun 26, 2022

View reviewed changes

brandonwillard changed the title ~~Generalize linalg.cholesky beyond 2D arrays~~ Generalize aesara.tensor.linalg.cholesky beyond 2D arrays Jul 7, 2022

brandonwillard added enhancement New feature or request new operator refactor This issue involves refactoring Op implementation Involves the implementation of an Op and removed new operator labels Jul 7, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalize `aesara.tensor.linalg.cholesky` beyond 2D arrays #1012

Generalize `aesara.tensor.linalg.cholesky` beyond 2D arrays #1012

purna135 commented Jun 24, 2022

ricardoV94 commented Jun 25, 2022 •

edited

ricardoV94 commented Jun 25, 2022

ricardoV94 Jun 25, 2022

ricardoV94 Jun 25, 2022

ricardoV94 Jun 25, 2022

Sayam753 Jun 26, 2022

ricardoV94 Jun 26, 2022

ricardoV94 commented Jun 26, 2022

brandonwillard commented Jul 8, 2022

		@@ -451,23 +421,6 @@ def test_solve_dtype(self):
		assert x.dtype == x_result.dtype


		def test_cho_solve():

Generalize aesara.tensor.linalg.cholesky beyond 2D arrays #1012

Are you sure you want to change the base?

Generalize aesara.tensor.linalg.cholesky beyond 2D arrays #1012

Conversation

purna135 commented Jun 24, 2022

ricardoV94 commented Jun 25, 2022 • edited

ricardoV94 commented Jun 25, 2022

ricardoV94 Jun 25, 2022

Choose a reason for hiding this comment

ricardoV94 Jun 25, 2022

Choose a reason for hiding this comment

ricardoV94 Jun 25, 2022

Choose a reason for hiding this comment

Sayam753 Jun 26, 2022

Choose a reason for hiding this comment

ricardoV94 Jun 26, 2022

Choose a reason for hiding this comment

ricardoV94 commented Jun 26, 2022

brandonwillard commented Jul 8, 2022

Generalize `aesara.tensor.linalg.cholesky` beyond 2D arrays #1012

Generalize `aesara.tensor.linalg.cholesky` beyond 2D arrays #1012

ricardoV94 commented Jun 25, 2022 •

edited