fix matrix_exp_multiply accuracy bug #2619

yizhang-yiz · 2021-11-23T16:42:27Z

Summary

Fix #2529 by re-implementing the prim version of the function.

Tests

test matrix_exp_multiply_handle components that calculate the taylor expansion order and helper function approximating matrix power norm.
test matrices from bug in scale_matrix_exp_multiply #2529.

Side Effects

n/a

Release notes

Bugfix: matrix_exp_multiply function's accuracy issue.

Checklist

Math issue #(issue number)
Copyright holder: Metrum Research Group.
By submitting this pull request, the copyright holder is agreeing to the license the submitted work under the following licenses:
- Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
- Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
the basic tests are passing
- unit tests pass (to run, use: ./runTests.py test/unit)
- header checks pass, (make test-headers)
- dependencies checks pass, (make test-math-dependencies)
- docs build, (make doxygen)
- code passes the built in C++ standards checks (make cpplint)
the code is written in idiomatic C++ and changes are documented in the doxygen
the new changes are tested

…4.1 (tags/RELEASE_600/final)

…_issue_2529

…4.1 (tags/RELEASE_600/final)

yizhang-yiz · 2021-11-29T18:05:06Z

I'm not sure how to interpret the failure of expression test (tests7_test.cpp). @rok-cesnovar can you take a look?

rok-cesnovar · 2021-11-29T18:17:52Z

If I run it locally for the matrix_exp_multiply only (python3 runTests.py test/expressions/ --only-function=matrix_exp_multiply, I can see the following error:

Running main() from lib/benchmark_1.5.1/googletest/googletest/src/gtest_main.cc
[==========] Running 3 tests from 3 test suites.
[----------] Global test environment set-up.
[----------] 1 test from ExpressionTestPrim
[ RUN      ] ExpressionTestPrim.matrix_exp_multiply0
test/expressions/tests0_test.cpp:24: Failure
Expected: (matrix1_expr3_counter) <= (int9), actual: 2 vs 1
[  FAILED  ] ExpressionTestPrim.matrix_exp_multiply0 (0 ms)
[----------] 1 test from ExpressionTestPrim (0 ms total)

It seemed as if the second matrix would be an Eigen expression it would be evaluated twice. And it was, see last commit for the fix.

rok-cesnovar

I honestly have zero knowledge of these algorithms, so cant review the changes in the sense of correctness, here are just a few general code-related comments.

stan/math/prim/fun/matrix_exp_action_handler.hpp

yizhang-yiz · 2021-11-30T19:00:00Z

I was able to reproduce the expression test failure on linux but not on mac. @rok-cesnovar can you take another look? Thanks.

rok-cesnovar · 2021-11-30T21:41:36Z

Will check tomorrow.

rok-cesnovar · 2021-12-02T08:07:27Z

I am unable to replicate locally (even with clang++-6.0) on Ubuntu.

My next guess would be that the instance runs out of memory compiling 15 tests in parallel or something along these lines. Not sure why that would happen after this change, but who knows. Decreasing the parallel -j setting in the tests should be simple to try.

We are in the middle of a move of our testing infrastructure, so there will be a bit of downtime or a period of unrelated issues popping up, so testing on the actual instances will be a bit difficult in the next few day, but will make sure to test this as soon we are back up.

yizhang-yiz · 2021-12-02T15:51:00Z

@rok-cesnovar thanks. We can continue after the migration.

…4.1 (tags/RELEASE_600/final)

rok-cesnovar · 2021-12-20T18:58:54Z

Getting back to this finally.

For the GHA error with Windows tests, I think what we need to do is Add the CXX flag for only running rev tests with the test_ad #2630

I would prefer closing Moving to C++14 #2489 and bumping the minimum g++, but that seems unlikely to happen very soon (see thread).
For the Jenkins tests, I am going to debug on this PR. I am going to force-push a bit. I apologize in advance for the spam e-mails and force pushing, but I just don't see another way of getting to the bottom of this, seeing was no one can reproduce this one locally.

stan/math/prim/fun/matrix_exp_action_handler.hpp

syclik · 2023-05-19T14:46:33Z

stan/math/prim/fun/matrix_exp_action_handler.hpp

+    // L1 norm
+    double normA = mat.colwise().template lpNorm<1>().maxCoeff();
+
+    if (normA < tol || t < tol) {
      m = 0;
      s = 1;


I'd want to eagerly return here so it's clear that there's no more that happens below it.

That will unnest the lower else statement and it'll be slightly easier to read.

syclik · 2023-05-19T14:47:13Z

stan/math/prim/fun/matrix_exp_action_handler.hpp

+      const std::vector<double>& theta = theta_m_double;
+      Eigen::VectorXd alpha(p_max - 1);
+
+      if (normA


for efficiency, I'd move the / (m_max * b.cols()) to the left hand side by * (m_max * b.cols()).

syclik · 2023-05-19T14:48:32Z

stan/math/prim/fun/matrix_exp_action_handler.hpp

-      m = m_max;
+      const std::vector<double>& theta = theta_m_double;
+      Eigen::VectorXd alpha(p_max - 1);
+


I would put whitespace above 185 and remove the whitespace in 186.

The output of the if statement here is alpha. We want to keep that declaration close together if possible for readability.

stan/math/prim/fun/matrix_exp_action_handler.hpp

syclik · 2023-05-19T21:09:52Z

@yizhang-yiz, for what it's worth, it's implemented really cleanly relative to the paper. The naming convention follows and is close. It looks right to me.

syclik

The implementation looks good to me.

syclik

lgtm

yizhang-yiz · 2023-05-19T22:40:27Z

@syclik I completely let this slip through. Thanks for fixing the PR!

syclik · 2023-05-22T12:04:48Z

Looks like the "fix" I introduced causes it to have linking issues with multiple translation units.

From Jenkins https://jenkins.flatironinstitute.org/blue/organizations/jenkins/Stan%2FMath/detail/PR-2619/11/pipeline/214:

I believe this is real. I'll try to address it in a different way.

syclik · 2023-05-22T13:21:49Z

I'm looking at the intent of the code and I think the optimization is too much effort relative to the difficulty in getting it working reliably across C++ versions.

To be specific, there are two major optimizations happening. The first is that the variable is a static class member variable. I believe the idea is that on multiple uses of the class these constants don't have to be reallocated with the class. This, in and of itself, is great, but it interacts with the second thing.

The second is the use of constexpr and there are some differences in the C++ across the pre C++-17 / C++-17 boundary. This interacts with the static keyword in a way that requires the definition of the static member variable. And this is what is the last error posted -- multiple translation units can not have the same definition. We could define the variable in a central place like wherever we define the int main() function, but we're not really set up to do this in the math library well.

So... I think we can simplify this a bit and lose this optimization. Note: this is for 3 primitives only. I doubt it'd be that much of a difference in practice.

syclik · 2023-05-22T13:22:39Z

Also, this would be dominated by the allocation of the theta_m_single and theta_m_double variables.

…renaming for consistency

Yi Zhang added 2 commits November 22, 2021 15:53

revamp matrix_exp_multiply & more unit test matrices

381ad68

fix header

25a7d8b

yizhang-yiz mentioned this pull request Nov 23, 2021

bug in scale_matrix_exp_multiply #2529

Closed

stan-buildbot and others added 8 commits November 23, 2021 16:43

[Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

97fc786

…4.1 (tags/RELEASE_600/final)

add matrix model from #1146 to unit test.

c15984a

[Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

564757e

…4.1 (tags/RELEASE_600/final)

linting

4b78bf6

Merge remote-tracking branch 'stan-dev/bugfix_issue_2529' into bugfix…

fb56049

…_issue_2529

[Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

49dc3e5

…4.1 (tags/RELEASE_600/final)

fix linting

bf2e745

fix unit test

f708734

fix multiple evals of b

1081352

rok-cesnovar reviewed Nov 29, 2021

View reviewed changes

misc review comments

ea55ec7

Yi Zhang and others added 4 commits December 3, 2021 09:53

additional unit tests for expm of large condition number

fac3d0c

Merge commit 'a5e40e7633aabae9d25f083569fccd3268b79fde' into HEAD

190a2c0

[Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

f0f462f

…4.1 (tags/RELEASE_600/final)

Merge branch 'develop' into bugfix_issue_2529

a53874a

rok-cesnovar reviewed Dec 21, 2021

View reviewed changes

stan/math/prim/fun/matrix_exp_action_handler.hpp Show resolved Hide resolved

rok-cesnovar force-pushed the bugfix_issue_2529 branch from 8ed5a59 to a53874a Compare December 23, 2021 18:11

rok-cesnovar and others added 3 commits December 23, 2021 19:11

Merge branch 'develop' into bugfix_issue_2529

deccced

Merge branch 'develop' into bugfix_issue_2529

082f7b6

matrix_exp_action_handler: adding definition of constexpr variables

8d9bbad

syclik reviewed May 19, 2023

View reviewed changes

stan/math/prim/fun/matrix_exp_action_handler.hpp Outdated Show resolved Hide resolved

syclik and others added 11 commits May 19, 2023 16:51

matrix_exp_action_handler: set_approx_order -- eager return

d018c4c

matrix_exp_action_handler: set_approx_order -- adjust condition

9ea04de

matrix_exp_action_handler: set_approx_order -- replace theta

079c66e

matrix_exp_action_handler: set_approx_order -- consistent indexing

74df8a5

matrix_exp_action_handler: set_approx_order -- simplifying code

e40da15

matrix_exp_action_handler: set_approx_order -- simplifying code

35c259b

matrix_exp_action_handler: set_approx_order -- replacing auto with int

f9476a1

matrix_exp_action_handler: set_approx_order -- using LinSpaced to set u

df93668

matrix_exp_action_handler: set_approx_order -- returning special case

8feefd4

[Jenkins] auto-formatting by clang-format version 10.0.0-4ubuntu1

42228ef

matrix_exp_action_handler: adding one more definition

f4b6c42

syclik previously approved these changes May 19, 2023

View reviewed changes

matrix_exp_action_handler: removing auto

8f1c923

syclik dismissed their stale review via 8f1c923 May 19, 2023 21:26

syclik previously approved these changes May 19, 2023

View reviewed changes

Merge branch 'develop' into bugfix_issue_2529

f46628b

matrix_exp_action_handler: converting static constexpr to const int, …

3e12a84

…renaming for consistency

syclik dismissed their stale review via 3e12a84 May 22, 2023 13:55

syclik approved these changes May 23, 2023

View reviewed changes

syclik merged commit 1871303 into develop May 23, 2023
8 checks passed

syclik deleted the bugfix_issue_2529 branch May 23, 2023 12:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix matrix_exp_multiply accuracy bug #2619

fix matrix_exp_multiply accuracy bug #2619

yizhang-yiz commented Nov 23, 2021

yizhang-yiz commented Nov 29, 2021

rok-cesnovar commented Nov 29, 2021

rok-cesnovar left a comment

yizhang-yiz commented Nov 30, 2021

rok-cesnovar commented Nov 30, 2021

rok-cesnovar commented Dec 2, 2021

yizhang-yiz commented Dec 2, 2021

rok-cesnovar commented Dec 20, 2021

syclik May 19, 2023

syclik May 19, 2023

syclik May 19, 2023

syclik commented May 19, 2023

syclik left a comment

syclik left a comment

yizhang-yiz commented May 19, 2023

syclik commented May 22, 2023

syclik commented May 22, 2023

syclik commented May 22, 2023

fix matrix_exp_multiply accuracy bug #2619

fix matrix_exp_multiply accuracy bug #2619

Conversation

yizhang-yiz commented Nov 23, 2021

Summary

Tests

Side Effects

Release notes

Checklist

yizhang-yiz commented Nov 29, 2021

rok-cesnovar commented Nov 29, 2021

rok-cesnovar left a comment

Choose a reason for hiding this comment

yizhang-yiz commented Nov 30, 2021

rok-cesnovar commented Nov 30, 2021

rok-cesnovar commented Dec 2, 2021

yizhang-yiz commented Dec 2, 2021

rok-cesnovar commented Dec 20, 2021

syclik May 19, 2023

Choose a reason for hiding this comment

syclik May 19, 2023

Choose a reason for hiding this comment

syclik May 19, 2023

Choose a reason for hiding this comment

syclik commented May 19, 2023

syclik left a comment

Choose a reason for hiding this comment

syclik left a comment

Choose a reason for hiding this comment

yizhang-yiz commented May 19, 2023

syclik commented May 22, 2023

syclik commented May 22, 2023

syclik commented May 22, 2023