@chriselrod
Contributor

For huge duals, we probably don't want to unroll much at all.
For small duals, unrolling more helps.

The goal is to keep up with SMatrix performance. This does well in microbenchmarks, but in the real workload it is still several times slower, with 80% GC time vs 15% GC time for the SMatrix version. =/

@YingboMa
Member

Looks like the tests need an update, too.

@ChrisRackauckas merged commit 8f93399 into SciML:master on Apr 12, 2021