
Tridiagonal matrices and associated spmv kernel #957

Open · wants to merge 12 commits into master

Conversation

@loiseaujc commented Mar 20, 2025

This PR is the first of a series to port what I had been doing in SpecialMatrices to stdlib. This first one provides the base type and spmv kernel for Tridiagonal matrices. At the moment, it is included in the stdlib_sparse_kinds and stdlib_sparse_spmv files for simplicity. It could also be (yet) another module; I don't have a strong opinion about that.

Proposed interface

  • A = Tridiagonal(dl, dv, du) where dl, dv and du are rank-1 arrays describing the tridiagonal elements.
  • call spmv(A, x, y [, alpha] [, beta] [, op]) for the matrix-vector product.
  • dense(A) to convert a Tridiagonal matrix to its standard rank-2 array representation.
  • transpose(A) to compute the transpose of a Tridiagonal matrix.
  • hermitian(A) to compute the Hermitian of a Tridiagonal matrix.
  • + and - are overloaded for addition and subtraction of two Tridiagonal matrices.
  • * is overloaded for scalar-matrix multiplication.
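
To make the proposed interface concrete, here is a minimal usage sketch. The `Tridiagonal_dp_type` name and the `stdlib_specialmatrices` module are assumptions (the module name only comes up later in this thread); everything else follows the bullet list above.

```fortran
program demo_tridiagonal
    use stdlib_kinds, only: dp
    use stdlib_specialmatrices   ! module name assumed, see the discussion below
    implicit none
    integer, parameter :: n = 5
    type(Tridiagonal_dp_type) :: A   ! concrete type name is hypothetical
    real(dp) :: dl(n-1), dv(n), du(n-1), x(n), y(n)

    ! Classic 1D Laplacian stencil on the three diagonals.
    dl = 1.0_dp ; dv = -2.0_dp ; du = 1.0_dp
    A = Tridiagonal(dl, dv, du)   ! constructor from the proposed interface

    x = 1.0_dp
    call spmv(A, x, y)            ! y = A*x via the lagtm-backed kernel
    print *, y
    print *, dense(A)             ! standard rank-2 array representation
end program demo_tridiagonal
```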

Key facts

  • The spmv kernel uses the lagtm LAPACK backend under the hood.
  • Tridiagonal matrices of all the kinds provided by stdlib can be created.
  • The kind generality of the spmv kernel has the same limitations as stdlib_linalg_lapack.

Progress

  • Base implementation
  • Tests
  • Documentation
  • Examples

cc @jvdp1, @jalvesz, @perazz

@loiseaujc (Author) commented:

I got one build to work; that's encouraging. Everything works fine locally when using gfortran 14.2.0, though. I'll try to test things over the weekend with older versions of gfortran to see exactly what's going on.

On a different note: the sparse matrix-vector product currently uses call spmv. What about exposing an extension to the intrinsic matmul? For many mundane tasks, it might be more natural to have y = matmul(A, x) in the code rather than call spmv(A, x, y, ...).

@jalvesz (Contributor) commented Mar 20, 2025

> On a different note: the sparse matrix-vector product currently uses call spmv. What about exposing an extension to the intrinsic matmul? For many mundane tasks, it might be more natural to have y = matmul(A, x) in the code rather than call spmv(A, x, y, ...).

Sure, it might be interesting to create a wrapper interface on top of the subroutine to have a matmul equivalent. Now, I personally avoid using matmul, and for that matter any function signature implying assignment to an array left-hand side, as it is difficult to control whether the compiler recycles the memory of the lhs or creates a temporary array. Subroutines allow better control in that regard. At the same time, having these interfaces might also be a good case study for such performance problems.
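
For illustration, a wrapper of the kind discussed here could look like the sketch below (a minimal sketch; the module, type, and kind names are assumptions, not this PR's actual code). The function result is exactly where the lhs/temporary-array question raised above arises.

```fortran
module tridiagonal_matmul_wrapper   ! hypothetical module, not part of this PR
    use stdlib_kinds, only: dp
    use stdlib_specialmatrices, only: Tridiagonal_dp_type, spmv   ! names assumed
    implicit none

    ! Extending the intrinsic matmul generic: references that do not match
    ! this specific procedure still resolve to the intrinsic matmul.
    interface matmul
        module procedure :: tridiagonal_matmul
    end interface

contains

    function tridiagonal_matmul(A, x) result(y)
        type(Tridiagonal_dp_type), intent(in) :: A
        real(dp), intent(in) :: x(:)
        real(dp) :: y(size(x))   ! function result: the compiler may create a temporary
        call spmv(A, x, y)       ! all the work still happens in the subroutine kernel
    end function tridiagonal_matmul
end module tridiagonal_matmul_wrapper
```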

@loiseaujc (Author) commented Mar 20, 2025

All tests are now passing. It was simply the all(y1 == y2) check that was too restrictive. Apparently, bit reproducibility is possible with gfortran 14.2.0 but not with the others; there is some floating-point error accumulation (which actually makes sense, I suppose).
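
For reference, a less restrictive check along these lines (an illustrative fragment, not the actual test code; y1, y2 and dp as in the tests) could be:

```fortran
! Tolerance scaled with the problem size, instead of bitwise equality.
real(dp) :: tol
logical  :: test_passed
tol = 10.0_dp * real(size(y2), dp) * epsilon(1.0_dp) * maxval(abs(y2))
test_passed = all(abs(y1 - y2) <= tol)
```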

> Sure, it might be interesting to create a wrapper interface on top of the subroutine to have a matmul equivalent. Now, I personally avoid using matmul, and for that matter any function signature implying assignment to an array left-hand side, as it is difficult to control whether the compiler recycles the memory of the lhs or creates a temporary array. Subroutines allow better control in that regard. At the same time, having these interfaces might also be a good case study for such performance problems.

So do I. I think adding a matmul is more of a sugar-coating thing than a performance-related one. I kind of see this as np.dot(A, x) vs A @ x in numpy (although both point to the exact same function in that case). Not quite recommended if you want performance, but more natural/readable for a simple piece of code.

PS: Now that the tests are passing, I'll start working on the docs and examples tomorrow.

@loiseaujc (Author) commented:

Sorry about this. I did not realize I had fprettify set to format automatically on save.

@perazz (Member) commented Mar 21, 2025

> it might be more natural to have y = matmul(A, x)

I've started a discussion on this at #951, great work @loiseaujc!

Moving all the implementations to a dedicated module (and submodule) will prevent clutter of the `sparse` module as well as make things clearer from an implementation and documentation point of view.

@jalvesz (Contributor) commented Mar 24, 2025

Oh, I found the initial design as an extension of the sparse module to be the ideal proposal. It would make it possible to improve the global structure by having different use cases emerge from it.

@loiseaujc (Author) commented Mar 24, 2025

> Oh, I found the initial design as an extension of the sparse module to be the ideal proposal. It would make it possible to improve the global structure by having different use cases emerge from it.

I had the same idea initially, but things started to get messy when working on the documentation. Moreover, if Circulant, Toeplitz and Hankel matrices eventually make their way into stdlib, these are not actually sparse matrices. I figured it made more sense to have a dedicated module grouping all the highly structured matrices. Additionally, very efficient linear solvers exist for all of these highly structured matrices, which is not the case for general sparse ones. I can still define Tridiagonal and the like as extensions of sparse_type, though, to make them compatible.

Another possibility is to keep Tridiagonal, SymTridiagonal, etc. in the sparse module, and only have Circulant, Toeplitz, and Hankel in the newly created stdlib_specialmatrices one. I'm open to either.

@jalvesz (Contributor) commented Mar 24, 2025

> I had the same idea initially, but things started to get messy when working on the documentation.

I'm curious about this. Would you mind letting me know why you found it complicated? I would definitely like to help make it easier to contribute and build on top of it. If it can be improved, let's try to do it :)

> Another possibility is to keep Tridiagonal, SymTridiagonal, etc. in the sparse module, and only have Circulant, Toeplitz, and Hankel in the newly created stdlib_specialmatrices one. I'm open to either.

Sounds about right!

@loiseaujc (Author) commented Mar 24, 2025

Well, it is not so much the process of documenting as having a somewhat consistent documentation when you start planning for the next set of PRs. Take the Tridiagonal matrix as an example. Here is a near-complete list of the mathematical features or fast algorithms that, to some extent, set it apart from general sparse matrices (in no particular order):

  • The set of n × n tridiagonal matrices actually forms a (3n − 2)-dimensional vector subspace, i.e. any linear combination of tridiagonal matrices is again a tridiagonal matrix. This is not the case for general sparse matrices, where the sparsity pattern of the result is the union of the sparsity patterns of the terms in the combination.
  • The spmv kernel for the tridiagonal matrix-vector product relies on lagtm, provided directly by LAPACK, rather than on hand-crafted kernels like the ones you implemented in stdlib_sparse_spmv.
  • The determinant can be computed in O(n) operations using a three-term recurrence (see the sketch after this list), rather than the standard O(n³) for other matrices (including sparse ones, I believe).
  • Linear systems Ax = b can be solved exactly, again in O(n) operations, using the Thomas algorithm, compared to standard iterative solvers or specialized sparse direct ones for general sparse matrices.
  • For symmetric tridiagonal matrices, eigenvalues and eigenvectors can be computed exactly using steqr in LAPACK. In contrast, it is rare to perform a complete spectral decomposition of a general sparse matrix (one typically uses some form of Arnoldi iteration instead to get only the leading eigenpairs).
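
To illustrate the O(n) determinant point, here is a minimal sketch of the three-term recurrence, assuming dl, dv and du hold the sub-, main and super-diagonals as in the constructor of this PR (dp from stdlib_kinds):

```fortran
pure function tridiagonal_det(dl, dv, du) result(det)
    ! Continuant recurrence: f(k) = dv(k)*f(k-1) - du(k-1)*dl(k-1)*f(k-2),
    ! with f(0) = 1 and f(1) = dv(1). O(n) operations, O(1) extra storage.
    real(dp), intent(in) :: dl(:), dv(:), du(:)   ! sizes n-1, n, n-1
    real(dp) :: det
    real(dp) :: fm2, fm1, f
    integer :: k
    fm2 = 1.0_dp
    fm1 = dv(1)
    f = fm1
    do k = 2, size(dv)
        f = dv(k)*fm1 - du(k-1)*dl(k-1)*fm2
        fm2 = fm1
        fm1 = f
    end do
    det = f
end function tridiagonal_det
```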

Modulo some variations in the algorithms, pretty much the same statements apply directly to SymTridiagonal, Circulant, Toeplitz and Hankel matrices: they all form m-dimensional vector subspaces (with m << n²), they all have specialized algorithms for computing matrix-vector products or solving linear systems, and most also have specialized eig/svd algorithms. And on a longer horizon, the same would also apply to block variants of these types.

Given the similarities between all these types, having them inside the same stdlib_specialmatrices module made sense to me. From a "math" point of view, I kind of see them as being closer to one another than any of them is to general sparse matrices. It is also easy to have consistent documentation across all the provided types, and this module could be used by itself. In contrast, the way I envision stdlib_sparse (but I may be wrong) is that it will be used in conjunction with a potential stdlib_sparse_linalg (kind of what scipy does) or stdlib_sparse_graph if graph-theoretical features are implemented.

@jalvesz (Contributor) commented Mar 24, 2025

I see! Makes sense!! I would envision that the higher-level APIs should be unified with proper interfaces. Say, if one wants to call a generic matmul or a solve procedure, versions should be available for any matrix type defined in stdlib (dense, sparse, special). Instead of making the matrix structure per se the center of focus, it becomes a helper concept for efficient storage and procedure implementation.

If some of the APIs can be brought closer in naming convention and usage, regardless of the internal implementation details, that would already be a good step forward.

@loiseaujc (Author) commented Mar 25, 2025

> I see! Makes sense!! I would envision that the higher-level APIs should be unified with proper interfaces. Say, if one wants to call a generic matmul or a solve procedure, versions should be available for any matrix type defined in stdlib (dense, sparse, special).

I guess we're on the same page then! That is pretty much what I tried in my personal SpecialMatrices implementation. All the matrix types actually extend the generic stdlib interfaces (e.g. x = solve(A, b), call eig(A, lambda, right, left), etc.) so that they just work whether A is a rank-2 array, a Tridiagonal matrix, or a Toeplitz one, for instance.
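
As a sketch of what extending the generic interfaces could look like (module, type and component names below are hypothetical; `solve` for rank-2 arrays does exist in `stdlib_linalg`, and `gtsv` is the LAPACK tridiagonal solver exposed through `stdlib_linalg_lapack`):

```fortran
module stdlib_specialmatrices_linalg   ! hypothetical module name
    use stdlib_kinds, only: dp
    use stdlib_linalg, only: solve          ! existing generic for rank-2 arrays
    use stdlib_linalg_lapack, only: gtsv    ! LAPACK tridiagonal solver
    implicit none

    ! Extend the same generic name so that x = solve(A, b) also accepts
    ! the structured type and dispatches to an O(n) solver.
    interface solve
        module procedure :: solve_tridiagonal
    end interface

contains

    function solve_tridiagonal(A, b) result(x)
        type(Tridiagonal_dp_type), intent(in) :: A   ! type/component names assumed
        real(dp), intent(in) :: b(:)
        real(dp) :: x(size(b))
        real(dp) :: dl(size(b)-1), dv(size(b)), du(size(b)-1)
        integer :: info
        dl = A%dl ; dv = A%dv ; du = A%du   ! copies, since gtsv overwrites its inputs
        x = b
        call gtsv(size(b), 1, dl, dv, du, x, size(b), info)
    end function solve_tridiagonal
end module stdlib_specialmatrices_linalg
```

With such an extension, x = solve(A, b) reads the same regardless of how A is stored.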

> Instead of making the matrix structure per se the center of focus, it becomes a helper concept for efficient storage and procedure implementation.

Indeed. Sure enough, using the correct matrix structure can be critically important for computational efficiency, but at the end of the day I simply want to solve Ax = b, and the x = solve(A, b) syntax conveys exactly that message. How A is actually represented behind the scenes does not really matter to many users.

> If some of the APIs can be brought closer in naming convention and usage, regardless of the internal implementation details, that would already be a good step forward.

That might be easy to do simply by extending already-existing interfaces. In an ideal world, however, I'd like these different definitions of matrices to be interoperable with packages other than stdlib. Say you have your own implementation of a particular sparse type while I have a package doing graph-theoretic operations. And say stdlib provides an AbstractSparseMatrix (possibly itself extended from a more fundamental AbstractMatrix) from which your type extends and for which I provide interfaces. How cool would it be for

```fortran
call my_arbitrary_graph_operation( your_specific_sparse_matrix )
```

to just work even though, on my side, I have never actually imported your package anywhere in my dependencies? I think, to some extent, this is what drives a large part of the interoperability between Julia packages. I don't think it would be too hard (technically) to partially emulate this behaviour with well-designed fundamental derived types (abstract or not). This discussion, however, is beyond the scope of this PR. Let's continue it on the Discourse.
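
A minimal sketch of that idea (all names hypothetical; nothing like this exists in stdlib today):

```fortran
module hypothetical_abstract_matrices
    use stdlib_kinds, only: dp
    implicit none

    ! Fundamental parent type that third-party packages would extend.
    type, abstract :: AbstractSparseMatrix
    contains
        ! Deferred matrix-vector product every concrete type must provide.
        procedure(apply_iface), deferred :: apply
    end type

    abstract interface
        subroutine apply_iface(self, x, y)
            import :: AbstractSparseMatrix, dp
            class(AbstractSparseMatrix), intent(in) :: self
            real(dp), intent(in)  :: x(:)
            real(dp), intent(out) :: y(:)
        end subroutine
    end interface

contains

    ! A graph routine written only against the abstract type "just works"
    ! for any extension, even one from a package this module never imports.
    subroutine my_arbitrary_graph_operation(A)
        class(AbstractSparseMatrix), intent(in) :: A
        ! ... operate through A%apply and other deferred bindings only ...
    end subroutine
end module hypothetical_abstract_matrices
```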

@loiseaujc marked this pull request as ready for review on March 25, 2025, 14:10
@loiseaujc (Author) commented Mar 25, 2025

I think it is pretty much ready for review. There are just a couple of things I'm not sure how to do properly with fypp or ford:

  • The spmv driver relies on lagtm behind the scenes and thus inherits the fact that, by default, the LAPACK backend is not compiled for extended and quadruple precision. At the moment I have a simple #:if k1 != "qp" and k1 != "xdp" wherever needed, but I figure this is not the most fypp-esque way to do so.
  • In the ford documentation, to link the overloaded operator interfaces to the corresponding section of the specification page, I have [[stdlib_specialmatrices(module):operator(+)(interface)]], which does not work. I am not sure what the correct syntax is, though, and the ford documentation was not very helpful for this particular feature.

@jalvesz (Contributor) commented Mar 26, 2025

Regarding lagtm: if you check here:

```fypp
#:if not rk in ["sp","dp"]
```

you'll see that the single and double precision versions are actually taken from the reference implementations, and the "if not" implies that the other kinds are generated by the fypp templating extension, which means that you could actually use those for xdp and qp. If you link against an optimized library, which will most likely only support single and double, those will be accelerated; xdp and qp will simply continue using the internal implementation.
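
Schematically, the kind handling described above looks like this (a sketch of the template logic; loop variable names are assumed from the stdlib fypp conventions):

```fypp
#! Sketch only: the real templates live in stdlib_linalg_lapack.
#:for rk, rt in REAL_KINDS_TYPES
#:if rk in ["sp", "dp"]
    !> sp/dp bodies wrap the reference LAPACK implementation (and get
    !> accelerated when linking against an optimized library)
#:else
    !> other kinds (xdp, qp) get a body generated from the same template,
    !> so they keep working without vendor support
#:endif
#:endfor
```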

@loiseaujc (Author) commented Mar 26, 2025

Alright. I don't know why I thought extended and quadruple precision were not supported by default. Maybe I read it in an old thread about stdlib. Anyway, I've changed the code and everything runs fine with xdp and qp.
