GridVariable3D, 4D; SpectralVariable3D, 4D #493

milankl · 2024-03-15T17:30:57Z

This introduces GridVariable3D, GridVariable4D, SpectralVariable3D, SpectralVariable4D, four types that extend the horizontal grids and horizontal LowerTriangularMatrices for the spherical harmonics in 1-2 dimensions. At the moment we still have a layer-oriented structure which puts the layer on the outside, i.e. we have dimensions currently structured like (x, y for grids, technically xy as they only have one index, or equivalently l, m for spectral)

x, y, var, t, z

but I think we want for the future (an GPUs)

x, y, z, t, var

Now one can create a GridVariable3D (88 grid points, 5 layers) as

julia> G = zeros(GridVariable3D, OctahedralGaussianGrid{Float32}, 2, 5)
88×5 GridVariable3D{Float32, OctahedralGaussianGrid{Float32}}:
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0
 ⋮                   
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0

a GridVariable4D similarly (88 grid points, 5 layers, 2 timesteps)

julia> G = zeros(GridVariable4D, OctahedralGaussianGrid{Float32}, 2, 5, 2)
88×5×2 GridVariable4D{Float32, OctahedralGaussianGrid{Float32}}:
[:, :, 1] =
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0
 ⋮                   
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0

[:, :, 2] =
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0
 ⋮                   
 0.0  0.0  0.0  0.0  0.0
 0.0  0.0  0.0  0.0  0.0

a SpectralVariable3D similarly (3x3 LowerTriangularMatrix with 2 layers)

julia> zeros(SpectralVariable3D{Float32}, 3, 3, 2)
3×3×2 SpectralVariable3D{Float32}:
[:, :, 1] =
 0.0  0.0  0.0
 0.0  0.0  0.0
 0.0  0.0  0.0

[:, :, 2] =
 0.0  0.0  0.0
 0.0  0.0  0.0
 0.0  0.0  0.0

or a SpectralVariable4D 3x3 LowerTriangularMatrix with 2 layers and 1 timestep

julia> zeros(SpectralVariable4D{Float32}, 3, 3, 2, 1)
3×3×2×1 SpectralVariable4D{Float32}:
[:, :, 1, 1] =
 0.0  0.0  0.0
 0.0  0.0  0.0
 0.0  0.0  0.0

[:, :, 2, 1] =
 0.0  0.0  0.0
 0.0  0.0  0.0
 0.0  0.0  0.0

milankl · 2024-03-15T17:33:05Z

I've just @propagate_inbounds like

Base.@propagate_inbounds Base.getindex(S::SpectralVariable4D, i::Integer, j::Integer, k::Integer, t::Integer) = S.data[k, t][i, j]

which I believe is correct because then all boundschecks are just done for the underlying vectors/matrices in there, or not at all with @inbounds. But I haven't tested any of that yet manually, or included some CIs

…yWeather.jl into mk/variables

simone-silvestri · 2024-03-15T20:59:51Z

src/dynamics/variables_3D_4D.jl

+    k::Integer,
+) where NF
+    npoints2D = RingGrids.get_npoints(Grid, nlat_half)
+    G3D = [zeros(Grid, nlat_half) for _ in 1:k]


is it more convenient to have a vector of vectors rather than a two-dimensional vector?

G3D here is a vector of vectors because the two horizontal dimensions of grid are unraveled because of our reduced grids that cannot be written as matrix

So every grid is ::AbstractVector not Matrix even though it represents a 2D field

maximilian-gelbrecht

This PR is also primarily thought for the upcoming GPU version, right?

To be honest, I am also not certain 100% yet how exactly to do it right either, and I would need to do some tests for it, but I am not so sure, the GridVariable3D, etc is the way to go.

You are implementing all structures as Vector of Vectors/CuArrays. The outer Vector is then a CPU array with pointers to the other Vectors/CuArrays that are likely to not be contiguous in memory. Intuitively, this doesn't seem ideal to me, and possibly detrimental to future GPU performance. Again, I only have a bit half-knowledge here, and didn't do some proper tests.

The alternative that I see, is to really make all low-level implementations of AbstractGrid and LowerTriangularMatrix N dimensional. I do think that this is actually reasonable easy to do, as you can just pass on any trailing indices during indexing. Like a grid then becoming

 struct MyNewGrid{T,N,A<:AbstractArray} <: AbstractGridClass{T}
        data::A{T,N}     
        nlat_half::Int      # resolution: latitude rings on one hemisphere (Equator incl)
end

@inline function Base.getindex(G::AbstractGrid, k::Integer, I...)  # something like this
    @boundscheck 0 < k <= length(G.data) || throw(BoundsError(G, k))
    # add another boundscheck for dimension here
    @inbounds r = getindex(G.data, k, I)
    return r
end

milankl · 2024-03-19T13:50:37Z

The advantage I see when GridVariable3D is a vector of vectors is that a lot of functionality is layer-based because it's a hydrostatic model. Then you can do vor[k] or layer(vor, k) (or whatever) and we'd get a vector for that layer back without allocations or the need to handle subarrays or views.

It also means that you can more easily reuse the internal functions and expose them to the user. If our spectral transforms are only defined between GridVariable3D and SpectralVariable3D we'd need to wrap and unwrap those. Maybe not that much of a problem, but still additional burden on the interface

milankl · 2024-03-19T13:53:20Z

This PR is also primarily thought for the upcoming GPU version, right?

Yes, but also generally to index variables simply as vor[ij,k] so that we can write our functions based on some high-level indexing logic but then swap out the underlying data structure simply by redefining getindex and setindex!

maximilian-gelbrecht · 2024-03-19T14:33:45Z

The advantage I see when GridVariable3D is a vector of vectors is that a lot of functionality is layer-based because it's a hydrostatic model. Then you can do vor[k] or layer(vor, k) (or whatever) and we'd get a vector for that layer back without allocations or the need to handle subarrays or views.

I think avoiding allocations from indexing the N-dimensional isn't really that hard, if it's properly tested. I admit, that I am also sometimes a bit confused when Julia allocates something and when not, but we can definitely set it up in a way that that's not the case, and could also set up convenience functions like
layer(vor, k) = view(vor, :, k) etc for that.

The more I think about it, the more I like the idea of just having N-dimensional Grid and LowerTriangularMatrix. It makes sure everything is contigous in memory, which (at least for GPU) I am almost certain will be an advantage for the performance, especially if we want to write kernels over the layer dimension as well. It would also make the code basis smaller and introduce less new types. You'd just have AbstractGrid and LowerTriangularMatrix which are new N-dimensional array types, and then PrognosticVariables has only those types as fields.

milankl · 2024-03-19T14:47:41Z

So here I've defined

GridVariable3D (widely used)
GridVariable4D (not used and I don't currently see a use case)
SpectralVariable3D (widely used)
SpectralVariable4D (4th dimension only of length 2 for timesteps)

meaning that performance-related, we'd really only care about the 3D cases, the SpectralVariable4D could just be a 2-element Vector{SpectralGrid3D} which gets unpacked depending on the leapfrog time step.

We could write everything for the 3D structs using [ij, k] (grids) and [l, m, k] as well as [lm, k] (spectral) for indexing, including the transforms. I believe that could be a general interface and then we can still play around with how exactly to setup the memory layout underneath. It's just a massive restructuring of all kernels which I'd like to do only once. Especially with all the places that rely on the current diagnostic_variables.layers[k].grid_variables.vor[ij] syntax

maximilian-gelbrecht · 2024-03-19T15:00:51Z

It's just a massive restructuring of all kernels which I'd like to do only once.

Absolutely! That's also why I was directly asking whether this is primarily for the GPU version, because this is something that should work for it as well then.

We could write everything for the 3D structs using [ij, k] (grids) and [l, m, k] as well as [lm, k] (spectral) for indexing, including the transforms. [...] Especially with all the places that rely on the current diagnostic_variables.layers[k].grid_variables.vor[ij] syntax

I was thinking that exactly this is actually a major part of making the model GPU compatible, as the old structure isn't really ideally suited for it, and possibly should be done together with writing the actual GPU kernels. At least I think it would also save a lot of work to only do it once it that regard as well.

milankl · 2024-03-19T15:08:30Z

Yes, it should be reasonably future-proof restructuring for GPU and I believe we can still do the current multi-threading by writing the kernels as @floop for k in eachlayer(grid) or similar, which moves the @floop macros further towards to kernels, which is fine I believe.

I think we need to make a better informed decision by benchmarking. Once we just use the indexing as outlined above, it's just a matter of redefining getindex, setindex!. But to avoid any pitfalls we should test it. It's a bit of work to restructure the transforms, but maybe we can start with simpler 3D kernels to get a feel for how much difference it actually makes.

maximilian-gelbrecht · 2024-03-19T15:14:08Z

Yes, the beauty of Julia is that you can really manipulate all of those things and implement it in this very generic way.

I am just not sure if we (or you) really want to invest this time now, or not wait until we tackle the full GPU implementation. And for the full GPU implementation I'd still go bottom to top and start with the most low-level parts, because this allows us to actually test things on GPU as we go.

milankl · 2024-03-19T22:44:37Z

I am just not sure if we (or you) really want to invest this time now, or not wait until we tackle the full GPU implementation.

Sure, when do you want to start 😉

milankl · 2024-03-20T16:01:14Z

Just had a call with @maximilian-gelbrecht we concluded the following

the module SpeedyWeather should use [ij,k,t] (grid) [l,m,k,t] or [lm,k,t] (spectral) indexing to have an abstract interface of a variable, e.g. prognostic_variables.vor that would be independent of the underlying data structure.
We probably want to implement the grids n-dimensional similar to

struct FullGaussianND{Eltype, Ndims, ArrayType} <: AbstractArray{Eltype,Ndims}
    data::ArrayType{Eltype,Ndims}
    nlat_half::Int      # resolution parameter for grids
    rings::ArrayType{UnitRange{Int},1}    # precalculated vector for indexing rings, e.g. [1:20, 21:44,...]
end

# current FullGaussianGrid would be a subtype
FullGaussianGrid{T} = FullGaussianND{T, 1, Vector}

But to make a better informed decision these should be experimented with on a GPU, using some getindex, setindex and maybe some simple kernels as benchmarks.
Delay the rewrite of the kernels in the dynamical core / the SpeedyTransforms kernels till we know how we want it with respect to a GPU implementation

milankl · 2024-03-25T20:42:54Z

New LowerTriangularArray being implemented at #503 which would replace SpectralVariable3D/SpectralVariable4D here. However, the changes to the PrognosticVariables and DiagnosticVariables should still be implemented just with those new array types!

I'd really like instead of this (currently)

humid = simulation.diagnostic_variables.layers[end].grid_variables.humid_grid

this to work

humid = simulation.diagnostic_variables.grid.humid[:,end]

returning surface humidity as ::AbstractGrid

GridVariable3D, 4D; SpectralVariable3D, 4D

499b9bc

milankl added structure 🏠 Internal structure of the code gpu 🖼️ Everthing GPU related array types 🌐 LowerTriangularMatrices and RingGrids labels Mar 15, 2024

milankl requested review from maximilian-gelbrecht and simone-silvestri March 15, 2024 17:30

milankl added 4 commits March 15, 2024 13:34

boundschecks in underlying vectors/matrices only

a6eeeeb

Merge branch 'main' into mk/variables

2ce6ae5

name harmonics l,m consistently i, j for 3D 4D vars

e5558f9

Merge branch 'mk/variables' of https://github.com/SpeedyWeather/Speed…

031783f

…yWeather.jl into mk/variables

simone-silvestri reviewed Mar 15, 2024

View reviewed changes

PrognosticVariables,DiagnosticVariables with 3D/4D vars

3119d2a

maximilian-gelbrecht reviewed Mar 19, 2024

View reviewed changes

milankl mentioned this pull request Apr 1, 2024

Modified dynamics #507

Open

milankl mentioned this pull request Apr 25, 2024

Prognostic and DiagnosticVariables array-agnostic #525

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GridVariable3D, 4D; SpectralVariable3D, 4D #493

GridVariable3D, 4D; SpectralVariable3D, 4D #493

milankl commented Mar 15, 2024 •

edited

Loading

milankl commented Mar 15, 2024 •

edited

Loading

simone-silvestri Mar 15, 2024

milankl Mar 15, 2024

milankl Mar 15, 2024

maximilian-gelbrecht left a comment •

edited

Loading

milankl commented Mar 19, 2024 •

edited

Loading

milankl commented Mar 19, 2024

maximilian-gelbrecht commented Mar 19, 2024

milankl commented Mar 19, 2024

maximilian-gelbrecht commented Mar 19, 2024

milankl commented Mar 19, 2024

maximilian-gelbrecht commented Mar 19, 2024

milankl commented Mar 19, 2024

milankl commented Mar 20, 2024 •

edited

Loading

milankl commented Mar 25, 2024 •

edited

Loading

GridVariable3D, 4D; SpectralVariable3D, 4D #493

Are you sure you want to change the base?

GridVariable3D, 4D; SpectralVariable3D, 4D #493

Conversation

milankl commented Mar 15, 2024 • edited Loading

milankl commented Mar 15, 2024 • edited Loading

simone-silvestri Mar 15, 2024

Choose a reason for hiding this comment

milankl Mar 15, 2024

Choose a reason for hiding this comment

milankl Mar 15, 2024

Choose a reason for hiding this comment

maximilian-gelbrecht left a comment • edited Loading

Choose a reason for hiding this comment

milankl commented Mar 19, 2024 • edited Loading

milankl commented Mar 19, 2024

maximilian-gelbrecht commented Mar 19, 2024

milankl commented Mar 19, 2024

maximilian-gelbrecht commented Mar 19, 2024

milankl commented Mar 19, 2024

maximilian-gelbrecht commented Mar 19, 2024

milankl commented Mar 19, 2024

milankl commented Mar 20, 2024 • edited Loading

milankl commented Mar 25, 2024 • edited Loading

milankl commented Mar 15, 2024 •

edited

Loading

milankl commented Mar 15, 2024 •

edited

Loading

maximilian-gelbrecht left a comment •

edited

Loading

milankl commented Mar 19, 2024 •

edited

Loading

milankl commented Mar 20, 2024 •

edited

Loading

milankl commented Mar 25, 2024 •

edited

Loading