
Refactoring GPU extensions #1365

Merged: 66 commits into ITensor:main, Apr 12, 2024

Conversation

@kmp5VT (Collaborator) commented Mar 27, 2024

Description

In this PR I am working to address some refactoring issues: removing using statements from the extension module files, diagnosing and fixing GPU BlockSparse backend failures, updating the adapt functions to use storagemode (as the AMDGPU extension does), and removing GPU examples from testing, based on the plan listed in Google Drive.

Checklist:

  • Investigate why the AMDGPU and CUDA backends don’t work for BlockSparse storage. Implement resize! for AMDGPU to get block sparse operations working on AMDGPU (see “Add resize!” JuliaGPU/Metal.jl#279 for a reference on how it might be implemented).
  • Update the adapt functions to take storagemode as an input parameter.
  • Update the extension libraries to move the bare using statements from the NDTensorsXExt.jl file to the files where they are necessary.
  • Delete NDTensors/ext/examples.
  • Remove examples from the NDTensors tests.
  • Define an MtlArrayAdaptor for adapting storage to MtlArray, analogous to CuArrayAdaptor, to be used here: https://github.com/ITensor/ITensors.jl/blob/v0.3.57/NDTensors/ext/NDTensorsMetalExt/adapt.jl (see the sketch after this checklist).
  • Update the generic_zeros and generic_rand functions for Dense and AbstractArray.
  • All unit tests pass.
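As a rough illustration of the MtlArrayAdaptor item above, here is a minimal sketch of what such an adaptor could look like via Adapt.jl. The adaptor name, its constructor, and the use of Metal.DefaultStorageMode are illustrative assumptions, not the implementation in this PR:

```julia
using Adapt: Adapt
using Metal: MtlArray, DefaultStorageMode  # DefaultStorageMode assumed to be provided by Metal.jl

# Hypothetical adaptor carrying the requested Metal storage mode as a type parameter,
# analogous in spirit to a CuArrayAdaptor parameterized on the CUDA memory type.
struct MtlArrayAdaptor{StorageMode} end

MtlArrayAdaptor() = MtlArrayAdaptor{DefaultStorageMode}()

# Move a host array to an MtlArray with the requested storage mode,
# preserving the element type and dimensionality.
function Adapt.adapt_storage(::MtlArrayAdaptor{StorageMode}, xs::AbstractArray) where {StorageMode}
  dest = MtlArray{eltype(xs),ndims(xs),StorageMode}(undef, size(xs))
  copyto!(dest, xs)
  return dest
end
```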

@kmp5VT kmp5VT marked this pull request as draft March 27, 2024 00:46
@mtfishman (Member):

Note that resize! is defined for AMDGPU.ROCVector: https://github.com/JuliaGPU/AMDGPU.jl/blob/v0.8.11/src/array.jl#L266.
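For reference, a minimal usage sketch of that existing method (illustrative only, assuming a working AMDGPU.jl setup):

```julia
using AMDGPU: ROCVector

v = ROCVector{Float32}(undef, 4)
fill!(v, 1.0f0)
resize!(v, 8)   # grows the buffer; the linked implementation keeps the existing elements
@assert length(v) == 8
```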

@codecov-commenter commented Mar 27, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 50.23%. Comparing base (e8197ce) to head (9ce1991).
Report is 4 commits behind head on main.

❗ Current head 9ce1991 differs from the pull request's most recent head 57abe1f. Consider uploading reports for the commit 57abe1f to get more accurate results.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files
@@             Coverage Diff             @@
##             main    #1365       +/-   ##
===========================================
- Coverage   79.59%   50.23%   -29.36%     
===========================================
  Files         114      113        -1     
  Lines        9032     8979       -53     
===========================================
- Hits         7189     4511     -2678     
- Misses       1843     4468     +2625     

☔ View full report in Codecov by Sentry.

@kmp5VT (Collaborator, Author) commented Mar 27, 2024

So far I have updated the CUDA extension to remove using statements from the module file. Next I am looking into this:

Investigate why the CUDA backends don’t work for BlockSparse storage (there is no append! function) and find a way to fix this issue.
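One possible direction, sketched here only as an illustration (the helper name append_gpu! is hypothetical, and this is not necessarily the fix adopted in the PR): emulate append!-style growth with resize! plus the offset copyto! methods that CUDA.jl already provides for CuVector.

```julia
using CUDA: CuVector

# Hypothetical helper: grow `dest` and copy `src` into the new tail,
# working around the missing `append!` method for GPU vectors.
function append_gpu!(dest::CuVector{T}, src::Union{Vector{T},CuVector{T}}) where {T}
  oldlen = length(dest)
  resize!(dest, oldlen + length(src))              # CUDA.jl defines resize! for CuVector
  copyto!(dest, oldlen + 1, src, 1, length(src))   # offset copy into the appended region
  return dest
end
```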

@kmp5VT kmp5VT marked this pull request as ready for review April 5, 2024 12:43
return StoreT(data)
end

function generic_zeros(StoreT::Type{<:Dense}, dims::Tuple{Integer})
Member suggested a change:

- function generic_zeros(StoreT::Type{<:Dense}, dims::Tuple{Integer})
+ function generic_zeros(StoreT::Type{<:Dense}, dims::Integer)

Member:

My proposal was to define both of them in terms of ::Integer, not ::Tuple{Integer}, since it is a simpler interface for new types to overload in general.

Collaborator (Author):

My only concern is that if you do something like generic_zeros(Dense, (2,)), it calls the AbstractArray version of this function.

Member:

Right, so then you should rewrite the AbstractArray version so that doesn't happen.

Collaborator (Author):

I am not sure I understand what you're suggesting. I reverted the code back to the original design you suggested, which works for both generic_zeros(Dense, 2) and generic_zeros(Dense, (2,)).
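To make the dispatch question concrete, here is a minimal standalone sketch (with a simplified stand-in Dense type, not the actual NDTensors definitions) of one way both call styles can resolve to the same behavior:

```julia
# Simplified stand-in for the Dense storage type, for illustration only.
struct Dense{T}
  data::Vector{T}
end

# Core method for Dense: takes a plain Integer length.
generic_zeros(::Type{Dense{T}}, dim::Integer) where {T} = Dense{T}(zeros(T, dim))

# A length-1 tuple forwards to the Integer method, so both call styles agree.
generic_zeros(storet::Type{Dense{T}}, dims::Tuple{Integer}) where {T} = generic_zeros(storet, only(dims))

# The AbstractArray method keeps taking a full dims tuple and is never hit by Dense inputs.
generic_zeros(arrayt::Type{<:AbstractArray{T}}, dims::Tuple) where {T} = convert(arrayt, zeros(T, dims))

# Usage: both calls now produce the same Dense storage.
generic_zeros(Dense{Float64}, 2)
generic_zeros(Dense{Float64}, (2,))
```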

@kmp5VT (Collaborator, Author) commented Apr 12, 2024

@mtfishman If you have a minute, can you please take a look at this PR again? Thank you, I appreciate your time!

Review comments on NDTensors/Project.toml and Project.toml were marked outdated and resolved.
@mtfishman (Member):

Looks good, thanks!

@mtfishman merged commit 2a6afa5 into ITensor:main on Apr 12, 2024
15 of 16 checks passed
@mtfishman changed the title from "[WIP] Refactoring GPU extensions" to "Refactoring GPU extensions" on Apr 12, 2024