
Add a library node for np.transpose for >2 dims and a cuTENSOR implementation #1303

Draft · wants to merge 4 commits into base: master
Conversation

lamyiowce (Contributor):

Adds:

  • a library node for np.transpose for >2 dims
  • a cuTENSOR implementation
  • a pure implementation (copied from what was previously in replacements.py)
  • a cuTENSOR environment

What I don't know:

  • whether the file structure is correct
  • how to make DaCe find cuTENSOR cleanly. It has to be downloaded separately (it is not bundled with CUDA), and I don't know how that is handled in DaCe.
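On the discovery question, one common pattern for a separately-downloaded library is an opt-in environment variable. This is a hypothetical sketch, not an existing DaCe convention — the name `CUTENSOR_ROOT` and the helper below are assumptions:

```python
import os

def find_cutensor_root() -> str:
    """Locate a separately-downloaded cuTENSOR archive.

    CUTENSOR_ROOT is a hypothetical variable name for this sketch; DaCe may
    have its own convention for discovering third-party libraries.
    """
    root = os.environ.get('CUTENSOR_ROOT')
    if root is None:
        raise RuntimeError(
            'cuTENSOR not found: set CUTENSOR_ROOT to the directory of the '
            'extracted libcutensor archive.')
    return root
```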

@tbennun (Collaborator) left a comment:

Looks good! Some minor modifications.

cmake_variables = {}
cmake_includes = []
cmake_libraries = ["cutensor"]
cmake_compile_flags = ["-L/users/jbazinsk/libcutensor-linux-x86_64-1.7.0.1-archive/lib/11"]
maybe better to set up the LIBRARY_PATH envvar locally.
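The suggestion can be sketched as follows. The archive path is the machine-specific one from the diff above; `LIBRARY_PATH` and `LD_LIBRARY_PATH` are the standard GCC link-time and runtime loader search-path variables:

```shell
# Instead of hardcoding -L<path> in cmake_compile_flags, export the search
# paths in the local environment before building/running.
CUTENSOR_DIR=/users/jbazinsk/libcutensor-linux-x86_64-1.7.0.1-archive
export LIBRARY_PATH="$CUTENSOR_DIR/lib/11${LIBRARY_PATH:+:$LIBRARY_PATH}"
export LD_LIBRARY_PATH="$CUTENSOR_DIR/lib/11${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
```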

@@ -0,0 +1,54 @@
# Copyright 2019-2021 ETH Zurich and the DaCe authors. All rights reserved.

Suggested change
# Copyright 2019-2021 ETH Zurich and the DaCe authors. All rights reserved.
# Copyright 2019-2023 ETH Zurich and the DaCe authors. All rights reserved.

@@ -0,0 +1,67 @@
// Copyright 2019-2022 ETH Zurich and the DaCe authors. All rights reserved.

Suggested change
// Copyright 2019-2022 ETH Zurich and the DaCe authors. All rights reserved.
// Copyright 2019-2023 ETH Zurich and the DaCe authors. All rights reserved.


static void CheckCutensorError(cutensorStatus_t const& status) {
if (status != CUTENSOR_STATUS_SUCCESS) {
throw std::runtime_error("cuSPARSE failed with error code: " + std::to_string(status));

isn't there a cutensorGetErrorString or something similar?
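There is: cuTENSOR exposes `cutensorGetErrorString(cutensorStatus_t)`, which returns a human-readable string. A sketch of the suggested fix follows; the enum and the `cutensorGetErrorString` definition below are local stand-ins so the snippet compiles without the cuTENSOR headers (in the real code they come from `<cutensor.h>`):

```cpp
#include <stdexcept>
#include <string>

// Stand-ins for the real cuTENSOR declarations (assumption for this sketch).
enum cutensorStatus_t {
  CUTENSOR_STATUS_SUCCESS = 0,
  CUTENSOR_STATUS_NOT_INITIALIZED = 1,
};

const char* cutensorGetErrorString(cutensorStatus_t status) {
  switch (status) {
    case CUTENSOR_STATUS_SUCCESS: return "CUTENSOR_STATUS_SUCCESS";
    case CUTENSOR_STATUS_NOT_INITIALIZED: return "CUTENSOR_STATUS_NOT_INITIALIZED";
  }
  return "unknown cuTENSOR status";
}

static void CheckCutensorError(cutensorStatus_t const& status) {
  if (status != CUTENSOR_STATUS_SUCCESS) {
    // Use the descriptive string instead of the raw numeric code; this also
    // corrects the "cuSPARSE" wording in the original message.
    throw std::runtime_error(std::string("cuTENSOR failed: ") +
                             cutensorGetErrorString(status));
  }
}
```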

@@ -0,0 +1,203 @@
# Copyright 2019-2021 ETH Zurich and the DaCe authors. All rights reserved.

Suggested change
# Copyright 2019-2021 ETH Zurich and the DaCe authors. All rights reserved.
# Copyright 2019-2023 ETH Zurich and the DaCe authors. All rights reserved.

@@ -37,8 +41,39 @@ def test_transpose():
assert rel_error <= 1e-5


@pytest.mark.parametrize('implementation', ['pure', 'cuTENSOR'])

cuTENSOR should be marked with pytest.mark.gpu, because it won't be available on the standard CI machines. See pytest.param and the marks kwarg for more info.
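The suggestion can be sketched like this (the test name and body are placeholders; `gpu` is assumed to be a custom mark registered in the project's pytest configuration):

```python
import pytest

@pytest.mark.parametrize('implementation', [
    'pure',
    # Only the cuTENSOR case carries the gpu mark, so it can be deselected
    # on machines without a GPU (e.g. with `pytest -m "not gpu"`).
    pytest.param('cuTENSOR', marks=pytest.mark.gpu),
])
def test_transpose_nd(implementation):
    ...  # placeholder for the actual transpose test body
```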

@alexnick83 (Contributor):

It would be best if you used the TensorTranspose library node in the ttranspose library, which includes an expansion for the HPTT library. You can also find in the same branch tests and a working cuTENSOR environment.

@BenWeber42 (Contributor):

What's the status here? Is this related to #1309 ("Support for tensor linear algebra (transpose, dot products)")?
