Expose `linalg::dot` in public API #968

benfred · 2022-10-31T23:47:58Z

Closes #805

cjnolet

Thanks again for this PR! We are going to want to think about whether these (axpy, dot, etc...) should be accepting general mdspan or whether we should be constraining them to be vectors up front.

It would also be nice to see the current vector factory functions made more flexible to enable strided layouts rather than adding new functions.

These types of examples (using the existing device vector factory functions to create a strided vector) would be great to have in the quick start as well.

cjnolet · 2022-11-01T00:18:57Z

cpp/include/raft/core/device_mdspan.hpp

+ */
+template <typename ElementType, typename IndexType = int, typename LayoutPolicy = layout_stride>
+auto make_strided_device_vector_view(ElementType* ptr, IndexType n, IndexType stride)


Rather than adding another factory function for a strided vector, why not just allow a strided layout to be configured in the make_device_vector_view and make_host_vector_view?

Right now the make_*_vector_view automatically configures a row-major layout but the layout should really be configurable (and potentially strided, or col major if desired).

I've updated make_device_vector_view to allow strided input here - let me know what you think.

cpp/include/raft/core/device_mdspan.hpp

cjnolet · 2022-11-01T00:59:54Z

cpp/include/raft/linalg/dot.cuh

+template <typename InputType1,
+          typename InputType2,
+          typename OutputType,
+          typename = raft::enable_if_input_device_mdspan<InputType1>,


I brought this up with the axpy as well, but it seems weird to accept a general mdspan for this when what we are really looking for is a 1d vector. Do you see value in accepting a matrix or dense tensor with 3+ dimensional extents? If not, we should just accept the vector_view directly (which is aliased to be any mdspan with 1d extents.

If we accepted a device_vector_view directly, we wouldn't need the enable_if statements at all. I think we should go ahead and do the same for the axpy to keep things consistent.

agreed - made the changes here so that both axpy and dot take device_vector_view's

cjnolet · 2022-11-01T01:09:00Z

cpp/include/raft/linalg/dot.cuh

+
+  // Right now the inputs and outputs need to all have the same value_type (float/double etc).
+  // Try to output a meaningful compiler error if mismatched types are passed here.
+  // Note: In the future we could remove this restriction using the cublasDotEx function


Should we just go ahead and wrap the cublasEx functions?

I created an issue so we can discuss further #977 .

Reading the docs a little closer, and it looks like even w/ cublasDotEx having different dtypes for the input/outputs isn't currently supported: https://docs.nvidia.com/cuda/cublas/index.html#cublas-dotEx - so it won't have much value for the dot API (though I could see a use for it myself with the gemm api w/ implicit and the mixed precision work I was talking about last week)

cjnolet

Changes are looking great! Remaining things are very minor.

cjnolet · 2022-11-07T13:56:06Z

cpp/include/raft/core/device_mdspan.hpp

 * @return raft::device_vector_view
 */
 template <typename ElementType,
          typename IndexType    = std::uint32_t,
          typename LayoutPolicy = layout_c_contiguous>
-auto make_device_vector_view(ElementType* ptr, IndexType n)
+auto make_device_vector_view(ElementType* ptr, IndexType n, IndexType stride = 1)


This is a little awkward. We accept a layout policy as a template argument, but then we also accept a function argument for a stride which essentially overrides the layout from the template.

Would it be achieving this same goal if a user were to just set a strided layout on the template argument directly? Perhaps we could provide a factory function to make said strided layout and provide the user with something like a statically sized object (eg. std::array) to set the strides for each dimension?

An of course, this is one of those things (the new strided factory function) that I think should have a usage example in the doxygen and perhaps even a subsection section in the mdspan tutorial markdown of the docs.

If I'm understanding you correctly - you're thinking we can just pass the layout mapping to the make_device_vector_view function directly , and add a new factory function for creating this layout mapping?

I took a stab at that in the last commit - unfortunately, I couldn't get a single make_device_vector_view function to compile successfully with being passed both a IndexType with the number of elements and the Mapping with the strided layout (was getting compile errors in various other raft functions that I hadn't updated). However, I could get it to work with adding an overload - which is whats in the last commit. Do you have any suggestions on how to clean this up =) ?

I'll add something to the tutorial / docs once we're happy with the API -

cpp/include/raft/linalg/axpy.cuh

cpp/include/raft/linalg/dot.cuh

* Remove default types, * Try to fix up factory functions for creating strided vector views * Add dot funcction that takes host scalar / host_scalar_view

cjnolet · 2022-11-08T19:05:10Z

cpp/include/raft/linalg/dot.cuh

+void dot(const raft::handle_t& handle,
+         raft::device_vector_view<const ElementType, IndexType, LayoutPolicy1> x,
+         raft::device_vector_view<const ElementType, IndexType, LayoutPolicy2> y,
+         ElementType* out)


I think for the host output, we probably should drop this overload. Sorry for being confusing here. I think it makes more sense to accept a host scalar by value for functions like axpy where the scalar is an input. For output on host, I think we should stick to the mdspan scalar wrappers.

removed in latest commit

cpp/include/raft/core/device_mdspan.hpp

cpp/test/linalg/axpy.cu

cpp/test/linalg/dot.cu

cpp/include/raft/core/device_mdspan.hpp

cjnolet

Looks great, thanks again @benfred!

cjnolet · 2022-11-09T22:20:34Z

@gpucibot merge

Expose linalg::dot in public API

01dd067

Closes rapidsai#805

benfred requested review from a team as code owners October 31, 2022 23:47

github-actions bot added CMake cpp labels Oct 31, 2022

benfred added non-breaking Non-breaking change enhancement New feature or request and removed cpp CMake labels Oct 31, 2022

formatting

e6a5bb1

github-actions bot added CMake cpp labels Oct 31, 2022

benfred added improvement Improvement / enhancement to an existing function and removed enhancement New feature or request labels Oct 31, 2022

cjnolet requested changes Nov 1, 2022

View reviewed changes

cjnolet assigned benfred Nov 1, 2022

benfred added 3 commits November 1, 2022 10:13

Merge branch 'branch-22.12' into linalg_dot

400dfa9

Updates from code review

f376c51

Update axpy to take a device_vector_view

9c9efe8

cjnolet requested changes Nov 7, 2022

View reviewed changes

Changes from codereview

2bf4eae

* Remove default types, * Try to fix up factory functions for creating strided vector views * Add dot funcction that takes host scalar / host_scalar_view

cjnolet reviewed Nov 8, 2022

View reviewed changes

remove dot w/ host pointer overload

2e1c0e6

cjnolet reviewed Nov 8, 2022

View reviewed changes

cpp/include/raft/core/device_mdspan.hpp Show resolved Hide resolved

Added docs / created 'make_strided_layout' factory function

a668098

cjnolet reviewed Nov 9, 2022

View reviewed changes

cpp/test/linalg/axpy.cu Outdated Show resolved Hide resolved

cpp/test/linalg/dot.cu Outdated Show resolved Hide resolved

cpp/include/raft/core/device_mdspan.hpp Outdated Show resolved Hide resolved

benfred added 4 commits November 9, 2022 10:26

Add doxygen for make_strided_layout

977949f

fix

a125c4f

Test out host and device api's

6f8a76c

Merge remote-tracking branch 'origin/branch-22.12' into linalg_dot

af52c0c

test out both device/host alpha scalar overloads with dot

25333ee

cjnolet approved these changes Nov 9, 2022

View reviewed changes

Fix docstring

98f7d85

rapids-bot bot merged commit 7176d94 into rapidsai:branch-22.12 Nov 10, 2022

benfred deleted the linalg_dot branch November 10, 2022 06:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose `linalg::dot` in public API #968

Expose `linalg::dot` in public API #968

benfred commented Oct 31, 2022

cjnolet left a comment

cjnolet Nov 1, 2022

benfred Nov 1, 2022

cjnolet Nov 1, 2022

benfred Nov 1, 2022

cjnolet Nov 1, 2022

benfred Nov 1, 2022

cjnolet left a comment

cjnolet Nov 7, 2022

benfred Nov 8, 2022

cjnolet Nov 8, 2022

benfred Nov 8, 2022

cjnolet left a comment

cjnolet commented Nov 9, 2022

Expose linalg::dot in public API #968

Expose linalg::dot in public API #968

Conversation

benfred commented Oct 31, 2022

cjnolet left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cjnolet left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cjnolet left a comment

Choose a reason for hiding this comment

cjnolet commented Nov 9, 2022

Expose `linalg::dot` in public API #968

Expose `linalg::dot` in public API #968