Adding cast string to float kernel #631
Conversation
build
Would be good to have someone with more CUDA experience review cast_string_to_float.cu.
Co-authored-by: Jason Lowe <jlowe@nvidia.com>
@@ -144,6 +144,7 @@ add_library(
  src/CastStringJni.cpp
  src/NativeParquetJni.cpp
  src/cast_string.cu
  src/cast_string_to_float.cu
What is cast_string.cu? This new file name is more meaningful, so we may want to rename the old file to make it more meaningful too.
Originally the plan was to put all of the casting kernels in cast_string.cu, but at this point we realized that it would simply be too big. This one is split out now, and the original kernels will be split out as well in another PR.
__device__ string_to_float(T* _out,
                           bitmask_type* _validity,
                           int32_t* _ansi_except,
                           size_type* _valid_count,
                           const char* const _chars,
                           offset_type const* _offsets,
                           int _warp_id,
                           uint64_t const* const _ipow,
                           bitmask_type const* _incoming_null_mask,
                           size_type const _num_rows)
I feel something's wrong: this functor takes too many parameters. It would be great if we could decouple them by using multiple functors instead.
The design choice was to use a functor to avoid passing so many variables around to each function, but that leaves us passing them all in at the beginning to feed that object. I don't know how much we could free up, as this is a single kernel invocation for speed reasons.
Thinking about this more, we could pull the base functor into this function and then call the private functions with parameters, but that feels like too much for this function. If we push that out to another function, we're back in the same boat, since we have to pass all of that state into the next function.
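For readers following the thread, here is a minimal sketch of the pattern being debated (names simplified; this is not the PR's actual code): the constructor captures the shared kernel state once, so the private helpers read members instead of each taking a ten-parameter list.

```cpp
// Illustrative sketch only -- simplified from the design discussed above.
#include <cudf/types.hpp>

template <typename T>
struct string_to_float_fn {
  // All shared state is captured once at construction...
  __device__ string_to_float_fn(T* out, char const* chars, cudf::size_type num_rows)
    : _out(out), _chars(chars), _num_rows(num_rows)
  {
  }

  // ...so the per-row entry point and its helpers stay parameter-light.
  __device__ void operator()(cudf::size_type row)
  {
    if (row < _num_rows) { _out[row] = parse(row); }
  }

 private:
  // Helpers take only what varies per call; everything else is a member.
  __device__ T parse(cudf::size_type row)
  {
    // per-row parsing against _chars would go here
    return T{};
  }

  T* _out;
  char const* _chars;
  cudf::size_type _num_rows;
};
```

The trade-off is exactly as described: splitting this into multiple functors would shrink each constructor, but the same state still has to be threaded into whichever object ends up doing the work.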
…rapids-jni into mwilson/float_cast
TYPED_TEST(StringToFloatTests, ANSIInvalids)
{
  cudf::test::strings_column_wrapper in[] = {{"A"}, {"."}, {"e"}};
I think we should add more cases for the ANSI test; these are only very simple cases.
I added tests from the plugin that failed to ensure they didn't regress. The plugin tests are VERY extensive.
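As an illustration of the kind of extra coverage being asked for, a few more malformed inputs could sit alongside the existing cases. The specific strings below are my assumptions, not cases taken from this PR or the plugin suite:

```cpp
// Illustrative only: assumes the same StringToFloatTests fixture as above.
TYPED_TEST(StringToFloatTests, ANSIInvalidsExtra)
{
  cudf::test::strings_column_wrapper in[] = {
    {"1.2.3"},  // multiple decimal points
    {"1e"},     // dangling exponent
    {"+-1"},    // conflicting signs
    {"infx"},   // trailing junk after a valid prefix
    {" "}       // whitespace only
  };
  // Under ANSI semantics each of these casts would be expected to take
  // the exception path rather than produce a value.
}
```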
Finally I have read through the code. My head was about to explode without calling any xxx::explode.
Performance information added to spark-rapids PR since the original code path was entirely inside that code base.
Co-authored-by: Nghia Truong <nghiatruong.vn@gmail.com>
build
}
CUDF_EXPECTS(dtype == data_type{type_id::FLOAT32} || dtype == data_type{type_id::FLOAT64},
             "invalid float data type");
if (string_col.size() == 0) { return std::make_unique<column>(); }
Probably this should return an empty column of the requested floating-point type?
Good point, I should. Thanks.
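A minimal sketch of the agreed fix, assuming cudf's make_empty_column factory (which returns a zero-row column carrying the given type):

```cpp
// from <cudf/column/column_factories.hpp>
// An empty input should yield an empty column of the *requested* float type,
// not a default-constructed (type EMPTY) column.
if (string_col.size() == 0) { return cudf::make_empty_column(dtype); }
```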
Please be aware of rapidsai/cudf#11875. You may need to change the code in this PR (and throughout this repo, in another separate PR). Given that PR is ready to merge, I'll probably approve this after that is propagated here.
I changed my mind. Please finalize this and merge it ASAP, then make a separate PR to adopt that breaking change.
build |
This adds a string-to-float kernel to improve the performance of this conversion.
This is one of the casting kernels for NVIDIA/spark-rapids#5639.
Closes #632