[BUG] cudf::strings::replace_re
can be 20X slower than a single thread on the CPU for some cases
#12778
Labels
0 - Backlog
In queue waiting for assignment
bug
Something isn't working
libcudf
Affects libcudf (C++/CUDA) code.
strings
strings issues (C++ and Python)
Similar to #12694, we discovered a similar issue with
cudf::strings::replace_re
for some input data.In particular, using the same data file as in #12694, Spark CPU (
regexp_replace
) can complete its job in around 600ms while the GPU version (which calls tocudf::strings::replace_re
) on the same input takes around 11 seconds.The text was updated successfully, but these errors were encountered: