Is this a duplicate?
Area
libcu++
Is your feature request related to a problem? Please describe.
We want to implement a CUDA backend for the following algorithms:
Describe the solution you'd like
The should reuse the device_select CUB backend
Describe alternatives you've considered
No response
Additional context
No response