-
-
Notifications
You must be signed in to change notification settings - Fork 776
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
max
and min
for float32 cupy array may be slow
#2085
Comments
"reduction" operations including "max" and "min" in cupy are currently implemented in rather general way and are not so optimized in terms of performance, as far as I know. You may be able to get better performance by using some reduction implementation in cuDNN, CUB or Thrust, though those are not used in cupy for now. CuPy team: Is anyone already working on performance improvement of "reduction" operations? I'm considering to speed up cupy reductions with CUB. Is there any concerns on use of CUB in cupy? |
Hi, as of the latest version of cupy, the
May I please get more information on what may be a workaround? Thanks |
#6549 resolved this issue.
|
python -c 'import cupy; cupy.show_config()'
)In this case, Cupy is slower then Numpy.
The text was updated successfully, but these errors were encountered: