You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We actually use only 16 lanes, because we convert float16 to float32 to detect NAN. There is a better way to do it, but this works for now and hardly affects perf.
The bug was introduced when we started using size_t instead of int64_t to represent array sizes. arrsize - 16 underflows when it is size_t and looped forever and segfaulted.
Describe the issue:
Segmentation fault during testing quicksort/half-precision on AMD/ZEN4
Reproduce the code example:
Error message:
Python and NumPy Versions:
show_config()
The text was updated successfully, but these errors were encountered: