Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Update interfaces to CUB 1.16 (#192)
* test_hipcub_device_radix_sort.cpp Correctly test -NaN. * `test_utils::native_half` -NaN to `float` fix * `hipcub::WarpExchange` interface to `::rocprim::warp_exchange` * Fix after review * Default CUDA architecture is 53 to fix __half * Apply 1 suggestion(s) to 1 file(s) * Added NVGPU_TARGETS to gitlab-ci * Update .gitlab-ci.yml file * Changes from [PR346](NVIDIA/cub#346) * Add deprecation warnings. * Update of deprecated statement. * Adding constants from [PR418](NVIDIA/cub#418). * Fix deprecation warnings. * Fix a forgotten deprecation warnings. * Fix deprecation warnings. * Fix deprecation warnings for nvcc. * Replace '__host__ __device__' by 'HIPCUB_HOST_DEVICE' * Added Cuda standard * Bumped referenced CUB and thrust version to 1.16 * Download thrust in test/extra * Added the interface for UniqueByKey * Added test for UniqueByKey * Added benchmark for UniqueByKey * Add UniqueByKey interface * Fix alignment of UniqueByKey parameters * Use 'unsigned int' instead of a one element vector for selected_count_output in UniqueByKey benchmark * Update interface * Update tests, add test for int64_t size * Upde CUB interface * Apply 1 suggestion(s) to 1 file(s) * Add interfaces for subtract * Ignore deprecation warnings from rocPRIM for flags API * Add deprecation warnings for Flags API * Ignore deprecation warnings for Flags API tests * Fix Subtract interfaces * Fix SubtractRightPartial not using the right method * Add benchmark for AdjacentDifference (Subtract) * Add test for AdjacentDifference (Subtract) * Use 'HIPCUB_HOST_DEVICE' macro * Fix a typo * Fix interfaces of Subtract not matching the CUB one * Upadte the tests and benchmarks to the fixed interfaces of Subtract * Fix to use temp_storage_ in subtract call * Fix the tests of Subtract to work with the CUB interfaces * Add the macros to ignore warning in config.hpp and remove it from block_adjacent_difference file and the from the tests * Device adjacent difference CUB backend * New thread operators [skip ci] * Test device adjacent difference [skip ci] * Device adjacent difference rocPRIM backend * Added new headers to the hipcub.hpp-s * Benchmark for device adjacent difference * Added missing thread operators * Updated changelog for CUB 1.16 * Updating changelog for hipCUB 1.16 in next release Co-authored-by: Vince <vince@streamhpc.com> Co-authored-by: Gergely Mészáros <gergely@streamhpc.com> Co-authored-by: Théo Battrel <theo@streamhpc.com> Co-authored-by: Balint Soproni <balint@streamhpc.com> Co-authored-by: Stanley Tsang <stanley.tsang@amd.com>
- Loading branch information