GitHub - josephjohnjj/cuda-sum-search: Benchmarking of parallel sum and search algorithms

README

This repository contains microbenchmarking code for various implementations of GPU algorithms that build and search a prefix sum, where the desired output is the result of the search (or multiple searches), and not the prefix sum itself. The focus is on the "partial prefix sum" algorithm (presented at NCI TechTake on May 31st, 2022) in which only the up-sweep phase of the work-efficient parallel prefix sum is performed, and the resultant binary tree is searched directly.

Baseline comparisons are the work-efficient parallel prefix sum as described in GPU Gems 3, and the single pass with decoupled lookback algorithm implemented in CUB.

A 2x speedup over the work-efficient algorithm is achieved by the partial sum algorithm, with performance on par with the CUB implementation. Further optimisation (also applicable to the single pass algorithm), relying on extra memory working space, provides an additional 20% faster throughput, with the possibility to go even faster while also requiring less extra global memory (L1 cache size allowing, and at a minor cost in search speed).

LICENSE

Some parts of this code are derived from the source code of the work-efficient parallel scan algorithm described in GPU Gems 3, and are thereby subject to copyright as indicated in the relevant files.

All other code in this repository is released under the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE.txt		LICENSE.txt
README.md		README.md
modules_wiener.txt		modules_wiener.txt
plot_results.py		plot_results.py
run_tests.sh		run_tests.sh
submit_wiener.sh		submit_wiener.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

README

LICENSE

About

Uh oh!

Releases

Packages

Languages

License

josephjohnjj/cuda-sum-search

Folders and files

Latest commit

History

Repository files navigation

README

LICENSE

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages