Skip to content
#

parallel-algorithms

Here are 36 public repositories matching this topic...

Efficient implementations of Merge Sort and Bitonic Sort algorithms using CUDA for GPU parallel processing, resulting in accelerated sorting of large arrays. Includes both CPU and GPU versions, along with a performance comparison.

  • Updated Jul 27, 2023
  • Cuda

SIMPLE is a framework for implementation of parallel algorithms using our methodology for developing high performance programs running on clusters of SMP nodes. Our methodology is based on a small kernel (SIMPLE) of collective communication primitives that make efficient use of the hybrid shared and message passing environment. We illustrate the…

  • Updated Dec 28, 2020
  • C

Improve this page

Add a description, image, and links to the parallel-algorithms topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the parallel-algorithms topic, visit your repo's landing page and select "manage topics."

Learn more