OpenCL integration for Python, plus shiny features
-
Updated
May 29, 2024 - Python
OpenCL integration for Python, plus shiny features
High-Performance Computing: CPU Instructions, GPU OpenCL & CUDA, etc. ☀️
Distributed_compy is a distributed computing library that offers multi-threading, heterogeneous (CPU + mult-GPU), and multi-node support
Simulating ADC background calibration algorithms in OpenCL
GloVe representation with parallelization
To accompany the paper "An efficient new static scheduling heuristic for accelerated architectures".
We present an algorithm to dynamically adjust the data assigned for each worker at every epoch during the training in a heterogeneous cluster. We empirically evaluate the performance of the dynamic partitioning by training deep neural networks on the CIFAR10 dataset.
Convolutional Nets implemented in pyCuda.
Add a description, image, and links to the heterogeneous-parallel-programming topic page so that developers can more easily learn about it.
To associate your repository with the heterogeneous-parallel-programming topic, visit your repo's landing page and select "manage topics."