Skip to content
View piDack's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Beijing
  • 09:52 (UTC +08:00)

Block or report piDack

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Heterogeneous Computing

CUDA,HIP & SYCL
15 repositories

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 4,342 579 Updated Mar 13, 2026

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

LLVM 1,442 819 Updated Mar 15, 2026

大规模并行处理器编程实战 第二版答案

C++ 34 Updated Jun 4, 2022

NVIDIA Linux open GPU kernel module source

C 16,797 1,617 Updated Mar 13, 2026

Main Book repository for the Parallel and High Performance Computing book, Manning Publications

Shell 229 54 Updated Jun 5, 2022

A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation

C++ 1,490 233 Updated Feb 16, 2026

GPU-accelerated real-time reference-based dynamic phase retrieval G-LS3U

C++ 11 2 Updated Nov 13, 2021

Memory Topology for GPUs

C++ 18 9 Updated Mar 4, 2026

Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.

C 162 31 Updated Feb 3, 2022

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,951 2,295 Updated Jan 6, 2026

An extension library of WMMA API (Tensor Core API)

Cuda 111 16 Updated Jul 12, 2024

Test suite for probing the numerical behavior of NVIDIA tensor cores

Cuda 43 15 Updated Jul 24, 2024

Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.

Cuda 408 52 Updated Jan 2, 2025

Tile primitives for speedy kernels

Cuda 3,226 257 Updated Mar 14, 2026

The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm

Python 850 196 Updated Mar 15, 2026