A 3D render engine from scratch, using CUDA/C++.
-
Updated
Jun 9, 2024 - Cuda
A 3D render engine from scratch, using CUDA/C++.
A high-performance, zero-overhead, extensible Python compiler using LLVM
A General-purpose Parallel and Heterogeneous Task Programming System
CUDA C++ Core Libraries
CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
An Upstream Clang/LLVM-based toolchain for contemporary C++ and heterogeneous programming
A real-time path tracer from scratch written in C++ using CUDA and OpenGL
Productive, portable, and performant GPU programming in Python.
🌟 Vertex Centric approach for building GNN/TGNNs
Upload of CUDA programs developed in my GPU Computing Course
Learnings and experimentation with GPU programming
Computations and statistics on manifolds with geometric structures.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
A GPU algorithm for enumerating weak pseudomanifolds
FastFlow pattern-based parallel programming framework (formerly on sourceforge)
Add a description, image, and links to the gpu-programming topic page so that developers can more easily learn about it.
To associate your repository with the gpu-programming topic, visit your repo's landing page and select "manage topics."