NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
-
Updated
May 15, 2024 - C++
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
A high performance anime upscaler
stdgpu: Efficient STL-like Data Structures on the GPU
Cross Platform Professional Procedural Terrain Generation & Texturing Tool
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
CUDA C++ Core Libraries
Node-based image editor with GPU-acceleration.
Deep learning toolkit-enabled VLSI placement
Vulkan compute for people
Open-Source CUDA/OpenCL Speed Of Light Ray-tracer
GPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Vahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Fast Neural Machine Translation in C++ - development repository
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
An efficient, extensible occupancy map supporting probabilistic occupancy, normal distribution transforms in CPU and GPU.
Add a description, image, and links to the gpu-acceleration topic page so that developers can more easily learn about it.
To associate your repository with the gpu-acceleration topic, visit your repo's landing page and select "manage topics."