See what the GitHub community is most excited about today.
Efficient GPU kernels for block-sparse matrix multiplication and convolution
cuDF - GPU DataFrame Library
Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189
Fast parallel CTC.
Fully Convolutional Instance-aware Semantic Segmentation
MatConvNet: CNNs for MATLAB
GPU database engine
The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
Introduction to Parallel Programming class code
Automatically exported from code.google.com/p/cuda-convnet2
CUB is a flexible library of cooperative threadblock primitives and other utilities for CUDA kernel programming.
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume, CVPR 2018 (Oral)
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Reference implementation of real-time autoregressive wavenet inference
High-Performance Graph Primitives on GPUs
A GPU implementation of Convolutional Neural Nets in C++
Fast, gpu-based CSV parser
Precise RoI Pooling with coordinate gradient support, proposed in the paper "Acquisition of Localization Confidence for Accurate Object Detection" (https://arxiv.org/abs/1807.11590).
GPU Accelerated t-SNE for CUDA with Python bindings
A personal depthwise convolution layer implementation on caffe by liuhao.(only GPU)
Code release for "Convolutional Two-Stream Network Fusion for Video Action Recognition", CVPR 2016.
Pytorch Bindings for warp-ctc
PyTorch implementation of Deformable Convolution
A CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)