Skip to content

River12/NVIDIA-tensor-core-examples

 
 

Repository files navigation

NVIDIA Tensor Core Examples

This repository collects multiple examples for using NVIDIA Tensor Cores. Please see individual examples for their licensing requirements.

Examples

  • cudaTensorCoreGemm - Implements a GEMM operation using WMMA instructions
  • simpleCUBLASEx - Demonstrates an SGEMM using Tensor Cores via the cublasGemmEx API
  • simpleCUBLASHgemm - Demonstrates calling HGEMM directly from cuBLAS
  • simpleCUBLASSgemm - Demonstrates using Tensor Cores implicitly from SGEMM
  • CUTLASS WMMA GEMM - Using WMMA instructions from the CUTLASS framework.
  • pictc - Implements a simple Particle-In-Cell pusher using Tensor Cores
  • DCGAN - Illustrates using Automatic Mixed Precision (AMP) within PyTorch using the DCGAN network.
  • ImageNet - Illustrates using Automatic Mixed Precision (AMP) with imagenet.

Instructions

Some examples are stored in git submodules. It is necessary to call git submodule init after cloning or clone with the --recursive-submodules option.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 86.2%
  • Makefile 13.8%