-
Notifications
You must be signed in to change notification settings - Fork 0
Closed
Description
This tutorial will demonstrate how you can accelerate your networks with nvFuser. nvFuser is a Deep Learning Compiler that just-in-time compiles fast and flexible GPU specific code to reliably accelerate users’ networks automatically, providing speedups for deep learning networks running on Volta and later CUDA accelerators by generating fast custom “fusion” kernels at runtime. nvFuser is specifically designed to meet the unique requirements of the PyTorch community, and it supports diverse network architectures and programs with dynamic inputs of varying shapes and strides.
https://pytorch.org/tutorials/intermediate/nvfuser_intro_tutorial.html
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels