Skip to content
/ atex Public
forked from NVIDIA/atex

A TensorFlow Extension: GPU performance tools for TensorFlow.

License

Notifications You must be signed in to change notification settings

5l1v3r1/atex

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NVIDIA/Atex: A TensorFlow Extension

This repository holds NVIDIA-maintained utilities to improve GPU performance and usability for Tensorflow training and inference. The intent of Atex is to make up-to-date utilities available to users as quickly as possible. As such, these utilities are experimental. Some we will upstream and support permanently in projects such as TensorFlow or Keras. Others will eventually be discontinued.

Contents

1. Fused Layer/Instance Normalization

nv_norms.LayerNormalization and nv_norms.InstanceNormalization are fused implementations designed to replace the tf.keras.layers.LayerNormalization and tfa.layers.InstanceNormalization. The Keras and Addons implementations compose many ops, such as Mul, Add, etc., to compute the stats and then scale/offset the results. In contrast, the NVIDIA fused implementation provided in nv_norms compute the norms in a single operation, eliminating many expensive round-trips to device memory and significantly improving performance. More in nv_ops.

Contribution guidelines

Please review the Contribution Guidelines.

GitHub issues will be used for tracking requests and bugs.

License

BSD License 2.0

About

A TensorFlow Extension: GPU performance tools for TensorFlow.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 76.0%
  • Python 20.8%
  • Shell 1.7%
  • Makefile 1.5%