Skip to content

@prasshantg prasshantg released this Aug 28, 2019 · 8 commits to master since this release

Release 1.2.0-OC

NVDLA 1.2.0-OC release is focused on open sourcing the NVDLA compiler.

New features and improvements

  • Source code for the NVDLA compiler
  • Guide to using the TensorRT INT8 calibration tool with NVDLA
  • Pre-generated INT8 calibration table for ResNet-50
  • Find the complete NVDLA feature set here

Fixes and other changes

  • None

Known limitations

  • Pre-quantized INT8 models are not supported
Assets 2

@prasshantg prasshantg released this May 28, 2019 · 10 commits to master since this release

Release 1.2.0

NVDLA 1.2.0 release is focused on INT8 precision and nv_large/nv_small configurations support along with performance optimizations

New features and improvements

  • INT8 precision support
    • Per-tensor and per-kernel scale factors
    • Symmetric scaling
  • nv_small and nv_large configuration support
  • FP16 winograd support
  • SDP fusion performance feature

Fixes and other changes

  • Failures in deconvolution for number of groups greater than one
  • Disable scale+bias fusion to BN as it is not required with SDP fusion enabled
  • Disable winograd for output layers of network as it causes size mismatch due to special requirements of winograd
  • Return error for pooling layers with pad size greater than kernel size

Known limitations

  • An INT8 calibration tool is not provided
Assets 2
You can’t perform that action at this time.