Release 1.2.0-OC

NVDLA 1.2.0-OC release is focused on open sourcing the NVDLA compiler.

New features and improvements

Source code for the NVDLA compiler

Guide to using the TensorRT INT8 calibration tool with NVDLA

Pre-generated INT8 calibration table for ResNet-50

Find the complete NVDLA feature set here

Fixes and other changes

None

Known limitations

Pre-quantized INT8 models are not supported

Release 1.2.0

NVDLA 1.2.0 release is focused on INT8 precision and nv_large/nv_small configurations support along with performance optimizations

INT8 precision support
- Per-tensor and per-kernel scale factors
- Symmetric scaling
nv_small and nv_large configuration support
FP16 winograd support
SDP fusion performance feature

Failures in deconvolution for number of groups greater than one
Disable scale+bias fusion to BN as it is not required with SDP fusion enabled
Disable winograd for output layers of network as it causes size mismatch due to special requirements of winograd
Return error for pooling layers with pad size greater than kernel size

An INT8 calibration tool is not provided
- More details about supplying INT8 scale factors to the NVDLA Compiler can be found in Low precision support in NVDLA

Provide feedback