Releases: nvdla/sw
NVDLA 1.2.0-OC
Release 1.2.0-OC
The NVDLA 1.2.0-OC release focuses on open-sourcing the NVDLA compiler.
New features and improvements
- Source code for the NVDLA compiler
- Guide to using the TensorRT INT8 calibration tool with NVDLA
- Pre-generated INT8 calibration table for ResNet-50
- Find the complete NVDLA feature set here
Fixes and other changes
- None
Known limitations
- Pre-quantized INT8 models are not supported
NVDLA 1.2.0
Release 1.2.0
The NVDLA 1.2.0 release focuses on INT8 precision and support for the nv_large and nv_small configurations, along with performance optimizations.
New features and improvements
- INT8 precision support
- Per-tensor and per-kernel scale factors
- Symmetric scaling
- nv_small and nv_large configuration support
- FP16 Winograd support
- SDP fusion performance feature
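
To illustrate the symmetric scaling and the per-tensor versus per-kernel scale factors listed above, here is a minimal sketch in plain Python. It is not the NVDLA compiler's implementation: the function names, the sample weights, and the choice of a signed range of [-127, 127] are assumptions made for illustration only.

```python
# Illustrative sketch only; names and the [-127, 127] range are assumptions,
# not taken from the NVDLA compiler sources.

def symmetric_scale(values, qmax=127):
    """Return the scale that maps the largest magnitude in `values` to qmax."""
    max_abs = max(abs(v) for v in values)
    return max_abs / qmax if max_abs > 0 else 1.0

def quantize(values, scale, qmax=127):
    """Symmetric quantization: no zero-point, so 0.0 always maps to 0."""
    return [max(-qmax, min(qmax, round(v / scale))) for v in values]

def dequantize(quantized, scale):
    """Map quantized integers back to approximate real values."""
    return [q * scale for q in quantized]

# Hypothetical weights for two output kernels with very different ranges.
kernels = [
    [0.5, -1.27, 0.0],
    [0.01, -0.03, 0.04],
]

# Per-tensor: a single scale shared by every kernel in the tensor.
per_tensor_scale = symmetric_scale([w for k in kernels for w in k])

# Per-kernel: one scale per output kernel.
per_kernel_scales = [symmetric_scale(k) for k in kernels]

for index, kernel in enumerate(kernels):
    shared = quantize(kernel, per_tensor_scale)
    own = quantize(kernel, per_kernel_scales[index])
    print(f"kernel {index}: per-tensor {shared}, per-kernel {own}")
```

In this sketch the second kernel uses only a few integer levels under the shared per-tensor scale, while its own per-kernel scale spreads the same weights across the full range. That is the usual motivation for offering both granularities.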
Fixes and other changes
- Fix failures in deconvolution when the number of groups is greater than one
- Disable scale+bias fusion into BN, as it is not required when SDP fusion is enabled
- Disable Winograd for the output layers of a network, as its special requirements cause a size mismatch
- Return an error for pooling layers whose pad size is greater than the kernel size
Known limitations
- An INT8 calibration tool is not provided
- More details about supplying INT8 scale factors to the NVDLA Compiler can be found in Low precision support in NVDLA