Release Release 18.05 · ARM-software/armnn

This release of Arm NN integrates the latest Compute Library and adds improvements to thread-safety, memory consumption and overall performance.

New Features:

In general, the amount of RAM needed for a loaded network has been reduced by 20-30% compared to release 18.03.
The latest 8-bit quantized operations from Compute Library have been integrated. In testing, 8-bit quantized mobilenets models are 3x faster compared to release 18.03.
It is now supported to load and unload graphs simultaneously from multiple threads. In other words, the methods IRuntime::LoadNetwork() and IRuntime::UnloadNetwork() are thread-safe.

Public API Changes:

IsConvolution2dSupported requires additional TensorInfo arguments describing the output and bias tensors.

Other changes:

This release of ArmNN requires at least release 18.05 of the Compute Library.
Fixed an issue where pooling operations with different pooling width and height would produce the wrong output.
Fixed an issue in the Caffe parser where BatchNormalization would return the wrong results when the rolling average factor was non-zero.
Fixed the known issue in 18.03 where the multiplication layer could not support tensors of different shapes.

Provide feedback