Release 18.05

TelmoARM released this on 23 May at 15:30

This release of Arm NN integrates the latest Compute Library and adds improvements to thread-safety, memory consumption and overall performance.

New Features:

  • In general, the amount of RAM needed for a loaded network has been reduced by 20-30% compared to release 18.03.
  • The latest 8-bit quantized operations from Compute Library have been integrated. In testing, 8-bit quantized MobileNet models are 3x faster than in release 18.03.
  • Graphs can now be loaded and unloaded simultaneously from multiple threads. In other words, the methods IRuntime::LoadNetwork() and IRuntime::UnloadNetwork() are thread-safe.
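
The calling pattern that the thread-safety change permits can be sketched as follows. This is only an illustration: `MockRuntime` and `StressLoadUnload` are hypothetical stand-ins written for this example, not the Arm NN `IRuntime` class, and the real `LoadNetwork()` and `UnloadNetwork()` methods have different signatures. The mock protects its state with a mutex so that several threads can load and unload networks at the same time.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <set>
#include <thread>
#include <vector>

// Hypothetical stand-in for a runtime. NOT the Arm NN IRuntime class.
class MockRuntime {
public:
    // Registers a network and returns its id. Safe to call concurrently.
    int LoadNetwork() {
        std::lock_guard<std::mutex> lock(m_Mutex);
        const int id = m_NextId++;
        m_Loaded.insert(id);
        return id;
    }

    // Removes a network. Returns false if the id is not loaded.
    bool UnloadNetwork(int id) {
        std::lock_guard<std::mutex> lock(m_Mutex);
        return m_Loaded.erase(id) == 1;
    }

    // Number of networks currently loaded.
    std::size_t LoadedCount() const {
        std::lock_guard<std::mutex> lock(m_Mutex);
        return m_Loaded.size();
    }

private:
    mutable std::mutex m_Mutex;
    std::set<int> m_Loaded;
    int m_NextId = 0;
};

// Starts `threadCount` threads that each load and then unload `iterations`
// networks, and returns how many networks remain loaded once all threads
// have finished.
std::size_t StressLoadUnload(MockRuntime& runtime, int threadCount, int iterations) {
    std::vector<std::thread> workers;
    for (int t = 0; t < threadCount; ++t) {
        workers.emplace_back([&runtime, iterations] {
            for (int i = 0; i < iterations; ++i) {
                const int id = runtime.LoadNetwork();
                const bool unloaded = runtime.UnloadNetwork(id);
                assert(unloaded);
                (void)unloaded;  // Avoids an unused-variable warning with NDEBUG.
            }
        });
    }
    for (auto& worker : workers) {
        worker.join();
    }
    return runtime.LoadedCount();
}
```

Calling `StressLoadUnload(runtime, 4, 100)` on a fresh `MockRuntime` should leave no networks loaded, since every load is paired with an unload on the same thread.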

Public API Changes:

  • IsConvolution2dSupported requires additional TensorInfo arguments describing the output and bias tensors.

Other changes:

  • This release of Arm NN requires at least release 18.05 of the Compute Library.
  • Fixed an issue where pooling operations whose pooling width and height differ would produce the wrong output.
  • Fixed an issue in the Caffe parser where BatchNormalization would return the wrong results when the rolling average factor was non-zero.
  • Fixed the known issue in 18.03 where the multiplication layer could not support tensors of different shapes.
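
To make the pooling fix above concrete, here is a minimal reference sketch of max pooling with a non-square window. It is not the Arm NN or Compute Library implementation: `PooledExtent` and `MaxPool2d` are hypothetical helpers written for this example, and the sketch assumes a single-channel row-major input, no padding, and a window that fits inside the input. The point is that window height and width are handled independently on each axis, so swapping them changes both the output shape and the values.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Output extent along one axis for a pooling window with no padding.
std::size_t PooledExtent(std::size_t input, std::size_t pool, std::size_t stride) {
    assert(pool > 0 && stride > 0 && input >= pool);
    return (input - pool) / stride + 1;
}

// Max pooling over a row-major height x width matrix using a
// poolH x poolW window and independent strides per axis.
std::vector<float> MaxPool2d(const std::vector<float>& in,
                             std::size_t height, std::size_t width,
                             std::size_t poolH, std::size_t poolW,
                             std::size_t strideH, std::size_t strideW) {
    assert(in.size() == height * width);
    const std::size_t outH = PooledExtent(height, poolH, strideH);
    const std::size_t outW = PooledExtent(width, poolW, strideW);
    std::vector<float> out(outH * outW);
    for (std::size_t oy = 0; oy < outH; ++oy) {
        for (std::size_t ox = 0; ox < outW; ++ox) {
            const std::size_t y0 = oy * strideH;  // Top row of the window.
            const std::size_t x0 = ox * strideW;  // Left column of the window.
            float best = in[y0 * width + x0];
            for (std::size_t ky = 0; ky < poolH; ++ky) {
                for (std::size_t kx = 0; kx < poolW; ++kx) {
                    best = std::max(best, in[(y0 + ky) * width + (x0 + kx)]);
                }
            }
            out[oy * outW + ox] = best;
        }
    }
    return out;
}
```

For a 4x4 input holding the values 1 to 16 in row-major order, a 2x1 window (height 2, width 1) with strides 2x1 yields a 2x4 output, while a 1x2 window with strides 1x2 yields a 4x2 output.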