CianMcGriskinARM released this 29 Nov 16:33 · 5 commits to branches/armnn_24_08 since this release
Arm NN SDK
New Features:
- Implemented "All or Nothing" logic in assign backends.
- Added Signed64 support for Constant and Tile workloads.
- Added Int8 and Uint8 support for LogSoftMax to run on CpuAcc and GpuAcc backends.
- Added automatic backend selection of GPU to ExecuteNetwork.
- Added recognition of TfLite Turbo Models and enabling of Turbo Mode.
TosaCommon & TosaRef:
- Added "Convolution3d" support.
- Added support for activation functions "Sigmoid" and "TanH".
- Added support for activation function "HardSwish".
- Added "StridedSlice" support.
- Added "ElementwiseBinary:SqDiff" support.
- Added "Stack" support.
- Added "Dequantize" support.
- Added "DepthToSpace" support.
- Added "Gather" support.
Bug Fixes:
- Fixed the ReduceProdOp Int8 CpuAcc and GpuAcc backend failures in the delegate test suite.
- Fixed Mean operator in TosaCommon and TosaRef.
- Fixed ExecuteNetwork logic error when using -N parameter.
- Fixed convolution delegate test suite failures.
- Fixed UnpackOpTest Acc backend failures in the delegate test suite.
- Fixed layer name error on classic delegate DepthwiseConvolution2d.
- Fixed GatherNd Acc backend failures in the delegate test suite.
- Fixed FloorDiv Acc backend failures in the delegate test suite.
- Fixed StridedSliceOp Acc backend failures in the delegate test suite.
- Fixed Pad tests incorrectly reporting unsupported in the delegate test suite.
Removed Features:
- Removed the Arm NN Converter.
- Removed the GpuFsa backend.
Other Changes:
- Updated ExecuteNetwork comparison options to return a result value.
- Added Debugging macros to LoadedNetwork.
ABI/API Changes:
The following front-end API changes were made during the implementation of 24.11. Users should be aware of them before upgrading.
Due to these changes, we have bumped our ARMNN_VERSION to 34.0.0, following Semantic Versioning guidelines.
Feature | SHA | Gerrit Review | Resultant ABI/API Changes |
---|---|---|---|
Make ExecuteNetwork handle automatic selection of GPU backends. | b670f11 | https://review.mlplatform.org/c/ml/armnn/+/12662 | The size of the BackendRegistry class has changed from 192 bytes to 216 bytes. Field m_MappedGpuBackends has been added. |
Fix FloorDiv Acc DTS Failures. | 88a149f | https://review.mlplatform.org/c/ml/armnn/+/12411 | The BinaryOperation enum has a new member, FloorDiv, with a value of 8. |
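
As a rough illustration of what the major version bump implies for downstream users, the sketch below applies the general Semantic Versioning rule to version strings. It is not part of Arm NN: the helper names `parse_version` and `is_abi_compatible` are hypothetical, and `33.0.0` is an assumed earlier version used only for the example. Only `34.0.0` comes from these notes.

```python
def parse_version(text):
    """Split a 'MAJOR.MINOR.PATCH' string into a tuple of three ints."""
    major, minor, patch = (int(part) for part in text.split("."))
    return major, minor, patch


def is_abi_compatible(built_against, runtime):
    """Apply the Semantic Versioning rule of thumb.

    A binary built against one version is expected to work with a runtime
    library only if the major versions match and the runtime is not older
    than the version it was built against.
    """
    built = parse_version(built_against)
    run = parse_version(runtime)
    return built[0] == run[0] and run[1:] >= built[1:]


# "34.0.0" is the ARMNN_VERSION stated in these release notes.
# "33.0.0" is a hypothetical earlier version, used only for illustration.
print(is_abi_compatible("33.0.0", "34.0.0"))  # major differs: rebuild needed
print(is_abi_compatible("34.0.0", "34.0.0"))  # same version: compatible
```

In practice this means an application linked against a library with an earlier major version should be rebuilt against 24.11 rather than relinked at run time.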
Build Dependencies
Tools | Supported Version |
---|---|
Git | 2.17.1 or later |
SCons | 2.4.1 (Ubuntu) and 2.5.1 (Debian) |
CMake | 3.22.1 |
TensorFlow | 2.15.0 |
ONNX | 1.6.0 |
FlatBuffers | 23.5.26 |
Protobuf | 3.12.0 |
Android NDK | r26b |
cxxopts | 3.1.1 |
doctest | 2.4.6 |
fmt | 7.0.1 |
ghc | 1.3.2 |
half | 1.12.0 |
mapbox/variant | 1.1.0 |
stb | 2.16 |
Gemmlowp | 16e8662c34917be0065110bfcd9cc27d30f52fdf |