17 Jan 12:08

Naville

0.2.1.7

91b5ade

0.2.1.7 Pre-release

Pre-release

Vulkan winograd bug Fix
Fix Segment Fault issue crash when converting TFLite Models
Workaround Metal Softmax Op
Lower required OpenCL version to 1.10 from 2.0 to 1.10
vulkan winograd bug 修复
修正 tflite 转换时段错误问题
metal softmax bug 暂时规避
OpenCL 所需要版本由 2.0 降到 1.10 ，提升兼容性

Assets 6

31 Dec 05:04

Naville

0.2.1.6

e4fafa7

0.2.1.6 Pre-release

Pre-release

Op Support

Added support for:

Over 30 TFLite Ops
Over 20 Onnx Ops
9 Caffe Ops
24 Tensorflow Ops

Project Layout and Engineering Improvements

CMake Build System Rewrite
Header Layout Standardization. Public headers are now under include/MNN and is installed as <MNN/>

Inferencing Improvements and Bug Fixes

OpenCL BinaryOp Bugs
CPU MatMul Bugs
Added Unit Testing for over 30 ops
ImageProcess now supports NV12 input and NV21 / NV12 stride

Training (Experimental Feature)

Optimize Express Module's Dynamic Graph Execution Policy
Improve single machine training with a working demo

Op 补全

新增 TFlite 30 + op 支持
新增 Onnx 20 + op 支持
新增 Caffe 9 个 op 支持
新增 Tensorflow 24 个 op 支持

工程优化

完善CMake相关编译配置
头文件目录规范化，原 include 目录下的头文件移到 include/MNN 下

推理相关功能完善与Bug修复

OpenCL BinaryOp 相关 Bug 修复
CPU MatMul Bug 修复
完善单元测试，添加 30 + op 单元测试用例
ImageProcess 支持 NV12 输入以及 NV21 / NV12 stride 支持

训练能力(整体仍处调试阶段)

优化 Express 模块动态图运行机制
MNN 单机训练功能完善，Demo完成

Assets 2

15 Nov 06:23

li-qing

0.2.1.5

e93e8dc

0.2.1.5 Pre-release

Pre-release

0.2.1.5

integration

add travis CI
fix building parameters for python

converter

add half storage option for MNN converter
fix op name lost in converter
fix converter bug for print input output, identity remove output

ops

add quantized Convolution & Deconvolution support on OpenCL
add more expression supports
add DetectionPostProcess Op for TensorFlow Lite (ssd is supported directly now)
add supports for LSTM & ELU for ONNX
add support for Convolution that weights is not constant for ONNX
fix Unary Op compile error on Linux
fix Metal backend buffer reuse after resize
fix Metal raw memory access after model releasing
fix redundant transpose in Winograd generater

Assets 2

29 Oct 07:14

li-qing

0.2.1.2

a7982a8

0.2.1.2 Pre-release

Pre-release

build

unify schema building in core and converter;
add more build script for android;
add linux build script for python;

ops impl

add floor mod support in binary;
use eltwise impl in add/max/sub/mul binary for optimization;
remove fake double support in cast;
fix 5d support for concat;
add adjX and adjY support for batch matmul;
optimize conv2d back prop filter;
add pad mode support for conv3d;
fix bug in conv2d & conv depthwise with very small feature map;
optimize binary without broacast;
add data types support for gather;
add gather ND support;
use uint8 data type in gather v2;
add transpose support for matmul;
add matrix band part;
add dim != 4 support for padding, reshape & tensor convert;
add pad type support for pool3d;
make ops based on TensorFlow Lite quantization optional;
add all & any support for reduction;
use type in parameter as output type in reduction;
add int support for unary;
add variable weight support for conv2d;
fix conv2d depthwise weights initialization;
fix type support for transpose;
fix grad outputs count for reduce grad and reshape grad;
fix priorbox & detection output;
fix metal softmax error;

python

add runSessionWithCallBackInfo interface;
add max nodes limit (1400) for visualization tool;
fix save error in python3;
align default dim;

convert

add extra design for optimization;
add more post converting optimizers;
add caffe v1 weights blob support;
add cast, unary, conv transpose support for onnx model;
optimize batchnorm, conv with variable weights, prelu, reshape, slice, upsample for onnx model;
add cos/sin/atan/tan support for unary for tensorflow model;
add any/all support for reduction for tensorflow model;
add elu, conv3d, pool3d support for tensorflow model;
optimize argmax, batchnorm, concat, batch to space, conv with variable weights, prelu, slice for tensorflow model;

others

fix size computer lock;
fix thread pool deadlock;
add express & parameters in express;
rewrite blitter chooser without static map;
add tests for expr;

Assets 2

26 Sep 13:03

li-qing

0.2.1.0

73ad341

0.2.1.0 Pre-release

Pre-release

0.2.1.0

dynamic computation graph (beta)
- add supports (/express)
- add tests
- add benchmarks with it (/benchmark/exprModels)
Python
- MNN engine and tools were submitted to pip
- available on Windows/macOS/Linux
Engine/Converter
- add supports for each op benchmarking
- refactor optimizer by separating steps
CPU
- add supports for Conv3D, Pool3D, ELU, ReverseSequence
- fix ArgMax, Permute, Scale, BinaryOp, Slice, SliceTf
OpenCL
- add half transform in CPU
- add broadcast supports for binary
- optimize Conv2D, Reshape, Eltwise, Gemm, etc.
OpenGL
- add sub, real div supports for binary
- add supports for unary
- optimize Conv2D, Reshape
Vulkan
- add max supports for eltwise
Metal
- fix metallib missing problem
Train/Quantization
- use express to refactor training codes

Assets 2

01 Sep 11:25

li-qing

0.2.0.9

487a0fb

beta 0.2.0.9 Pre-release

Pre-release

beta 0.2.0.9

fix quantization tool compiling on Windows
fix converter compiling on Windows
fix eltwise optimization on Windows
separate sse & avx for Windows
add LeakyReLU support for TensorFlow
fix reshape, const for TensorFlow
fix dimension format error for ONNX ops
optimize winograd, ReLU for OpenCL
add fp16 availability & dimensions size check-up for OpenCL
optimize GEMM for arm32
fix ExpandDims shape calculation when inputs size == 1

Assets 2

22 Aug 12:15

li-qing

0.2.0.8

b995b25

beta 0.2.0.8 Pre-release

Pre-release

beta 0.2.0.8

add NaN check-up
add quantification support for ScaleAdd Op
add binary to eltwise optimization
add console logs for quantization tool
better document for quantization tool
replace redundant dimension flags with dimension format
optimize performance of TensorFlow Lite Quantized Convolution
fix axis support for ONNX softmax converting
fix getPerformance tool compiling error on Windows

Assets 2

15 Aug 09:31

li-qing

0.2.0.7

1005c13

beta 0.2.0.7 Pre-release

Pre-release

move docs to http://www.yuque.com/mnn
fix bugs for CPU ops TopKV2 and quantized convolution
add enqueue map buffer error handle for OpenCL
add nullptr protection for extra tensor desc
add failure protection for memory acquirement
fix slice shape calculation
refactor binary shape calculation

Assets 2

07 Aug 08:47

li-qing

0.2.0.6

f085106

release 0.2.0.6 Pre-release

Pre-release

fix bugs in quantization
add evaluating tool for quantization
add ADMM support in quantization
fix lock in thread pool
fix fusing for deconv
fix reshape converting from ONNX to MNN
turn off blob size checking by default

Assets 2

05 Aug 02:41

li-qing

0.2.0.5

7bb0df9

beta 0.2.0.5 Pre-release

Pre-release

beta 0.2.0.5

CPU
add support for DepthToSpace & SpaceToDepth ops
OpenGL
add Android demo
add half / float runtime option
add support for ROIPooling, Squeeze
fix bugs in conv im2col
OpenCL
fix Concat, Eltwise, Reshape bugs
Tools
add KL threshold method in quantization tool
support optimization for graph with multiple rnn

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Op Support

Project Layout and Engineering Improvements

Inferencing Improvements and Bug Fixes

Training (Experimental Feature)

Op 补全

工程优化

推理相关功能完善与Bug修复

训练能力(整体仍处调试阶段)

integration

converter

ops

build

ops impl

python

convert

others

Releases: alibaba/MNN

0.2.1.7

0.2.1.6

Op Support

Project Layout and Engineering Improvements

Inferencing Improvements and Bug Fixes

Training (Experimental Feature)

Op 补全

工程优化

推理相关功能完善与Bug修复

训练能力(整体仍处调试阶段)

0.2.1.5

integration

converter

ops

0.2.1.2

build

ops impl

python

convert

others

0.2.1.0

beta 0.2.0.9

beta 0.2.0.8

beta 0.2.0.7

release 0.2.0.6

beta 0.2.0.5