* Fix trt multistream logger (#1521)
* [FIX] fix trt logger
* [FIX] catch std::bad_alloc error for trt8 building
* [FIX] return null while shape_tensor size -1
* Update version.h
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* Update split_utils.cc (#1528)
Compiling with mingw32 reported an error, because the mingw32 compiler still requires the namespace qualifier:
    [ 99%] Building CXX object CMakeFiles/TNN.dir/source/tnn/utils/split_utils.cc.obj
    D:\TNN\source\tnn\utils\split_utils.cc: In static member function 'static tnn::Status tnn::SplitUtils::SplitStr(const char*, tnn::str_arr&, const char*, bool, bool, bool, bool, bool)':
    D:\TNN\source\tnn\utils\split_utils.cc:163:23: error: 'min' was not declared in this scope
    int len = min((i - cursor), subs_length - 1);
I think this change is better: it works with mingw32 and stays compatible with the previously supported compilers.
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* Update README.md (#1538)
Typos

* [UPD]update QQ group (#1552)
* [BUG]fix YouTu face alignment model
* [UPD]update mean pts file logic
* [UPD]draw face points green
* [UPD]unify example controller list
* [UPD]unify example controller list
* [UPD]move blaze anchor file to resource
* [METAL]update tnn project
* [UPD]update tool onnx2coreml
* [ADD]support ShareCommandQueue between instances
* [ADD]support ShareCommandQueue between instances
* [UPD]add log message
* [UPD]transfer file half.hpp
* [UPD]fix xcode compile error with fp16
* [UPD]fix xcode compile error with fp16
* [UPD]update model type error msg
* [FIX]fix logic error of constofshape
* [UPD]update debug message
* [FIX]support int32 for neg op
* [BUG]fix init error with nil commandbuffer
* [UPD]add mac build xcode project; fix ios mac build script;
* [UPD]add mac build xcode project; fix ios mac build script;
* [ADD]add QQ group 2 of TNN
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* [opencl][fix] try save program cache (#1557)

* Dev roi align (#1511)
* [ARM] fix int32 blob cvt to mat
* [ARM] support roi align
* [ARM] add roi align unit test
* [ARM] add to xcodeproj
Co-authored-by: lucasktian <lucasktian@tencent.com>
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* Fix arm gather and constant blob (#1564)
* [ARM][BUG] fix gather error for indice < 0
* [ARM][BUG] fix buffer to blob error without converting precision
* [ARM] update type convert in layer_norm fp16
Co-authored-by: quinnrong94 <67782915+quinnrong94@users.noreply.github.com>

* Dev add config layer (#1569)
* add config layer param to set arm conv algorithm for specific layer
Co-authored-by: powerpwang <powerpwang@outlook.com>
Co-authored-by: ealinli <ealinli@tencent.com>

* Fix the onnx2tnn build failure caused by the protobuf version upgrade (#1571)
* [ONNX][BUG]1. fix compile bug;
* [ONNX2TNN][BUG]1. fix the compile problem introduced by the protobuf version upgrade;

* [ADD][TOOLS] add dynamic range quantization (#1572)
* [ADD][TOOLS] support fake quantization
* [UPD][FAKE_QUANT] fix bug
* [UPD][DOC] add fake quantization in doc
* [UPD] 1.rename fake quant to dynamic range quant 2.move dequant to net_optimizer
* [UPD] remove redundant comment
* [UPD] update comment for DynamicRangeDequant
* [DRQuant][UPD] fix namespace issue
* [DRQuant][UPD] Turn off TNN_SYMBOL_HIDE to fix ci
Co-authored-by: ealinli <ealinli@tencent.com>
Co-authored-by: Dandi Ding <bluaxe@users.noreply.github.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>

* [UPD][OPENCL] opencl support using unoptimized conv (#1581)
Co-authored-by: ealinli <ealinli@tencent.com>

* [UPD][CONVERTER] lstm support sequence_lens (#1585)
Co-authored-by: ealinli <ealinli@tencent.com>

* [MODEL_CHECK][BUG]1. fix bug for dump layer(fp16); (#1567)
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* Bugfix from train branch (#1592)
* [BUG] fix get dims value bug when input is 1D or 2D in arm_reduce_layer_acc.cc.
* [BUG] fix Convert from NCHW to NHWC error when input is on arm device.
* [BUG] fix convert mat to blob bug when input is NC_INT32 on arm device.
* [BUG] fix tflite_converter bug when transform an activation layer.
* add nchw format condition when copy int32 mat to blob
* rollback changes on tflite_op_converter.cc
Co-authored-by: sanerzheng <sanerzheng@tencent.com>

* [UPD][OPENCL] opencl support x86 mat (#1593)
Co-authored-by: ealinli <ealinli@tencent.com>

* [CONVERTER][BUG]1. fix issue 1595; (#1596)

* [UPD][OPENCL] add ocl version check (#1601)
* [UPD][OPENCL] add ocl version check
* [UPD][OPENCL] update message for version check
Co-authored-by: ealinli <ealinli@tencent.com>

* [UPD][OPENCL] solve the problem that matmul, tile have incorrect results on helio p65 (#1602)
Co-authored-by: ealinli <ealinli@tencent.com>

* [UPD][DYQ] fix dynamic range quant compile error on windows (#1604)
Co-authored-by: ealinli <ealinli@tencent.com>

* [DOC][UPD] modify image links in doc (#1617)
Co-authored-by: ealinli <ealinli@tencent.com>

* remove redundant test cases (#1614)

* Fix typos. (#1626)
* Fix typos.
* Update Readme.
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* Interpreter change from std::map to safe_map, the latter offers a const operator[] function (#1618)
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>

* [UPD][OPENCL] get opencl version when GpuType is OTHER (#1636)
* [UPD][OPENCL] get opencl version when GpuType is OTHER
* [UPD][OPENCL] optimize nv gpu judgment logic
Co-authored-by: ealinli <ealinli@tencent.com>

* Patch x86 avx support (#1633)
* merge dev_vc14_m1_debug, support x86 avx
* add option to support x86 avx2 compile
* update win_x86_opencl building script
Co-authored-by: Dandiding <Dandiding@tencent.com>

* fix x86 avx2 options (#1638)

* fix typos in doc (#1634)
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>

* [X86][BUG] fix deconv layer build error (#1641)
* [OPENCL][FIX] fix conv and dwconv on some of the AMD GPUs
* [UPD][OPENCL] add coor check for conv and dwconv
* [OPENCL][FIX] fix compilation issues
* [OPENCL][UPD] optimize AMD GPU judgment logic
Co-authored-by: ealinli <ealinli@tencent.com>

* [OPENCL][UPD] fix deconv, avgpool on AMD GPU (#1646)
* [OPENCL][UPD] fix deconv and avgpool when read image
* [OPENCL][UPD] add header file for pooling
Co-authored-by: ealinli <ealinli@tencent.com>

* [OPENCL][UPD] opencl support cache on windows (#1645)
* [UPD][OPENCL] add coor check for conv and dwconv
* [OPENCL][FIX] fix compilation issues
* [OPENCL][UPD] optimize AMD GPU judgment logic
* [OPENCL][UPD] support cache on windows
* [OPENCL][UPD] fix load cache on windows
Co-authored-by: ealinli <ealinli@tencent.com>

* [DRQ][UPD] dynamic range quant model support do const folder (#1647)
* [DRQ][UPD] dynamic range quant model support do const folder
* [TOOLS][UPD] dynamic range quant updates usage
Co-authored-by: ealinli <ealinli@tencent.com>

* 1. make model_check support dynamic range quantized model; (#1653)

* [ADD][TUTORIAL] add mbv2-ssd conversion and deployment tutorial (#1640)
* [ADD][TUTORIAL] add mbv2-ssd conversion and deployment tutorial
* [TUTORIAL][UPD] update code link
* [TUTORIAL][UPD] fix typo
Co-authored-by: ealinli <ealinli@tencent.com>

* [X86][FIX] binary op support fp16 weights (#1655)
* [X86][FIX] binary op support fp16 weights
* [X86][FIX] matmul support fp16 weights
Co-authored-by: ealinli <ealinli@tencent.com>

* Feature dynamic quant fc (#1660)
* [DYNAMIC_QUANT][UPD]1. dynamic quant support inner_product layer;
* [ARM][UPD]1. arm gemm uses the Kahan sum algorithm in some cases to avoid fp16 accumulation error;

* [FIX][CPU][TRT] Fix CPU Not OP bug, Fix TensorRT ShapeTensor Class Bug. (#1663)
* [FIX] Fix CPU Not Operator data type error.
* [FIX] Fix TensorRT ShapeTensor class ConvertTo1D() func bug

* fix _mm256_load_ps segmentation fault (#1682)
* fix _mm256_load_ps segmentation fault
* fix crash on mm256_load when innerproduct
* use loadu instead of stride-judgement
* remove unused code
Co-authored-by: fishdai <fishdai@tencent.com>

* x86_acc & blob_converter now will consider the BlobHandle.bytes_offset (#1684)

* Dev x86 layer adapter (#1683)
* [X86] add layer acc adapter
* [X86] NULL to nullptr
* [X86][OPENVINO] add openvino adapter layer builder, fallback to cpu naive impl if there is no normal ov layer builder
* [X86][OPENVINO] fix hard code of ov precision
Co-authored-by: anonymous <anonymous@mail.org>

* [ARM] fix arm cross compile error caused by float-abi (#1678)

* avoid nullptr in IsSupport (#1685)

* [UPD][TOOLS] 1.increase subs_length 2.align model support bool and int32 input 3. fix gather and onehot convert 4. gather_nd support indices_shape[-1] < r (#1686)
Co-authored-by: ealinli <ealinli@tencent.com>

* Dev metal ngray (#1693)
* [METAL] metal support ngray input mat
* [METAL]fix bytes_size
* [COREML] fix dynamic quantization model about coreml
Co-authored-by: jacinhu <jacinhu@tencent.com>
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>

* [UPD][DRQ] support quantizing matmul's const weight (#1698)
* [UPD][DRQ] support quantizing matmul's const weight
* [UPD][DRQ] add scale check in constant map
Co-authored-by: ealinli <ealinli@tencent.com>

* [FIX] fix compile macos framework (#1687)
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>

* Optimize dynamic range quantize (#1699)
* [DynamicRangeQuantize][UPD]1. add logic that decides whether to quantize based on the weight distribution;
* [DynamicQuantization][UPD]1. dynamic_range_quantization support TNN fp16 model;
* [DRQ][UPD]1. fix the bug where the reference file specified in the model_check_android.sh script was not used during inference; 2. optimize part of the code in dynamic_range_quantization;
* [DRQ][UPD]1.fix conflict with merge master code;
Co-authored-by: ealinli <37806708+1627180283@users.noreply.github.com>

* Fix windows x86 build (#1697)
* [FIX] remove nanodet for windows
* remove ninja compile for some bug
* fix x86 mat type register macro name
* fix x86 matmul with 2 inputs
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>

* [METAL] fix stride slice crash when dims is 2 (#1701)
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>

* [mac] 1. FIX X86 and ARM conflict; 2. ADD ARM arch on intel cpu (You can use ARM if rosetta-X86 crash). 3. Use ios project build/profile M1-Mac. (#1700)
Co-authored-by: gennyxu <gennyxu@tencent.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>

* [iOS][UPD]1. add missing file for xcode project; (#1705)

* [BUG]fix coreml error of slicev2, padv2 and matmul; (#1703)
* [BUG]fix YouTu face alignment model
* [UPD]update mean pts file logic
* [UPD]draw face points green
* [UPD]unify example controller list
* [UPD]unify example controller list
* [UPD]move blaze anchor file to resource
* [METAL]update tnn project
* [UPD]update tool onnx2coreml
* [ADD]support ShareCommandQueue between instances
* [ADD]support ShareCommandQueue between instances
* [UPD]add log message
* [UPD]transfer file half.hpp
* [UPD]fix xcode compile error with fp16
* [UPD]fix xcode compile error with fp16
* [UPD]update model type error msg
* [FIX]fix logic error of constofshape
* [UPD]update debug message
* [FIX]support int32 for neg op
* [BUG]fix init error with nil commandbuffer
* [UPD]add mac build xcode project; fix ios mac build script;
* [UPD]add mac build xcode project; fix ios mac build script;
* [ADD]add QQ group 2 of TNN
* [BUG]fix dynamic dequant error; fix arm pad error;
* [BUG]support coreml padv2
* [BUG]fix coreml matmul error when it has const input blob
* [BUG]fix coreml slicev2
* [UPD]add convert logic of swish
* [BUG]fix error cpu error for x86 mac
* [UPD]support fusion for gemm + bn
* [UPD]add convert logic of swish
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>

* [UPD]update merge logic for swish groupnorm deconv (#1708)
* [BUG]fix YouTu face alignment model
* [UPD]update mean pts file logic
* [UPD]draw face points green
* [UPD]unify example controller list
* [UPD]unify example controller list
* [UPD]move blaze anchor file to resource
* [METAL]update tnn project
* [UPD]update tool onnx2coreml
* [ADD]support ShareCommandQueue between instances
* [ADD]support ShareCommandQueue between instances
* [UPD]add log message
* [UPD]transfer file half.hpp
* [UPD]fix xcode compile error with fp16
* [UPD]fix xcode compile error with fp16
* [UPD]update model type error msg
* [FIX]fix logic error of constofshape
* [UPD]update debug message
* [FIX]support int32 for neg op
* [BUG]fix init error with nil commandbuffer
* [UPD]add mac build xcode project; fix ios mac build script;
* [UPD]add mac build xcode project; fix ios mac build script;
* [ADD]add QQ group 2 of TNN
* [BUG]fix dynamic dequant error; fix arm pad error;
* [BUG]support coreml padv2
* [BUG]fix coreml matmul error when it has const input blob
* [BUG]fix coreml slicev2
* [UPD]add convert logic of swish
* [BUG]fix error cpu error for x86 mac
* [UPD]support fusion for gemm + bn
* [UPD]add convert logic of swish
* [UPD]support fusion for deconv+add and deconv+add+bn
* [UPD]add aliyun disk link for tnn models
* [UPD]support fusion for group norm
* [UPD]support fusion for swish
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>

* [DRQ][BUG]1. fix bug for max_values; (#1716)

* Hotfix m1 build (#1715)
* fix apple m1 clang 13.1 compile error
* fix unit test compile error
Co-authored-by: quinnrong <quinnrong@quinnrongs-MacBook-Pro.local>

Co-authored-by: ealinli <37806708+1627180283@users.noreply.github.com>
Co-authored-by: shenpenwang <41420892+Maosquerade@users.noreply.github.com>
Co-authored-by: neiltian <65950677+neiltian-tencent@users.noreply.github.com>
Co-authored-by: sxj731533730 <sxj731533730@gmail.com>
Co-authored-by: Yulv-git <34329208+Yulv-git@users.noreply.github.com>
Co-authored-by: darrenyao87 <62542779+darrenyao87@users.noreply.github.com>
Co-authored-by: quinnrong94 <67782915+quinnrong94@users.noreply.github.com>
Co-authored-by: lucasktian <lucasktian@tencent.com>
Co-authored-by: powerpwang <72859430+powerpwang@users.noreply.github.com>
Co-authored-by: ealinli <37806708+1627180283@users.noreply.github.com>
Co-authored-by: powerpwang <powerpwang@outlook.com>
Co-authored-by: ealinli <ealinli@tencent.com>
Co-authored-by: Dandi Ding <bluaxe@users.noreply.github.com>
Co-authored-by: saner zheng <zqawszqaws@126.com>
Co-authored-by: sanerzheng <sanerzheng@tencent.com>
Co-authored-by: Feng Shijie <j514681085@icloud.com>
Co-authored-by: Dandiding <Dandiding@tencent.com>
Co-authored-by: FeiGeChuanShu <774074168@qq.com>
Co-authored-by: seanxcwang <66675860+seanxcwang@users.noreply.github.com>
Co-authored-by: doxutx <92915535+doxutx@users.noreply.github.com>
Co-authored-by: kumbayaco <xyu.dai@gmail.com>
Co-authored-by: fishdai <fishdai@tencent.com>
Co-authored-by: anonymous <anonymous@mail.org>
Co-authored-by: jacinhu <jacinhu@tencent.com>
Co-authored-by: XDC <196890111@qq.com>
Co-authored-by: gennyxu <gennyxu@tencent.com>
Co-authored-by: quinnrong <quinnrong@quinnrongs-MacBook-Pro.local>
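
The [ARM][UPD] entry under #1660 mentions using the Kahan sum algorithm in the ARM GEMM to limit fp16 accumulation error. The kernel itself is not part of this log, so the block below is only a rough sketch of the general technique on plain `float` values; `NaiveSum` and `KahanSum` are hypothetical names chosen for this illustration and are not TNN functions.

```cpp
#include <vector>

// Plain accumulation: once the running sum is much larger than the addend,
// the low-order bits of each addend are rounded away.
float NaiveSum(const std::vector<float>& values) {
    float sum = 0.0f;
    for (float v : values) {
        sum += v;
    }
    return sum;
}

// Kahan (compensated) summation: `compensation` carries the rounding error
// of the previous addition and feeds it back into the next one.
float KahanSum(const std::vector<float>& values) {
    float sum = 0.0f;
    float compensation = 0.0f;
    for (float v : values) {
        float corrected = v - compensation;          // re-inject the lost low bits
        float next = sum + corrected;                // low bits of `corrected` may be lost here
        compensation = (next - sum) - corrected;     // recover what was just lost
        sum = next;
    }
    return sum;
}
```

With one 1.0f followed by many 1e-8f terms, the naive loop stays at 1.0f because each addend is below half a unit in the last place, while the compensated loop recovers the small terms. Note that aggressive floating-point flags such as -ffast-math may allow a compiler to simplify the compensation term away, so this pattern assumes strict floating-point semantics.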