Compile

Chinese version

I. Compile (iOS)

1. Environment requirements

  • Mac with the Xcode IDE
  • cmake (version 3.1 or higher)

2. Steps

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Execute the build script

./build_framework_ios.sh

If the xcrun, metal, or metallib commands cannot be found during the compilation, try the following command.

sudo xcode-select -s /Applications/Xcode.app/Contents/Developer/

After the compilation completes, the tnn.framework library and tnn.bundle resources are generated under the directory platforms/ios.

3) Add to the project

  • Add the tnn.framework library and tnn.bundle resources under the root directory of the iOS app project;
  • Find the Build Settings -> Linking -> Other Linker Flags option in the settings of the app's Xcode project;
  • Add -force_load "$(SRCROOT)/tnn.framework" to it, as shown in the sketch below;
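If the project is built from the command line or in CI, the same flag can be passed as a build-setting override. Below is a minimal sketch; the project and scheme names (MyApp) are hypothetical, and OTHER_LDFLAGS is the build setting behind the Other Linker Flags option mentioned above.

 # Hypothetical CI invocation; MyApp.xcodeproj and the scheme name are placeholders.
 xcodebuild -project MyApp.xcodeproj \
            -scheme MyApp \
            OTHER_LDFLAGS='-force_load "$(SRCROOT)/tnn.framework"'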

3. Restrictions

Currently, the compiled tnn.framework supports running on both the CPU and GPU of iOS devices, but only on the GPU of Macs. Support for the Mac CPU will come in future TNN updates.

II. Compile (Android)

1. Environment requirements

Dependency

  • cmake (version 3.6 or higher)

NDK configuration
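build_android.sh expects the NDK to be discoverable; below is a minimal sketch of the environment setup, assuming the script reads a variable such as ANDROID_NDK (verify the exact name in build_android.sh).

 # Assumed variable name; check build_android.sh for the one it actually reads.
 export ANDROID_NDK=<path_to_ndk>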

2. Compile

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Edit build_android.sh to configure the build options

 ABIA32="armeabi-v7a with NEON"
 ABIA64="arm64-v8a"
 STL="c++_static"
 SHARED_LIB="ON"                # ON for dynamic lib, OFF for static lib
 ARM="ON"                       # ON to build for ARM CPU
 OPENMP="ON"                    # ON to enable OpenMP
 OPENCL="ON"                    # ON to build for GPU
 HUAWEI_NPU="ON"                # ON to enable HUAWEI NPU
 SHARING_MEM_WITH_OPENGL=0      # 1 to share OpenGL textures with OpenCL

Note for Huawei NPU: you need to download the DDK library files and copy them to the specified directory; a script is provided to do this. See HuaweiNPU Compilation Prerequisite in the FAQ.
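As an illustration of how these options combine, a hypothetical CPU-only configuration (for example, when the Huawei DDK is not available) could change just two values:

 # Hypothetical variant of the listing above; only the changed values are shown.
 OPENCL="OFF"                   # disable the GPU backend
 HUAWEI_NPU="OFF"               # disable the NPU backend, so no DDK is needed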

3) Execute the build script

./build_android.sh

After the compilation completes, the corresponding armeabi-v7a library, the arm64-v8a library, and the include headers are generated in the release directory under the current directory. Note that if the TNN static library is compiled, you need to add -Wl,--whole-archive tnn -Wl,--no-whole-archive to your project's link options, as sketched below.
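For example, a link line for a static TNN build might look like the following sketch; the compiler, the release-directory layout, and the library name are placeholders to adapt to your project.

 # Hypothetical link command: --whole-archive forces the linker to keep every
 # object in the TNN archive, so operators registered via static initializers
 # are not stripped.
 ${CXX} app.o -L<path_to_tnn>/scripts/release/arm64-v8a \
     -Wl,--whole-archive -ltnn -Wl,--no-whole-archive -o app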

III. Cross-Compile in Linux

1. Environment requirements

Dependencies

2. Compilation Steps

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Edit build_aarch64_linux.sh or build_armhf_linux.sh to configure the build options

 SHARED_LIB="ON"                # ON for dynamic lib, OFF for static lib
 ARM="ON"                       # ON to build for ARM CPU
 OPENMP="ON"                    # ON to enable OpenMP
 OPENCL="OFF"                   # ON to build for GPU
 RKNPU="OFF"                    # ON to build for RKNPU
 # for arm64:
 CC=aarch64-linux-gnu-gcc       # set the C compiler for aarch64
 CXX=aarch64-linux-gnu-g++      # set the C++ compiler for aarch64
 TARGET_ARCH=aarch64
 # for arm32hf:
 CC=arm-linux-gnueabihf-gcc     # set the C compiler for arm32hf
 CXX=arm-linux-gnueabihf-g++    # set the C++ compiler for arm32hf
 TARGET_ARCH=arm
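If the cross toolchains above are not installed yet, on Debian/Ubuntu hosts they are typically available from the package manager:

 # Debian/Ubuntu package names for the aarch64 and arm32hf cross toolchains.
 sudo apt install gcc-aarch64-linux-gnu g++-aarch64-linux-gnu
 sudo apt install gcc-arm-linux-gnueabihf g++-arm-linux-gnueabihf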

3) Execute the matching build script

./build_aarch64_linux.sh

Note for RKNPU: you need to download the DDK library files and copy them to the specified directory. Please see RKNPU Compilation Prerequisite in the FAQ.

IV. Compile (x86 Linux)

1. Environment requirements

Dependencies

  • cmake (version 3.11 or higher)

2. Compilation Steps

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Execute the build script

  • To compile without OpenVINO:

./build_linux_native.sh

  • To compile with OpenVINO:

./build_x86_linux.sh

Note: the OpenVINO build only produces 64-bit libraries and requires cmake version 3.13 or higher.
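A quick way to check which script your cmake supports:

 cmake --version    # build_x86_linux.sh (OpenVINO) needs >= 3.13; build_linux_native.sh needs >= 3.11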

V. Compile (Linux CUDA)

1. Environment requirements

Dependencies

  • cmake (version 3.8 or higher)
  • CUDA (version 10.2 or higher)

TensorRT configuration

CuDNN configuration

2. Compilation Steps

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Execute the build script
./build_cuda_linux.sh
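The build needs to locate TensorRT and cuDNN; below is a sketch assuming they are passed through the same environment variables the Jetson Nano section below uses (verify against build_cuda_linux.sh).

 # Variable names borrowed from the Jetson Nano section below; paths are placeholders.
 export TENSORRT_ROOT_DIR=<path_to_TensorRT>
 export CUDNN_ROOT_DIR=<path_to_cuDNN>
 ./build_cuda_linux.sh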

VI. Compile (x86 Windows)

1. Environment requirements

Dependencies

  • Visual Studio (version 2015 or higher)
  • cmake (version 3.11 or higher, or use the built-in cmake in Visual Studio)
  • ninja (for faster compilation; can be installed with Chocolatey)

2. Compilation Steps

Open the x64 Native Tools Command Prompt for VS 2017/2019, or open the x86 Native Tools Command Prompt for VS 2017/2019 to compile the 32-bit version.

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Execute the build script

  • To compile without OpenVINO:

.\build_msvc_native.bat

  • To compile with OpenVINO:

.\build_msvc.bat

Note: the OpenVINO build only produces 64-bit libraries. For other issues, refer to the FAQ.

VII. Compile (CUDA Windows)

1. Environment requirements

Dependencies

  • Visual Studio (version 2015 or higher)
  • cmake (version 3.11 or higher, or use the built-in cmake in Visual Studio)
  • CUDA (version 10.2 or higher); make sure CUDA_PATH is set in the environment variables

TensorRT configuration

CuDNN configuration

2. Compilation Steps

Open the x64 Native Tools Command Prompt for VS 2017/2019, or open cmd with the cmake environment variables set.

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Execute the build script
.\build_cuda_msvc.bat
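Below is a sketch of a matching cmd session; the CUDA path is an example value, and setting CUDA_PATH once in the system environment variables works equally well.

 :: Example value only; adjust to your CUDA installation (see the requirement above).
 set "CUDA_PATH=C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.2"
 .\build_cuda_msvc.bat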

VIII. Compile (macOS)

1. Environment requirements

Dependencies

  • cmake (version 3.11 or higher)
  • Xcode command line tools (install Xcode from the App Store, then execute xcode-select --install in a terminal)
  • automake, libtool (can be installed with brew: brew install libtool automake)
  • Network access

2. Compilation Steps

1) Switch to the 'scripts' directory

cd <path_to_tnn>/scripts

2) Execute the build script

./build_macos.sh

IX. Compile (Jetson Nano)

1. Environment requirements

Dependencies

  • Jetpack 4.6 or above

2. Compilation Steps

1) Create a build directory

cd <path_to_tnn>
mkdir build
cd build
2) Set the environment variables
export CUDACXX=/usr/local/cuda/bin/nvcc
export CUDNN_ROOT_DIR=/usr/lib/aarch64-linux-gnu
export CUBLAS_ROOT_DIR=/usr/local/cuda/lib64
export TENSORRT_ROOT_DIR=/usr/lib/aarch64-linux-gnu
3) Configure with cmake and build with ninja (ninja supports longer command lines)
PARALLEL_LEVEL=4 cmake -D CMAKE_BUILD_TYPE=Release \
      -D TNN_JETSON_NANO_ENABLE=ON \
      -D TNN_TEST_ENABLE=ON \
      -D TNN_QUANTIZATION_ENABLE=ON \
      -D TNN_BENCHMARK_MODE=OFF \
      -D TNN_BUILD_SHARED=ON \
      -D TNN_ONNX2TNN_ENABLE=ON \
      -D TNN_TNN2MEM_ENABLE=ON \
      -D TNN_CONVERTER_ENABLE=ON \
      -G Ninja ..
ninja
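Because the configuration above sets TNN_TEST_ENABLE=ON, the build should also produce a test binary that can serve as a quick smoke test. The binary name, location, and flags below are assumptions to verify against your build tree.

 # Hypothetical smoke test; check the actual binary name and flags under build/test.
 ./test/TNNTest -mp <path_to_model> -dt CUDA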

Description of build options

  • TNN_CPU_ENABLE (default ON): Compilation switch for source/device/cpu. The implementation is pure C++ and contains no CPU-specific acceleration instructions.
  • TNN_X86_ENABLE (default OFF): Compilation switch for source/device/x86. Currently adapted to the OpenVINO implementation; more accelerated code will be moved in later.
  • TNN_ARM_ENABLE (default OFF): Compilation switch for source/device/arm. The code contains NEON acceleration instructions and partially implements int8 acceleration.
  • TNN_ARM82_ENABLE (default OFF): Compilation switch for source/device/arm/acc/compute_arm82. The code implements fp16 acceleration.
  • TNN_METAL_ENABLE (default OFF): Compilation switch for source/device/metal. The code contains Metal acceleration kernels.
  • TNN_OPENCL_ENABLE (default OFF): Compilation switch for source/device/opencl. The code contains OpenCL acceleration kernels.
  • TNN_CUDA_ENABLE (default OFF): Compilation switch for source/device/cuda. The code contains CUDA acceleration kernels; currently only a small part of the implementation has been migrated.
  • TNN_DSP_ENABLE (default OFF): Compilation switch for source/device/dsp. Currently adapted to the SNPE implementation.
  • TNN_ATLAS_ENABLE (default OFF): Compilation switch for source/device/atlas. Currently adapted to Huawei's Atlas acceleration framework.
  • TNN_HUAWEI_NPU_ENABLE (default OFF): Compilation switch for source/device/huawei_npu. Currently adapted to the HiAI acceleration framework.
  • TNN_RK_NPU_ENABLE (default OFF): Compilation switch for source/device/rknpu. Currently adapted to the rknpu_ddk acceleration framework.
  • TNN_SYMBOL_HIDE (default ON): Hide the symbols of the acceleration library; non-public interface symbols are not visible in release builds.
  • TNN_OPENMP_ENABLE (default OFF): OpenMP switch; controls whether OpenMP acceleration is enabled.
  • TNN_BUILD_SHARED (default ON): Build a dynamic library; turn OFF to build a static library.
  • TNN_TEST_ENABLE (default OFF): Test code compilation switch.
  • TNN_UNIT_TEST_ENABLE (default OFF): Unit test compilation switch; enabling it automatically turns on TNN_CPU_ENABLE as the test benchmark.
  • TNN_PROFILER_ENABLE (default OFF): Performance profiling switch; when enabled, more performance information is printed. For debugging only.
  • TNN_QUANTIZATION_ENABLE (default OFF): Quantization tool compilation switch.
  • TNN_BENCHMARK_MODE (default OFF): Benchmark switch; when enabled, the model weights file may be empty and data is generated automatically.
  • TNN_ARM82_SIMU (default OFF): Armv8.2 simulation switch; enable together with TNN_ARM82_ENABLE to run the fp16 code on CPUs without half-precision support.
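These options combine freely in a plain cmake invocation, in the same style as the Jetson Nano example above. A hypothetical desktop configuration that enables the x86 backend and the unit tests (which, per the list above, automatically turn on TNN_CPU_ENABLE) might look like:

 # Hypothetical configuration; run from a fresh build directory under <path_to_tnn>.
 cmake -D CMAKE_BUILD_TYPE=Release \
       -D TNN_X86_ENABLE=ON \
       -D TNN_UNIT_TEST_ENABLE=ON \
       -D TNN_BUILD_SHARED=ON \
       ..
 make -j4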