Skip to content
master
Switch branches/tags
Code

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
Apr 13, 2022
May 17, 2022

The collection of pre-trained, state-of-the-art AI models.

About ailia SDK

ailia SDK is a self-contained cross-platform high speed inference SDK. The ailia SDK provides a consistent C++ API on Windows, Mac, Linux, iOS, Android, Jetson and Raspberry Pi. It supports Unity, Python and JNI for efficient AI implementation. The ailia SDK makes great use of the GPU via Vulkan and Metal to serve accelerated computing.

How to use

ailia MODELS tutorial

Supported models

Action recognition

Model Reference Exported From Supported Ailia Version Blog
mars MARS: Motion-Augmented RGB Stream for Action Recognition Pytorch 1.2.4 and later EN JP
st-gcn ST-GCN Pytorch 1.2.5 and later EN JP
ax_action_recognition Realtime-Action-Recognition Pytorch 1.2.7 and later
va-cnn View Adaptive Neural Networks (VA) for Skeleton-based Human Action Recognition Pytorch 1.2.7 and later
driver-action-recognition-adas driver-action-recognition-adas-0002 OpenVINO 1.2.5 and later

Anomaly detection

Model Reference Exported From Supported Ailia Version Blog
padim PaDiM-Anomaly-Detection-Localization-master Pytorch 1.2.6 and later EN JP
spade-pytorch Sub-Image Anomaly Detection with Deep Pyramid Correspondences Pytorch 1.2.6 and later

Audio processing

Model Reference Exported From Supported Ailia Version Blog
crnn_audio_classification crnn-audio-classification Pytorch 1.2.5 and later EN JP
deepspeech2 deepspeech.pytorch Pytorch 1.2.2 and later EN JP
pytorch-dc-tts Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention Pytorch 1.2.6 and later EN JP
unet_source_separation source_separation Pytorch 1.2.6 and later EN JP
transformer-cnn-emotion-recognition Combining Spatial and Temporal Feature Representions of Speech Emotion by Parallelizing CNNs and Transformer-Encoders Pytorch 1.2.5 and later
auto_speech AutoSpeech: Neural Architecture Search for Speaker Recognition Pytorch 1.2.5 and later EN JP
voicefilter VoiceFilter Pytorch 1.2.7 and later JP

Background removal

Model Reference Exported From Supported Ailia Version Blog
U-2-Net U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection Pytorch 1.2.2 and later EN JP
u2net-portrait-matting U^2-Net - Portrait matting Pytorch 1.2.7 and later
u2net-human-seg U^2-Net - human segmentation Pytorch 1.2.4 and later
deep-image-matting Deep Image Matting Keras 1.2.3 and later EN JP
indexnet Indices Matter: Learning to Index for Deep Image Matting Pytorch 1.2.7 and later
modnet MODNet: Trimap-Free Portrait Matting in Real Time Pytorch 1.2.7 and later
background_matting_v2 Real-Time High-Resolution Background Matting Pytorch 1.2.9 and later
cascade_psp CascadePSP Pytorch 1.2.9 and later
rembg Rembg Pytorch 1.2.4 and later

Crowd counting

Model Reference Exported From Supported Ailia Version Blog
crowdcount-cascaded-mtl CNN-based Cascaded Multi-task Learning of
High-level Prior and Density Estimation for Crowd Counting
(Single Image Crowd Counting)
Pytorch 1.2.1 and later EN JP
c-3-framework Crowd Counting Code Framework(C^3-Framework) Pytorch 1.2.5 and later

Deep fashion

Model Reference Exported From Supported Ailia Version Blog
clothing-detection Clothing-Detection Pytorch 1.2.1 and later EN JP
mmfashion MMFashion Pytorch 1.2.5 and later EN JP
mmfashion_tryon MMFashion virtual try-on Pytorch 1.2.8 and later
mmfashion_retrieval MMFashion In-Shop Clothes Retrieval Pytorch 1.2.5 and later
fashionai-key-points-detection A Pytorch Implementation of Cascaded Pyramid Network for FashionAI Key Points Detection Pytorch 1.2.5 and later

Depth estimation

Model Reference Exported From Supported Ailia Version Blog
monodepth2 Monocular depth estimation from a single image Pytorch 1.2.2 and later
midas Towards Robust Monocular Depth Estimation:
Mixing Datasets for Zero-shot Cross-dataset Transfer
Pytorch 1.2.4 and later EN JP
fcrn-depthprediction Deeper Depth Prediction with Fully Convolutional Residual Networks TensorFlow 1.2.6 and later
fast-depth ICRA 2019 "FastDepth: Fast Monocular Depth Estimation on Embedded Systems" Pytorch 1.2.5 and later
lap-depth LapDepth-release Pytorch 1.2.9 and later
hitnet ONNX-HITNET-Stereo-Depth-estimation Pytorch 1.2.9 and later

Face detection

Model Reference Exported From Supported Ailia Version Blog
yolov1-face YOLO-Face-detection Darknet 1.1.0 and later
yolov3-face Face detection using keras-yolov3 Keras 1.2.1 and later
blazeface BlazeFace-PyTorch Pytorch 1.2.1 and later EN JP
face-mask-detection Face detection using keras-yolov3 Keras 1.2.1 and later EN JP
dbface DBFace : real-time, single-stage detector for face detection,
with faster speed and higher accuracy
Pytorch 1.2.2 and later
retinaface RetinaFace: Single-stage Dense Face Localisation in the Wild. Pytorch 1.2.5 and later
anime-face-detector Anime Face Detector Pytorch 1.2.6 and later

Face identification

Model Reference Exported From Supported Ailia Version Blog
vggface2 VGGFace2 Dataset for Face Recognition Caffe 1.1.0 and later
arcface pytorch implement of arcface Pytorch 1.2.1 and later EN JP
insightface InsightFace: 2D and 3D Face Analysis Project Pytorch 1.2.5 and later

Face recognition

Model Reference Exported From Supported Ailia Version Blog
face_classification Real-time face detection and emotion/gender classification Keras 1.1.0 and later
facial_feature kaggle-facial-keypoints Pytorch 1.2.0 and later
face_alignment 2D and 3D Face alignment library build using pytorch Pytorch 1.2.1 and later EN JP
prnet Joint 3D Face Reconstruction and Dense Alignment
with Position Map Regression Network
TensorFlow 1.2.2 and later
gazeml A deep learning framework based on Tensorflow
for the training of high performance gaze estimation
TensorFlow 1.2.0 and later
facemesh facemesh.pytorch Pytorch 1.2.2 and later EN JP
mediapipe_iris irislandmarks.pytorch Pytorch 1.2.2 and later EN JP
hopenet deep-head-pose Pytorch 1.2.2 and later EN JP
ax_gaze_estimation ax Gaze Estimation Pytorch 1.2.2 and later EN JP
age-gender-recognition-retail age-gender-recognition-retail-0013 OpenVINO 1.2.5 and later EN JP
ferplus FER+ CNTK 1.2.2 and later
face-anti-spoofing Lightweight Face Anti Spoofing Pytorch 1.2.5 and later EN JP
ax_facial_features ax Facial Features Pytorch 1.2.5 and later EN

Frame Interpolation

Model Reference Exported From Supported Ailia Version Blog
flavr FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation Pytorch 1.2.7 and later EN JP
cain Channel Attention Is All You Need for Video Frame Interpolation Pytorch 1.2.5 and later

Generative adversarial networks

Model Reference Exported From Supported Ailia Version Blog
pytorch-gan Code repo for the Pytorch GAN Zoo project (used to train this model) Pytorch 1.2.4 and later
council-GAN Council-GAN Pytorch 1.2.4 and later
restyle-encoder ReStyle Pytorch 1.2.9 and later
sam Age Transformation Using a Style-Based Regression Model Pytorch 1.2.9 and later

Hand detection

Model Reference Exported From Supported Ailia Version Blog
yolov3-hand Hand detection branch of Face detection using keras-yolov3 Keras 1.2.1 and later
hand_detection_pytorch hand-detection.PyTorch Pytorch 1.2.2 and later
blazepalm MediaPipePyTorch Pytorch 1.2.5 and later

Hand recognition

Model Reference Exported From Supported Ailia Version Blog
blazehand MediaPipePyTorch Pytorch 1.2.5 and later EN JP
hand3d ColorHandPose3D network TensorFlow 1.2.5 and later
minimal-hand Minimal Hand TensorFlow 1.2.8 and later

Image captioning

Model Reference Exported From Supported Ailia Version Blog
illustration2vec Illustration2Vec Caffe 1.2.2 and later
image_captioning_pytorch Image Captioning pytorch Pytorch 1.2.5 and later EN JP

Image classification

Model Reference Exported From Supported Ailia Version Blog
vgg16 Very Deep Convolutional Networks for Large-Scale Image Recognition Keras 1.1.0 and later
googlenet Going Deeper with Convolutions Pytorch 1.2.0 and later
resnet50 Deep Residual Learning for Image Recognition Chainer 1.2.0 and later
inceptionv3 Rethinking the Inception Architecture for Computer Vision Pytorch 1.2.0 and later JP
inceptionv4 Keras Inception-V4 Keras 1.2.5 and later
mobilenetv2 PyTorch Implemention of MobileNet V2 Pytorch 1.2.0 and later
mobilenetv3 PyTorch Implemention of MobileNet V3 Pytorch 1.2.1 and later
partialconv Partial Convolution Layer for Padding and Image Inpainting Pytorch 1.2.0 and later
efficientnet A PyTorch implementation of EfficientNet Pytorch 1.2.3 and later
efficientnetv2 EfficientNetV2 Pytorch 1.2.4 and later
vit Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale) Pytorch 1.2.7 and later JP
wide_resnet50 Wide Resnet Pytorch 1.2.5 and later
resnet18 ResNet18 Pytorch 1.2.8 and later
mlp_mixer MLP-Mixer Pytorch 1.2.9 and later
alexnet AlexNet PyTorch Pytorch 1.2.5 and later
clip CLIP Pytorch 1.2.9 and later EN JP
landmarks_classifier_asia Landmarks classifier_asia_V1.1 TensorFlow Hub 1.2.4 and later EN JP
weather-prediction-from-image Weather Prediction From Image - (Warmth Of Image) Keras 1.2.5 and later
swin-transformer Swin Transformer Pytorch 1.2.6 and later
convnext A PyTorch implementation of ConvNeXt Pytorch 1.2.5 and later

Image inpainting

Model Reference Exported From Supported Ailia Version Blog
inpainting-with-partial-conv pytorch-inpainting-with-partial-conv PyTorch 1.2.6 and later EN JP
inpainting_gmcnn Image Inpainting via Generative Multi-column Convolutional Neural Networks TensorFlow 1.2.6 and later
3d-photo-inpainting 3D Photography using Context-aware Layered Depth Inpainting Pytorch 1.2.7 and later
deepfillv2 Free-Form Image Inpainting with Gated Convolution Pytorch 1.2.9 and later

Image manipulation

Model Reference Exported From Supported Ailia Version Blog
noise2noise Learning Image Restoration without Clean Data Pytorch 1.2.0 and later
dewarpnet DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks Pytorch 1.2.1 and later
illnet Document Rectification and Illumination Correction using a Patch-based CNN Pytorch 1.2.2 and later
colorization Colorful Image Colorization Pytorch 1.2.2 and later EN JP
u2net_portrait U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection Pytorch 1.2.2 and later
style2paints Style2Paints TensorFlow 1.2.6 and later
deep_white_balance Deep White-Balance Editing, CVPR 2020 (Oral) PyTorch 1.2.6 and later
deblur_gan DeblurGAN Pytorch 1.2.6 and later
invertible_denoising_network Invertible Image Denoising Pytorch 1.2.8 and later

Image segmentation

Model Reference Exported From Supported Ailia Version Blog
deeplabv3 Xception65 for backbone network of DeepLab v3+ Chainer 1.2.0 and later
hrnet_segmentation High-resolution networks (HRNets) for Semantic Segmentation Pytorch 1.2.1 and later
hair_segmentation hair segmentation in mobile device Keras 1.2.1 and later
pspnet-hair-segmentation pytorch-hair-segmentation Pytorch 1.2.2 and later
human_part_segmentation Self Correction for Human Parsing Pytorch 1.2.4 and later EN JP
semantic-segmentation-mobilenet-v3 Semantic segmentation with MobileNetV3 TensorFlow 1.2.5 and later
pytorch-unet Pytorch-Unet Pytorch 1.2.5 and later
pytorch-enet PyTorch-ENet Pytorch 1.2.8 and later
yet-another-anime-segmenter Yet Another Anime Segmenter Pytorch 1.2.6 and later
swiftnet SwiftNet Pytorch 1.2.6 and later
dense_prediction_transformers Vision Transformers for Dense Prediction Pytorch 1.2.7 and later EN JP
paddleseg PaddleSeg Pytorch 1.2.7 and later JP
pp_liteseg PP-LiteSeg Pytorch 1.2.10 and later
suim SUIM Keras 1.2.6 and later

Line segment detection

Model Reference Exported From Supported Ailia Version Blog
mlsd M-LSD: Towards Light-weight and Real-time Line Segment Detection TensorFlow 1.2.8 and later EN JP
dexined DexiNed: Dense Extreme Inception Network for Edge Detection Pytorch 1.2.5 and later

Low Light Image Enhancement

Model Reference Exported From Supported Ailia Version Blog
agllnet AGLLNet: Attention Guided Low-light Image Enhancement (IJCV 2021) Pytorch 1.2.9 and later EN JP

Natural language processing

Model Reference Exported From Supported Ailia Version Blog
bert pytorch-pretrained-bert Pytorch 1.2.2 and later EN JP
bert_maskedlm huggingface/transformers Pytorch 1.2.5 and later
bert_ner huggingface/transformers Pytorch 1.2.5 and later
bert_question_answering huggingface/transformers Pytorch 1.2.5 and later
bert_sentiment_analysis huggingface/transformers Pytorch 1.2.5 and later
bert_zero_shot_classification huggingface/transformers Pytorch 1.2.5 and later
bert_tweets_sentiment huggingface/transformers Pytorch 1.2.5 and later
gpt2 GPT-2 Pytorch 1.2.7 and later
rinna_gpt2 japanese-pretrained-models Pytorch 1.2.7 and later

Neural Rendering

Model Reference Exported From Supported Ailia Version Blog
nerf NeRF: Neural Radiance Fields Tensorflow 1.2.10 and later

Object detection

Model Reference Exported From Supported Ailia Version Blog
yolov1-tiny YOLO: Real-Time Object Detection Darknet 1.1.0 and later
yolov2 YOLO: Real-Time Object Detection Pytorch 1.2.0 and later
yolov2-tiny YOLO: Real-Time Object Detection Pytorch 1.2.6 and later
yolov3 YOLO: Real-Time Object Detection ONNX Runtime 1.2.1 and later EN JP
yolov3-tiny YOLO: Real-Time Object Detection ONNX Runtime 1.2.1 and later
yolov4 Pytorch-YOLOv4 Pytorch 1.2.4 and later EN JP
yolov4-tiny Pytorch-YOLOv4 Pytorch 1.2.5 and later
yolov5 yolov5 Pytorch 1.2.5 and later EN JP
yolor yolor Pytorch 1.2.5 and later
yolox YOLOX Pytorch 1.2.6 and later EN JP
yolox-ti-lite edgeai-yolox Pytorch 1.2.9 and later
mobilenet_ssd MobileNetV1, MobileNetV2, VGG based SSD/SSD-lite implementation in Pytorch Pytorch 1.2.1 and later EN JP
maskrcnn Mask R-CNN: real-time neural network for object instance segmentation Pytorch 1.2.3 and later
m2det M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Pytorch 1.2.3 and later EN JP
centernet CenterNet : Objects as Points Pytorch 1.2.1 and later EN JP
pedestrian_detection Pedestrian-Detection-on-YOLOv3_Research-and-APP Keras 1.2.1 and later
efficientdet EfficientDet: Scalable and Efficient Object Detection, in PyTorch Pytorch 1.2.6 and later
nanodet NanoDet Pytorch 1.2.6 and later
mobile_object_localizer mobile_object_localizer_v1 TensorFlow Hub 1.2.6 and later JP
sku110k-densedet SKU110K-DenseDet Pytorch 1.2.9 and later JP
traffic-sign-detection Traffic Sign Detection Tensorflow 1.2.10 and later EN JP
detic Detecting Twenty-thousand Classes using Image-level Supervision Pytorch 1.2.10 and later JP

Object detection 3d

Model Reference Exported From Supported Ailia Version Blog
3d_bbox 3D Bounding Box Estimation Using Deep Learning and Geometry Pytorch 1.2.6 and later
3d-object-detection.pytorch 3d-object-detection.pytorch Pytorch 1.2.8 and later EN JP
mediapipe_objectron MediaPipe Objectron TensorFlow Lite 1.2.5 and later
egonet EgoNet Pytorch 1.2.9 and later
d4lcn D4LCN Pytorch 1.2.9 and later

Object tracking

Model Reference Exported From Supported Ailia Version Blog
deepsort Deep Sort with PyTorch Pytorch 1.2.3 and later EN JP
person_reid_baseline_pytorch UTS-Person-reID-Practical Pytorch 1.2.6 and later
abd_net Attentive but Diverse Person Re-Identification Pytorch 1.2.7 and later
siam-mot SiamMOT Pytorch 1.2.9 and later
bytetrack ByteTrack Pytorch 1.2.5 and later JP 
qd-3dt Monocular Quasi-Dense 3D Object Tracking Pytorch 1.2.11 and later  

Optical Flow Estimation

Model Reference Exported From Supported Ailia Version Blog
raft RAFT: Recurrent All Pairs Field Transforms for Optical Flow Pytorch 1.2.6 and later JP 

Point segmentation

Model Reference Exported From Supported Ailia Version Blog
pointnet_pytorch PointNet.pytorch Pytorch 1.2.6 and later

Pose estimation

Model Reference Exported From Supported Ailia Version Blog
openpose Code repo for realtime multi-person pose estimation in CVPR'17 (Oral) Caffe 1.2.1 and later
lightweight-human-pose-estimation Fast and accurate human pose estimation in PyTorch.
Contains implementation of
"Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.
Pytorch 1.2.1 and later EN JP
pose_resnet Simple Baselines for Human Pose Estimation and Tracking Pytorch 1.2.1 and later EN JP
blazepose MediaPipePyTorch Pytorch 1.2.5 and later
efficientpose Code repo for EfficientPose TensorFlow 1.2.6 and later
movenet Code repo for movenet TensorFlow 1.2.8 and later EN JP
animalpose MMPose - 2D animal pose estimation Pytorch 1.2.7 and later EN JP

Pose estimation 3d

Model Reference Exported From Supported Ailia Version Blog
lightweight-human-pose-estimation-3d Real-time 3D multi-person pose estimation demo in PyTorch.
OpenVINO backend can be used for fast inference on CPU.
Pytorch 1.2.1 and later
3d-pose-baseline A simple baseline for 3d human pose estimation in tensorflow.
Presented at ICCV 17.
TensorFlow 1.2.3 and later
pose-hg-3d Towards 3D Human Pose Estimation in the Wild: a Weakly-supervised Approach Pytorch 1.2.6 and later
blazepose-fullbody MediaPipe TensorFlow Lite 1.2.5 and later EN JP
3dmppe_posenet PoseNet of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image" Pytorch 1.2.6 and later
gast A Graph Attention Spatio-temporal Convolutional Networks for 3D Human Pose Estimation in Video (GAST-Net) Pytorch 1.2.7 and later EN JP

Road detection

Model Reference Exported From Supported Ailia Version Blog
codes-for-lane-detection Codes-for-Lane-Detection Pytorch 1.2.6 and later EN JP
roneld RONELD-Lane-Detection Pytorch 1.2.6 and later
road-segmentation-adas road-segmentation-adas-0001 OpenVINO 1.2.5 and later
cdnet CDNet Pytorch 1.2.5 and later
lstr LSTR Pytorch 1.2.8 and later
ultra-fast-lane-detection Ultra-Fast-Lane-Detection Pytorch 1.2.6 and later
yolop YOLOP Pytorch 1.2.6 and later
hybridnets HybridNets Pytorch 1.2.6 and later

Rotation prediction

Model Reference Exported From Supported Ailia Version Blog
rotnet CNNs for predicting the rotation angle of an image to correct its orientation Keras 1.2.1 and later

Style transfer

Model Reference Exported From Supported Ailia Version Blog
adain Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization Pytorch 1.2.1 and later EN JP
psgan PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer Pytorch 1.2.7 and later
beauty_gan BeautyGAN Pytorch 1.2.7 and later
animeganv2 PyTorch Implementation of AnimeGANv2 Pytorch 1.2.5 and later
pix2pixHD pix2pixHD: High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs Pytorch 1.2.6 and later

Super resolution

Model Reference Exported From Supported Ailia Version Blog
srresnet Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network Pytorch 1.2.0 and later EN JP
edsr Enhanced Deep Residual Networks for Single Image Super-Resolution Pytorch 1.2.6 and later EN JP
han Single Image Super-Resolution via a Holistic Attention Network Pytorch 1.2.6 and later
real-esrgan Real-ESRGAN Pytorch 1.2.9 and later

Text detection

Model Reference Exported From Supported Ailia Version Blog
craft_pytorch CRAFT: Character-Region Awareness For Text detection Pytorch 1.2.2 and later
pixel_link Pixel-Link TensorFlow 1.2.6 and later
east EAST: An Efficient and Accurate Scene Text Detector TensorFlow 1.2.6 and later

Text recognition

Model Reference Exported From Supported Ailia Version Blog
etl Japanese Character Classification Keras 1.1.0 and later JP
deep-text-recognition-benchmark deep-text-recognition-benchmark Pytorch 1.2.6 and later
crnn.pytorch Convolutional Recurrent Neural Network Pytorch 1.2.6 and later
paddleocr PaddleOCR : Awesome multilingual OCR toolkits based on PaddlePaddle Pytorch 1.2.6 and later EN JP
easyocr Ready-to-use OCR with 80+ supported languages Pytorch 1.2.6 and later

Vehicle recognition

Model Reference Exported From Supported Ailia Version Blog
vehicle-attributes-recognition-barrier vehicle-attributes-recognition-barrier-0042 OpenVINO 1.2.5 and later EN JP
vehicle-license-plate-detection-barrier vehicle-license-plate-detection-barrier-0106 OpenVINO 1.2.5 and later

Commercial model

Model Reference Exported From Supported Ailia Version Blog
acculus-pose Acculus, Inc. Caffe 1.2.3 and later

Other languages

unity version

c++ version