Files

.github
001_deeplabv3
002_mobilenetv3-ssd
003_posenet
004_efficientnet
005_one_class_anomaly_detection
006_mobilenetv2-ssdlite
007_mobilenetv2-poseestimation
008_mask_rcnn_inceptionv2
009_multi-scale_local_planar_guidance_for_monocular_depth_estimation
010_mobilenetv3
011_mobilenetv2
012_Fast_Accurate_and_Lightweight_Super-Resolution
013_ml-sound-classifier
014_tf-monodepth2
015_Faster-Grad-CAM
016_EfficientNet-lite
017_Artistic-Style-Transfer
018_EfficientDet
019_White-box-Cartoonization
020_edgetpu-deeplab
021_edgetpu-deeplab-slim
022_Learning_to_See_Moving_Objects_in_the_Dark
023_yolov3-nano
024_yolov3-lite
025_head_pose_estimation
026_mobile-deeplabv3-plus
027_minimal-hand
028_struct2depth
029_human-pose-estimation-3d-0001
030_BlazeFace
031_yolov4
032_FaceMesh
033_Hand_Detection_and_Tracking
034_ssd_mobilenet_v2_mnasfpn_shared_box_predictor
035_BodyPix
036_Objectron
037_First_Neural_Style_Transfer
038_ssdlite_mobiledet_edgetpu
039_ssdlite_mobiledet_cpu
040_DSFD_vgg
041_DBFace
042_centernet
043_face_landmark
044_selfie2anime
045_ssd_mobilenet_v2_oid_v4
046_yolov4-tiny
047_SpineNetMB_49
048_mobile_bert
049_iris_landmark
050_AnimeGANv2
051_East_Text_Detection
052_Handwritten_Text_Recognition
053_BlazePose
054_KNIFT
055_Handwritten_Japanese_Recognition
056_TextBoxes++
057_BiSeNetV2
058_keras-retinanet
060_hair_segmentation
061_U-2-Net
062_facial_cartoonization
063_3d-bounding-box-estimation-for-autonomous-driving
064_Dense_Depth
065_ThreeDPoseUnityBarracuda
066_footprints
067_MiDaS
068_Colorful_Image_Colorization
069_ENet
070_age-gender-recognition
071_Noise2Noise
072_NanoDet
073_RetinaNet
074_Yolact
075_ERFNet
076_Deep_White_Balance
077_ESRGAN
078_MODNet
079_MIRNet
080_tf_pose_estimation
081_MiDaS_v2
082_MediaPipe_Meet_Segmentation
083_Person_Reidentification
084_EfficientPose
085_Yolact_Edge
086_defocus-deblurring-dual-pixel
087_DeepSort
088_mobilenetv3-poseestimation
089_DETR
090_Ghost-free_Shadow_Removal
091_gaze-estimation-adas-0002
092_weld-porosity-detection-0001
093_ocr_japanese
094_hand_recrop
095_centerface
096_RetinaFace
097_YAMNet
098_SPICE
099_efficientnet_anomaly_detection_segmentation
100_HiFill
101_arbitrary_image_stylization
102_Coconet
103_EfficientDet_lite
104_DeeplabV3-plus
105_MobileStyleGAN
106_WHENet
107_SFA3D
108_HAWP
109_Selfie_Segmentation
110_L-CNN
111_SRN-Deblur
112_DeblurGANv2
113_Anime2Sketch
114_Two-branch-dehazing
115_MoveNet
116_DroNet
117_DTLN
118_Speech-enhancement
119_M-LSD
120_FRILL
121_GPT2_DistillGPT2
122_DistillBert
123_YOLOR
124_person-attributes-recognition-crossroad-0230
125_person-attributes-recognition-crossroad-0234
126_person-attributes-recognition-crossroad-0238
127_dino
129_SCRFD
131_CFNet
132_YOLOX
133_Real-ESRGAN
134_head-pose-estimation-adas-0001
135_CoEx
136_road-segmentation-adas-0001
137_MoveNet_MultiPose
138_BackgroundMattingV2
139_PSD-Principled-Synthetic-to-Real-Dehazing-Guided-by-Physical-Priors
140_Ultra-Fast-Lane-Detection
141_lanenet-lane-detection
142_HITNET
143_RAPiD
144_YuNet
145_text_detection_db
146_FastDepth
147_PackNet-SfM
148_LapDepth
149_depth_estimation
150_MobileStereoNet
151_object_detection_mobile_object_localizer
152_DeepLPF
153_MegaDepth
154_driver-action-recognition-adas-0002-encoder
155_driver-action-recognition-adas-0002-decoder
156_MobileHumanPose
157_3DMPPE_POSENET
158_HR-Depth
159_EPCDepth
160_msg_chn_wacv20
161_EigenGAN-Tensorflow
162_PyDNet
163_MST_inpainting
164_MADNet
165_RealtimeStereo
166_Insta-DM
167_LSTR
168_DPT
169_spaghettinet_edgetpu
170_Learning-to-See-in-the-Dark
171_Fast-SRGAN
172_Real-Time-Super-Resolution
173_MVDepthNet
174_PP-PicoDet
175_face-recognition-resnet100-arcface-onnx
176_StableLLVE
177_BirdNET-Lite
178_vehicle-detection-0200
179_person-detection-0202
181_models_edgetpu_checkpoint_and_tflite_vision_segmentation-edgetpu_tflite_default_argmax
182_models_edgetpu_checkpoint_and_tflite_vision_segmentation-edgetpu_tflite_fused_argmax
183_pedestrian-detection-adas-0002
184_pedestrian-and-vehicle-detector-adas-0001
185_person-vehicle-bike-detection-crossroad-0078
186_person-vehicle-bike-detection-crossroad-1016
187_vehicle-attributes-recognition-barrier-0039
188_vehicle-attributes-recognition-barrier-0042
189_vehicle-license-plate-detection-barrier-0106
190_person-detection-asl-0001
191_anti-spoof-mn3
192_open-closed-eye-0001
193_CoCosNet
194_face_recognizer_fast
195_person_reid_youtu
196_human_segmentation_pphumanseg
197_yolact-resnet50-fpn
198_YOLOF
199_NSFW
200_AGLLNet
201_CityscapesSOTA
202_stereoDNN
203_SRHNet
204_HINet
205_MBLLEN
206_Matting
207_GLADNet
208_SAPNet
209_MSBDN-DFF
210_SC_Depth_pl
211_Lac-GwcNet
212_GFN
213_TBEFN
214_EnlightenGAN
215_AOD-Net
216_Zero-DCE-TF
217_RUAS
218_DSLR
219_StereoNet
220_HEP
221_YOLACT-PyTorch
222_LFT
223_DA_dahazing
224_Y-net
225_NTIRE-2021-Dehazing-Two-branch
226_CascadeTableNet
227_face-detection-adas-0001
228_Fast-SCNN
229_DexiNed
230_Single-Image-Desnowing-HDCWNet
231_DRBL
232_MIMO-UNet
233_HRNet-for-Fashion-Landmark-Estimation
234_FBCNN
235_W-Stereo-Disp
236_A-TVSNet
237_piano_transcription
238_SUIM-Net
239_CasStereoNet
240_BSRGAN
241_SCL-LLE
242_RobustVideoMatting
243_Zero-DCE-improved
244_FINNger
245_GLPDepth
246_SqueezeSegV3
247_PoseC3D
248_MS-G3D
249_Real-CUGAN
250_Face-Mask-Detection
251_AU-GAN
252_RAFT
253_TransWeather
254_FullSubNet-plus
255_FILM
256_SFace
257_PiCANet
258_TinyHITNet
259_Emotion_FERPlus
260_KP2D
261_EfficientDerain
262_ByteTrack
263_EgoNet
264_object_localization_network
265_PoseAug
266_ACVNet
267_LIOT
268_Lite-HRNet
269_Higher-HRNet
270_HWMNet
271_HRNet
272_CSFlow
273_OPN
274_DeepFillv2
275_FD-GAN
276_HybridNets
277_EDN-GTM
278_DWARF
279_F-Clip
280_GASDA
281_IMDN
282_face_landmark_with_attention
283_UIE-WD
284_CREStereo
285_Decoupled-Low-light-Image-Enhancement
286_SCI
287_Topformer
288_perceptual-reflection-removal
289_face-detection-0100
290_AdaFace
291_SeAFusion
292_Graft-PSMNet
293_Lightweight-Head-Pose-Estimation
294_FSRE-Depth
295_SparseInst
296_MGNet
297_GazeNet
297x_↑↑↑_OpenVINO_2021.4.582_↓↓↓_OpenVINO_2022.1.0
298_DEQ-Flow
299_DGNet
300_6DRepNet
301_YOLOv4_Face
302_SLPT
303_FAN
304_SynergyNet
305_DMHead
306_GMFlowNet
307_YOLOv7
308_FastestDet
309_ImageForensicsOSN
310_attentive-gan-derainnet
311_HHP-Net
312_NeWCRFs
313_IS-Net
314_PyDNet2
315_Illumination-Adaptive-Transformer
316_night_enhancement
317_MobileOne
318_pips
319_ACR-Loss
320_Dehamer
321_DID-M3D
322_YOLOv7_Head
323_Stripformer
324_Ultra-Fast-Lane-Detection-v2
325_DehazeFormer
326_YOLOPv2
327_EMDC
328_Stable_Diffusion
329_YOLOX-PAI
330_MOSAIC
- image
- README.md
- download.sh
- param_replacement.json
332_CrowdDet
333_E2Pose
334_DAMO-YOLO
335_PIDNet
336_PP-YOLOE-Plus
337_FreeYOLO
338_Fast-ACVNet
339_DeepLSD
340_Dense-Head-Pose-Estimation
341_YOLOv6
342_ALIKE
343_PP-MattingV2
344_XYDeblur
346_facial_expression_recognition_mobilefacenet
347_RGBX_Semantic_Segmentation
348_Bread
349_PMN
350_P-STMO
351_RFDN
352_MAXIM
353_ShadowFormer
354_DEA-Net
355_MHFormer
356_EdgeYOLO
357_Unimatch
358_CGI-Stereo
359_MSPFN
360_PARSeq
361_KBNet
362_ZoeDepth
363_YOLO-6D-Pose
364_IGEV
365_HTNet
366_text_recognition_CRNN
367_FLW-Net
368_C2PNet
369_Segment_Anything
370_Semantic-Guided-Low-Light-Image-Enhancement
371_Lite-Mono
372_URetinex-Net
373_LiteTrack
374_LaneSOD
375_SCANet
376_RT-DETR
377_DRSformer
378_P2PNet_tfkeras
379_PP-LCNetV2
380_Skin-Clothes-Hair-Segmentation-using-SMP
381_Whisper
382_Light-SERNet
383_DirectMHP
384_TCMonoDepth
385_PairLIE
386_naruto_handsign_detection
387_YuNetV2
388_LightGlue
389_WGWS-Net
390_BlendshapeV2
391_MagicTouch
392_STCFormer
393_RTMPose_WholeBody
394_RTMPose_Animal
395_FFNet
396_MixDehazeNet
397_MiDaSv3.1
398_L2CS-Net
399_RetinaFace_MobileNetv2
400_CSRNet
401_CLRerNet
402_trt_pose
403_trt_pose_hand
404_HDR-Transformer
405_Ear_Segmentation
406_DeDoDe
407_Generalizing_Gaze_Estimation
408_UAED
409_nighttime_dehaze
410_FaceMeshV2
411_UDR-S2Former_deraining
412_pytorch_cpn
413_DocShadow
414_STAR
415_High-frequency-Stereo-Matching-Network
416_GeoNet
417_PopNet
418_Diffusion-Low-Light
419_MobileViT_v1_v2
420_Gold-YOLO-Hand
421_Gold-YOLO-Head
422_Gold-YOLO-Head-Hand
423_6DRepNet360
424_Gold-YOLO-Body
425_Gold-YOLO-Body-Head-Hand
426_YOLOX-Body-Head-Hand
427_RTMPose_Hand
428_ISR
429_OSNet
430_FastReID
431_NITEC
432_face-reidentification-retail-0095
433_FaceBoxes.PyTorch
434_YOLOX-Body-Head-Hand-Face
435_MobileFaceNet
436_Peppa_Pig_Face_Landmark
437_PIPNet
438_PeCLR
439_Depth-Anything
440_ViTPose
441_YOLOX-Body-Head-Hand-Face-Dist
442_YOLOX-Body-Head-Face-HandLR-Dist
443_Opal23_HeadPose
444_YOLOX-Foot-Dist
445_YOLOX-Body-Head-Face-HandLR-Foot-Dist
446_YOLOX-Body-With-Wheelchair
447_YOLOX-Wholebody-with-Wheelchair
448_YOLOX-Eye-Nose-Mouth-Ear
449_YOLOX-WholeBody12
450_YOLOv9-Wholebody-with-Wheelchair
451_DAN
452_FairFace
453_FairDAN
454_YOLOv9-Wholebody13
455_YOLOv9-Gender
456_YOLOv9-Wholebody15
457_YOLOv9-Wholebody17
458_YOLOv9-Discrete-HeadPose-Yaw
459_YOLOv9-Wholebody25
460_RT-DETRv2-Wholebody25
461_YOLOv9-Phone
462_Gaze-LLE
463_YOLOv9-Shoulder-Elbow-Knee
464_YOLOv9-Wholebody28
465_DEIM-Wholebody28
466_People_Segmentation
467_Human_Parsing
999_media
third_party
.gitignore
.gitmodules
LICENSE
README.md
log-cleaner.sh

330_MOSAIC

Name		Name	Last commit message	Last commit date
parent directory ..
image		image
README.md		README.md
download.sh		download.sh
param_replacement.json		param_replacement.json

README.md

MOSAIC with TensorRT on Jetson Nano

Original model

MOSAIC: Mobile Segmentation via decoding Aggregated Information and encoded Context

TensorFlow Official Models

Discription

Convert checkpoints to ONNX model and add fused argmax.

Convert checkpoints to TF-Lite model with argmax.
Convert TF-Lite model to ONNX model.
Convert ONNX model to TF-Lite model using onnx2tf and replace argmax with fused argmax.
Convert TF-Lite model to ONNX model and run Jetson Nano.

How to

Host PC

Downlaod checkpoints and convert checkpoints to TF-Lite model.

$ git clone https://github.com/PINTO0309/PINTO_model_zoo.git
$ cd PINTO_model_zoo/330_MOSAIC/
$ sudo podman run -it --rm -v `pwd`:/workdir tensorflow/tensorflow:2.10.0-gpu

# apt update && apt install git wget
# cd /workdir
# git clone https://github.com/NobuoTsukamoto/models.git
# cd models
# export PYTHONPATH=$PYTHONPATH:`pwd`
# /usr/bin/python3 -m pip install --upgrade pip
# pip3 install -r official/requirements.txt
# cd official/projects/mosaic/
# wget https://storage.googleapis.com/tf_model_garden/vision/mosaic/MobileNetMultiAVGSeg-r1024-ebf64-gp.tar.gz
# tar xf MobileNetMultiAVGSeg-r1024-ebf64-gp.tar.gz
# python3 serving/export_tflite.py \
    --model_name=mosaic_mnv35_cityscapes \
    --ckpt_path=./gcs_ckpt/best_ckpt-857 \
    --output_dir=/tmp \
    --image_height=1024 \
    --image_width=2048 \
    --finalize_method=resize1024_2048,argmax
# cp /tmp/mosaic_mnv35_cityscapes.tflite /workdir/mosaic_mnv35_cityscapes_argmax.tflite

Convert ONNX model.

# pip3 install tf2onnx
# cd /workdir
# python3 -m tf2onnx.convert \
    --opset 13 \
    --tflite ./mosaic_mnv35_cityscapes_argmax.tflite \
    --output ./mosaic_mnv35_cityscapes_argmax.onnx \
    --inputs-as-nchw serving_default_input_2:0 \
    --dequantize

Convert ONNX model to TF-Lite model and replace argmax with fused argmax.
The converted onnx model contains Transpose ope. You need to exclude Transpose ope when converting to TF-Lite model with onnx2tf. Specify the Transpose ope to exclude with the param_replacement_file option.

Check the param_replacement.json file for details.

Note:
Jetson Nano cannot convert the model due to lack of memory unless fused_argmax_scale_ratio is set to 0.25. It affects quality.

# pip3 install onnx \
    && pip3 install nvidia-pyindex \
    && pip3 install onnx-graphsurgeon \
    && pip3 install onnxsim \
    && pip3 install simple_onnx_processing_tools \
    && pip3 install onnx2tf
# onnx2tf \
    -i ./mosaic_mnv35_cityscapes_argmax.onnx \
    -o saved_model \
    --param_replacement_file ./param_replacement.json \
    --replace_argmax_to_fused_argmax_and_indicies_is_int64 \
    --fused_argmax_scale_ratio 0.25
# python3 -m tf2onnx.convert \
    --opset 13 --tflite ./saved_model/model_float32.tflite \
    --output /workdir/mosaic_mnv35_cityscapes_fused_argmax.onnx \
    --inputs-as-nchw inputs_0 \
    --dequantize

Repace argmax to fused argmax.

Exec trtexec on Jetson Nano

Copy mosaic_mnv35_cityscapes_fused_argmax.onnx to Jetson Nano.
Check trtexec.

$ /usr/src/tensorrt/bin/trtexec \
    --onnx=mosaic_mnv35_cityscapes_argmax.onnx \
    --workspace=512 \
    --verbose=true \
    --fp16

    ...
[11/10/2022-22:09:08] [I] === Performance summary ===
[11/10/2022-22:09:08] [I] Throughput: 1.51249 qps
[11/10/2022-22:09:08] [I] Latency: min = 659.427 ms, max = 664.576 ms, mean = 661.149 ms, median = 660.98 ms, percentile(99%) = 664.576 ms
[11/10/2022-22:09:08] [I] End-to-End Host Latency: min = 659.438 ms, max = 664.586 ms, mean = 661.159 ms, median = 660.989 ms, percentile(99%) = 664.586 ms
[11/10/2022-22:09:08] [I] Enqueue Time: min = 5.93988 ms, max = 6.66797 ms, mean = 6.40006 ms, median = 6.43967 ms, percentile(99%) = 6.66797 ms
[11/10/2022-22:09:08] [I] H2D Latency: min = 2.50977 ms, max = 2.58398 ms, mean = 2.56918 ms, median = 2.57477 ms, percentile(99%) = 2.58398 ms
[11/10/2022-22:09:08] [I] GPU Compute Time: min = 656.029 ms, max = 661.178 ms, mean = 657.757 ms, median = 657.614 ms, percentile(99%) = 661.178 ms
[11/10/2022-22:09:08] [I] D2H Latency: min = 0.815918 ms, max = 0.824219 ms, mean = 0.822473 ms, median = 0.822998 ms, percentile(99%) = 0.824219 ms
[11/10/2022-22:09:08] [I] Total Host Walltime: 6.6116 s
[11/10/2022-22:09:08] [I] Total GPU Compute Time: 6.57757 s
[11/10/2022-22:09:08] [I] Explanations of the performance metrics are printed in the verbose logs.
...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

330_MOSAIC

330_MOSAIC

README.md

MOSAIC with TensorRT on Jetson Nano

Original model

Discription

How to

Host PC

Exec trtexec on Jetson Nano

Demo project

Reference

Files

330_MOSAIC

Directory actions

More options

Directory actions

More options

Latest commit

History

330_MOSAIC

Folders and files

parent directory

README.md

MOSAIC with TensorRT on Jetson Nano

Original model

Discription

How to

Host PC

Exec trtexec on Jetson Nano

Demo project

Reference