A lightweight human detection model trained on a high-quality human dataset that I annotated entirely by myself. It is extremely resistant to blur and occlusion. In addition, the detection rate at short, medium, and long distances has been greatly enhanced, and robustness to dark scenes and camera halation has been greatly improved.
`Head` does not mean `Face`. The entire head is detected rather than a narrow region of the face, which makes it possible to detect heads in all 360° orientations.
- COCO-Hand (14,667 Images, 66,903 labels, All re-annotated manually)
- http://vision.cs.stonybrook.edu/~supreeth/COCO-Hand.zip
- I add my own enhancement data to COCO-Hand and re-annotate all images myself. In other words, only the COCO images are used; none of the original annotation data is reused.
- I have no plans to publish my own dataset.
```
body_label_count: 30,729 labels
head_label_count: 26,268 labels
hand_label_count: 18,087 labels
===============================
           Total: 66,903 labels
           Total: 14,667 images
```
Halfway compromises are never acceptable.
- Python 3.10
- onnxruntime-gpu v1.16.1 (TensorRT Execution Provider Enabled Binary)
- opencv-contrib-python 4.8.0.76
- numpy 1.24.3
- TensorRT 8.5.3-1+cuda11.8
Benchmarked with CUDA; TensorRT was not used. Inference is approximately twice as fast (around 250 FPS) with TensorRT enabled.
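For reference, a minimal sketch of creating an onnxruntime session that prefers the TensorRT Execution Provider and falls back to CUDA and then CPU. The provider options shown are assumptions, not settings taken from the demo scripts.

```python
import onnxruntime as ort

# Provider priority: TensorRT (fastest when available) -> CUDA -> CPU.
providers = [
    ('TensorrtExecutionProvider', {'trt_engine_cache_enable': True}),  # assumed option
    'CUDAExecutionProvider',
    'CPUExecutionProvider',
]
session = ort.InferenceSession(
    'gold_yolo_n_body_head_hand_post_0461_0.4428_1x3x480x640.onnx',
    providers=providers,
)
print(session.get_providers())  # shows which providers were actually enabled
```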
```
usage: demo_goldyolo_onnx.py [-h] [-m MODEL] [-v VIDEO]

options:
  -h, --help            show this help message and exit
  -m MODEL, --model MODEL
  -v VIDEO, --video VIDEO
```
- 640x480 CUDA RTX3070
```bash
python demo/demo_goldyolo_onnx.py \
-m gold_yolo_n_body_head_hand_post_0461_0.4428_1x3x480x640.onnx \
-v 0
```
output_body_head_hand_n.mp4
- 320x256 CPU Corei9
```bash
python demo/demo_goldyolo_onnx.py \
-m gold_yolo_n_body_head_hand_post_0461_0.4428_1x3x256x320.onnx \
-v 0
```
output_256x320.mp4
- 160x128 CPU Corei9
```bash
python demo/demo_goldyolo_onnx.py \
-m gold_yolo_n_body_head_hand_post_0461_0.4428_1x3x128x160.onnx \
-v 0
```
output_128x160.mp4
- Still image
```
usage: demo_goldyolo_onnx_image.py [-h] [-m MODEL] [-i IMAGES_PATH] [-o OUTPUT_PATH]

options:
  -h, --help            show this help message and exit
  -m MODEL, --model MODEL
  -i IMAGES_PATH, --images_path IMAGES_PATH
  -o OUTPUT_PATH, --output_path OUTPUT_PATH
```
```bash
python demo/demo_goldyolo_onnx_image.py \
-m gold_yolo_n_body_head_hand_post_0461_0.4428_1x3x480x640.onnx \
-i images_folder
```
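If you want to call the ONNX model directly instead of going through the demo scripts, the input preparation is roughly as follows. This is a minimal sketch for the 1x3x480x640 model; the BGR-to-RGB conversion and the [0, 1] scaling are assumptions and should be checked against the demo scripts.

```python
import cv2
import numpy as np

cap = cv2.VideoCapture(0)   # same webcam source as `-v 0`
ret, frame = cap.read()     # frame is HxWx3, BGR, uint8
if not ret:
    raise RuntimeError('Failed to read a frame from the camera')

resized = cv2.resize(frame, (640, 480))          # OpenCV takes (width, height)
rgb = cv2.cvtColor(resized, cv2.COLOR_BGR2RGB)   # assumed channel order
blob = rgb.transpose(2, 0, 1)[np.newaxis].astype(np.float32) / 255.0  # NCHW, assumed scaling
# blob.shape == (1, 3, 480, 640); pass it to onnxruntime as the model input
cap.release()
```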
- Body-Head-Hand - N
```
Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.443
Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.689
Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.467
Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.303
Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.654
Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.830
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.135
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.389
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.515
Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.381
Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.739
Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.872
Results saved to runs/train/gold_yolo-n
Epoch: 462 | mAP@0.5: 0.6892104619015829 | mAP@0.50:0.95: 0.4427396559181031

Class  Labeled_images  Labels  P@.5iou  R@.5iou  F1@.5iou  mAP@.5  mAP@.5:.95
all    486             8858    0.856    0.62     0.719     0.689   0.443
body   486             3747    0.857    0.60     0.706     0.662   0.440
head   475             3269    0.912    0.68     0.779     0.726   0.497
hand   483             1842    0.842    0.59     0.694     0.680   0.391
```
- Body-Head-Hand - S
```
Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.460
Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.704
Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.491
Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.327
Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.665
Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.838
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.137
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.399
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.526
Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.397
Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.739
Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.874
Results saved to runs/train/gold_yolo-s
Epoch: 456 | mAP@0.5: 0.7040425163160517 | mAP@0.50:0.95: 0.46049785564440426

Class  Labeled_images  Labels  P@.5iou  R@.5iou  F1@.5iou  mAP@.5  mAP@.5:.95
all    486             8858    0.852    0.65     0.738     0.704   0.460
body   486             3747    0.848    0.63     0.723     0.669   0.455
head   475             3269    0.919    0.69     0.788     0.730   0.511
hand   483             1842    0.814    0.65     0.723     0.712   0.415
```
- Body-Head-Hand - M
```
Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.500
Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.738
Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.540
Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.359
Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.722
Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.864
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.143
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.427
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.562
Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.430
Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.788
Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.892
Results saved to runs/train/gold_yolo-m
Epoch: 488 | mAP@0.5: 0.7378339081274632 | mAP@0.50:0.95: 0.5004409472223532

Class  Labeled_images  Labels  P@.5iou  R@.5iou  F1@.5iou  mAP@.5  mAP@.5:.95
all    486             8858    0.872    0.68     0.764     0.738   0.500
body   486             3747    0.895    0.64     0.746     0.701   0.499
head   475             3269    0.937    0.71     0.808     0.751   0.536
hand   483             1842    0.842    0.69     0.759     0.762   0.466
```
- Body-Head-Hand - L
```
Average Precision  (AP) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.509
Average Precision  (AP) @[ IoU=0.50      | area=   all | maxDets=100 ] = 0.739
Average Precision  (AP) @[ IoU=0.75      | area=   all | maxDets=100 ] = 0.556
Average Precision  (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.367
Average Precision  (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.729
Average Precision  (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.869
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=  1 ] = 0.146
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets= 10 ] = 0.432
Average Recall     (AR) @[ IoU=0.50:0.95 | area=   all | maxDets=100 ] = 0.567
Average Recall     (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.434
Average Recall     (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.792
Average Recall     (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.903
Results saved to runs/train/gold_yolo-l
Epoch: 339 | mAP@0.5: 0.7393661924683652 | mAP@0.50:0.95: 0.5093183767567647

Class  Labeled_images  Labels  P@.5iou  R@.5iou  F1@.5iou  mAP@.5  mAP@.5:.95
all    486             8858    0.890    0.68     0.771     0.740   0.509
body   486             3747    0.880    0.66     0.754     0.704   0.509
head   475             3269    0.933    0.71     0.806     0.751   0.540
hand   483             1842    0.843    0.70     0.765     0.765   0.479
```
- Post-Process
Because I append my own post-processing to the end of the model, which can be inferred with TensorRT, CUDA, or CPU, the benchmarked inference speed is the end-to-end processing speed, including all pre-processing and post-processing. EfficientNMS in TensorRT is very slow and should be offloaded to the CPU.
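As a rough illustration of what end-to-end use looks like, the sketch below runs the post-processed ONNX model and reads back detections. The assumed output layout of one row per detection, `[batch_no, class_id, score, x1, y1, x2, y2]`, and the 0.35 score threshold are assumptions for illustration; verify them against the actual model outputs and the demo script.

```python
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession(
    'gold_yolo_n_body_head_hand_post_0461_0.4428_1x3x480x640.onnx',
    providers=['CUDAExecutionProvider', 'CPUExecutionProvider'],
)
input_name = session.get_inputs()[0].name

# `blob` is a preprocessed frame shaped (1, 3, 480, 640); a zero tensor stands in here.
blob = np.zeros((1, 3, 480, 640), dtype=np.float32)

outputs = session.run(None, {input_name: blob})
detections = outputs[0]  # assumed shape: (num_detections, 7)
for batch_no, class_id, score, x1, y1, x2, y2 in detections:
    if score >= 0.35:  # arbitrary confidence threshold for illustration
        print(int(class_id), float(score), int(x1), int(y1), int(x2), int(y2))
```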
If this work has contributed in any way to your research or business, I would be happy if you cite it in your literature.
```
@software{Gold-YOLO-Body-Head-Hand,
  author={Katsuya Hyodo},
  title={Lightweight human detection model generated using a high-quality human dataset},
  url={https://github.com/PINTO0309/PINTO_model_zoo/tree/main/425_Gold-YOLO-Body-Head-Hand},
  year={2023},
  month={11},
  doi={10.5281/zenodo.10229410},
}
```
I am very grateful to the authors of the following works for their excellent contributions.
- COCO-Hand
https://vision.cs.stonybrook.edu/~supreeth/
```
@article{Hand-CNN,
  title={Contextual Attention for Hand Detection in the Wild},
  author={Supreeth Narasimhaswamy and Zhengwei Wei and Yang Wang and Justin Zhang and Minh Hoai},
  booktitle={International Conference on Computer Vision (ICCV)},
  year={2019},
  url={https://arxiv.org/pdf/1904.04882.pdf}
}
```
- Gold-YOLO
https://github.com/huawei-noah/Efficient-Computing/tree/master/Detection/Gold-YOLO
```
@misc{wang2023goldyolo,
  title={Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism},
  author={Chengcheng Wang and Wei He and Ying Nie and Jianyuan Guo and Chuanjian Liu and Kai Han and Yunhe Wang},
  year={2023},
  eprint={2309.11331},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
```
- CD-COCO: Complex Distorted COCO database for Scene-Context-Aware computer vision. Used to synthesize and retrain the dataset to further improve model performance.
```
@INPROCEEDINGS{10323035,
  author={Beghdadi, Ayman and Beghdadi, Azeddine and Mallem, Malik and Beji, Lotfi and Cheikh, Faouzi Alaya},
  booktitle={2023 11th European Workshop on Visual Information Processing (EUVIP)},
  title={CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision},
  year={2023},
  volume={},
  number={},
  pages={1-6},
  doi={10.1109/EUVIP58404.2023.10323035}
}
```