Skip to content

Latest commit

 

History

History
 
 

yolov5

YOLOv5

Abstract

YOLOv5 is a family of object detection architectures and models pretrained on the COCO dataset, and represents Ultralytics open-source research into future vision AI methods, incorporating lessons learned and best practices evolved over thousands of hours of research and development.

Results and models

COCO

Backbone Arch size SyncBN AMP Mem (GB) box AP Config Download
YOLOv5-n P5 640 Yes Yes 1.5 28.0 config model | log
YOLOv5-s P5 640 Yes Yes 2.7 37.7 config model | log
YOLOv5-m P5 640 Yes Yes 5.0 45.3 config model | log
YOLOv5-l P5 640 Yes Yes 8.1 48.8 config model | log
YOLOv5-n P6 1280 Yes Yes 5.8 35.9 config model | log
YOLOv5-s P6 1280 Yes Yes 10.5 44.4 config model | log
YOLOv5-m P6 1280 Yes Yes 19.1 51.3 config model | log
YOLOv5-l P6 1280 Yes Yes 30.5 53.7 config model | log

Note: In the official YOLOv5 code, the random_perspective data augmentation in COCO object detection task training uses mask annotation information, which leads to higher performance. Object detection should not use mask annotation, so only box annotation information is used in MMYOLO. We will use the mask annotation information in the instance segmentation task. See ultralytics/yolov5#9917 for details.

  1. fast means that YOLOv5DetDataPreprocessor and yolov5_collate are used for data preprocessing, which is faster for training, but less flexible for multitasking. Recommended to use fast version config if you only care about object detection.
  2. detect means that the network input is fixed to 640x640 and the post-processing thresholds is modified.
  3. SyncBN means use SyncBN, AMP indicates training with mixed precision.
  4. We use 8x A100 for training, and the single-GPU batch size is 16. This is different from the official code.
  5. The performance is unstable and may fluctuate by about 0.4 mAP and the highest performance weight in COCO training in YOLOv5 may not be the last epoch.
  6. balloon means that this is a demo configuration.

VOC

Backbone size Batchsize AMP Mem (GB) box AP(COCO metric) Config Download
YOLOv5-n 512 64 Yes 3.5 51.2 config model | log
YOLOv5-s 512 64 Yes 6.5 62.7 config model | log
YOLOv5-m 512 64 Yes 12.0 70.1 config model | log
YOLOv5-l 512 32 Yes 10.0 73.1 config model | log

Note:

  1. Training on VOC dataset need pretrained model which trained on COCO.
  2. The performance is unstable and may fluctuate by about 0.4 mAP.
  3. Official YOLOv5 use COCO metric, while training VOC dataset.
  4. We converted the VOC test dataset to COCO format offline, while reproducing mAP result as shown above. We will support to use COCO metric while training VOC dataset in later version.
  5. Hyperparameter reference from https://wandb.ai/glenn-jocher/YOLOv5_VOC_official.

Citation

@software{glenn_jocher_2022_7002879,
  author       = {Glenn Jocher and
                  Ayush Chaurasia and
                  Alex Stoken and
                  Jirka Borovec and
                  NanoCode012 and
                  Yonghye Kwon and
                  TaoXie and
                  Kalen Michael and
                  Jiacong Fang and
                  imyhxy and
                  Lorna and
                  Colin Wong and
                  曾逸夫(Zeng Yifu) and
                  Abhiram V and
                  Diego Montes and
                  Zhiqiang Wang and
                  Cristi Fati and
                  Jebastin Nadar and
                  Laughing and
                  UnglvKitDe and
                  tkianai and
                  yxNONG and
                  Piotr Skalski and
                  Adam Hogan and
                  Max Strobel and
                  Mrinal Jain and
                  Lorenzo Mammana and
                  xylieong},
  title        = {{ultralytics/yolov5: v6.2 - YOLOv5 Classification
                   Models, Apple M1, Reproducibility, ClearML and
                   Deci.ai integrations}},
  month        = aug,
  year         = 2022,
  publisher    = {Zenodo},
  version      = {v6.2},
  doi          = {10.5281/zenodo.7002879},
  url          = {https://doi.org/10.5281/zenodo.7002879}
}