<a href="https://colab.research.google.com/github/Prachiti68/sample/blob/main/Copy_of_YOLOv5_Classification_Tutorial.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# YOLOv5 Classification Tutorial

YOLOv5 supports classification tasks too. This is the official YOLOv5 classification notebook tutorial. YOLOv5 is maintained by [Ultralytics](https://github.com/ultralytics/yolov5).

This notebook covers:

*   Inference with out-of-the-box YOLOv5 classification on ImageNet
*   [Training YOLOv5 classification](https://blog.roboflow.com/train-YOLOv5-classification-custom-data) on custom data

*Looking for custom data? Explore over 66M community datasets on [Roboflow Universe](https://universe.roboflow.com).*

This notebook was created with Google Colab. [Click here](https://colab.research.google.com/drive/1FiSNz9f_nT8aFtDEU3iDAQKlPT8SCVni?usp=sharing) to run it.

# Setup

Clone the YOLOv5 repository and install its dependencies to prepare the notebook environment.

In [1]:
!git clone https://github.com/ultralytics/yolov5  # clone
%cd yolov5


import torch
import utils
display = utils.notebook_init()  # checks

YOLOv5 🚀 v7.0-153-gff6a9ac Python-3.9.16 torch-2.0.0+cu118 CPU


Setup complete ✅ (2 CPUs, 12.7 GB RAM, 23.3/107.7 GB disk)


In [None]:
from google.colab import drive
drive.mount('/content/drive')

In [2]:
%pip install -qr requirements.txt  # install


# 1. Infer on ImageNet

To demonstrate YOLOv5 classification, we'll use an already trained model. In this case, we'll download the classification models pretrained on ImageNet using the YOLOv5 utilities.

In [None]:
from utils.downloads import attempt_download

p5 = ['n', 's', 'm', 'l', 'x']  # P5 models
cls = [f'{x}-cls' for x in p5]  # classification models

for x in cls:
    attempt_download(f'weights/yolov5{x}.pt')

Downloading https://github.com/ultralytics/yolov5/releases/download/v7.0/yolov5n-cls.pt to weights/yolov5n-cls.pt...
100%|██████████| 4.87M/4.87M [00:00<00:00, 25.1MB/s]

Downloading https://github.com/ultralytics/yolov5/releases/download/v7.0/yolov5s-cls.pt to weights/yolov5s-cls.pt...
100%|██████████| 10.5M/10.5M [00:00<00:00, 17.9MB/s]

Downloading https://github.com/ultralytics/yolov5/releases/download/v7.0/yolov5m-cls.pt to weights/yolov5m-cls.pt...
100%|██████████| 24.9M/24.9M [00:01<00:00, 18.0MB/s]

Downloading https://github.com/ultralytics/yolov5/releases/download/v7.0/yolov5l-cls.pt to weights/yolov5l-cls.pt...
100%|██████████| 50.9M/50.9M [00:02<00:00, 19.0MB/s]

Downloading https://github.com/ultralytics/yolov5/releases/download/v7.0/yolov5x-cls.pt to weights/yolov5x-cls.pt...
100%|██████████| 92.0M/92.0M [00:05<00:00, 17.1MB/s]



Now, we can infer on an example image from the ImageNet dataset.

In [3]:
#Download example image
import requests
image_url = "https://i.imgur.com/OczPfaz.jpg"
img_data = requests.get(image_url).content
with open('bananas.jpg', 'wb') as handler:
    handler.write(img_data)

In [6]:
#Infer using classify/predict.py
!python classify/predict.py --weights ./weights/yolov5s-cls.pt --source bananas.jpg

classify/predict: weights=['./weights/yolov5s-cls.pt'], source=bananas.jpg, data=data/coco128.yaml, imgsz=[224, 224], device=, view_img=False, save_txt=False, nosave=False, augment=False, visualize=False, update=False, project=runs/predict-cls, name=exp, exist_ok=False, half=False, dnn=False, vid_stride=1
requirements: /content/requirements.txt not found, check failed.
YOLOv5 🚀 v7.0-153-gff6a9ac Python-3.9.16 torch-2.0.0+cu118 CPU

Fusing layers... 
Model summary: 117 layers, 5447688 parameters, 0 gradients, 11.4 GFLOPs
image 1/1 /content/yolov5/bananas.jpg: 224x224 banana 0.96, zucchini 0.00, acorn squash 0.00, spaghetti squash 0.00, green mamba 0.00, 25.0ms
Speed: 0.0ms pre-process, 25.0ms inference, 0.1ms NMS per image at shape (1, 3, 224, 224)
Results saved to runs/predict-cls/exp2


From the output, we can see the ImageNet-trained model correctly predicts the class `banana` with `0.96` confidence.
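The five scores printed for the image sum to roughly 1, which is consistent with a softmax over the model's raw class scores. The following is a minimal sketch of that ranking step, assuming softmax is what produces the confidences. The logits and the `softmax` and `top_k` helpers are illustrative and are not taken from `classify/predict.py`.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(logits, names, k=5):
    """Return the k (name, probability) pairs with the highest probability."""
    probs = softmax(logits)
    ranked = sorted(zip(names, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical logits for a handful of classes (not real model output)
names = ["banana", "zucchini", "acorn squash", "green mamba"]
logits = [6.0, 1.0, 0.5, 0.2]
for name, prob in top_k(logits, names, k=3):
    print(f"{name} {prob:.2f}")
```

A large gap between the first and second logit is what yields a confidence close to 1 for the top class.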

# 2. (Optional) Validate

Use the `classify/val.py` script to run validation for the model. This will show us the model's performance on each class.

First, we need to download ImageNet.

In [11]:
# # WARNING: takes ~20 minutes
!bash data/scripts/get_imagenet.sh --val

Streaming output truncated to the last 5000 lines.
mv: cannot stat 'ILSVRC2012_val_00044810.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044811.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044812.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044813.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044814.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044815.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044816.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044817.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044818.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044819.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044820.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044821.JPEG': No such file or directory
mv: cannot stat 'ILSVRC2012_val_00044822.JPEG':

In [None]:
# # run the validation script
!python classify/val.py --weights ./weights/yolov5s-cls.pt --data ../datasets/imagenet

classify/val: data=../datasets/imagenet, weights=['./weights/yolov5s-cls.pt'], batch_size=128, imgsz=224, device=, workers=8, verbose=True, project=runs/val-cls, name=exp, exist_ok=False, half=False, dnn=False
YOLOv5 🚀 v7.0-153-gff6a9ac Python-3.9.16 torch-2.0.0+cu118 CUDA:0 (Tesla T4, 15102MiB)

Fusing layers... 
Model summary: 117 layers, 5447688 parameters, 0 gradients, 11.4 GFLOPs
validating: 100% 391/391 [04:48<00:00,  1.35it/s]
                   Class      Images    top1_acc    top5_acc
                     all       50000       0.715       0.902
                   tench          50        0.94        0.98
                goldfish          50        0.88        0.92
       great white shark          50        0.78        0.96
             tiger shark          50        0.68        0.96
        hammerhead shark          50        0.82        0.92
            electric ray          50        0.76         0.9
                stingray          50         0.7         0.9


The output shows top-1 and top-5 accuracy for the ImageNet validation dataset, both overall and per class.
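To make the two columns concrete: top-1 accuracy counts a sample as correct only when the best guess matches the label, while top-5 accuracy counts it as correct when the label appears anywhere in the five best guesses. Here is a small sketch of that definition, using made-up ranked predictions and a hypothetical `topk_accuracy` helper rather than the code in `classify/val.py`.

```python
def topk_accuracy(ranked_predictions, targets, k=1):
    """Fraction of samples whose true label is among the first k ranked predictions."""
    hits = sum(
        1
        for ranked, target in zip(ranked_predictions, targets)
        if target in ranked[:k]
    )
    return hits / len(targets)


# Hypothetical ranked predictions (best guess first) for three images
ranked = [
    ["tench", "goldfish", "stingray"],
    ["goldfish", "tench", "stingray"],
    ["stingray", "tench", "goldfish"],
]
targets = ["tench", "tench", "goldfish"]

print(topk_accuracy(ranked, targets, k=1))  # only the first image is a top-1 hit
print(topk_accuracy(ranked, targets, k=3))  # every label appears in the top 3
```

Because every top-1 hit is also a top-5 hit, top-5 accuracy can never be lower than top-1 accuracy, which matches the table above.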

# 3. Train On Custom Data

To train on custom data, we need to prepare a dataset with custom labels.

To prepare custom data, we'll use [Roboflow](https://roboflow.com). Roboflow enables easy dataset prep with your team, including labeling, formatting into the right export format, deploying, and active learning with a `pip` package. 

If you need custom data, there are over 66M open source images from the community on [Roboflow Universe](https://universe.roboflow.com).

(For more guidance, here's a detailed blog on [training YOLOv5 classification on custom data](https://blog.roboflow.com/train-YOLOv5-classification-custom-data).)


Create a free Roboflow account, upload your data, and label. 

![](https://s4.gifyu.com/images/fruit-labeling.gif)

### Load Custom Dataset

Next, we'll export our dataset in the directory structure that YOLOv5 classification expects and load it into this notebook. On the version page, select the `Export` button, choose the `Folder Structure` format, and select `show download code`.

This ensures all our directories are in the right format:

```
dataset
├── train
│   ├── class-one
│   │   ├── IMG_123.jpg
│   └── class-two
│       ├── IMG_456.jpg
├── valid
│   ├── class-one
│   │   ├── IMG_789.jpg
│   └── class-two
│       ├── IMG_101.jpg
├── test
│   ├── class-one
│   │   ├── IMG_121.jpg
│   └── class-two
│       ├── IMG_341.jpg
```
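Before training, it can help to confirm that the export really follows this layout. The sketch below walks a folder-structured dataset and counts the images per split and class. The `count_images` helper, the list of image suffixes, and the `dataset` path are assumptions for illustration and are not part of YOLOv5 or Roboflow.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to match your export
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".bmp"}


def count_images(dataset_dir):
    """Return {split: {class_name: image_count}} for a folder-structured dataset."""
    counts = {}
    for split_dir in sorted(Path(dataset_dir).iterdir()):
        if not split_dir.is_dir():
            continue  # skip stray files at the dataset root
        counts[split_dir.name] = {
            class_dir.name: sum(
                1 for f in class_dir.iterdir() if f.suffix.lower() in IMAGE_SUFFIXES
            )
            for class_dir in sorted(split_dir.iterdir())
            if class_dir.is_dir()
        }
    return counts


if Path("dataset").is_dir():  # "dataset" is a placeholder path
    print(count_images("dataset"))
```

A class that is missing from one split, or that has a count of zero, is usually a sign that the export or upload needs another look.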

![](https://i.imgur.com/BF9BNR8.gif)


Copy and paste that snippet into the cell below.

In [7]:
# Ensure we're in the right directory to download our custom dataset
import os
os.makedirs("../datasets/", exist_ok=True)
%cd ../datasets/

/content/datasets


In [8]:
# REPLACE the below with your exported code snippet from above
!pip install roboflow

from roboflow import Roboflow
rf = Roboflow(api_key="YOUR_API_KEY")  # replace with your own Roboflow API key
project = rf.workspace("miniproject-6").project("plant-leaf-classification")
dataset = project.version(5).download("folder")

Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Collecting roboflow
  Downloading roboflow-1.0.5-py3-none-any.whl (56 kB)
Collecting requests-toolbelt
  Downloading requests_toolbelt-0.10.1-py2.py3-none-any.whl (54 kB)
Collecting idna==2.10
  Downloading idna-2.10-py2.py3-none-any.whl (58 kB)
Collecting pyparsing==2.4.7
  Downloading pyparsing-2.4.7-py2.py3-none-any.whl (67 kB)
Collecting python-dotenv
  Downloading python_dotenv-1.0.0-py3-none-any.whl (19 kB

loading Roboflow workspace...
loading Roboflow project...
Downloading Dataset Version Zip in plant-leaf-classification-5 to folder: 99% [1287168000 / 1289782922] bytes

Extracting Dataset Version Zip to plant-leaf-classification-5 in folder:: 100%|██████████| 1514/1514 [00:05<00:00, 286.52it/s]


In [9]:
#Save the dataset name to the environment so we can use it in a system call later
dataset_name = dataset.location.split(os.sep)[-1]
os.environ["DATASET_NAME"] = dataset_name
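The cell above relies on two ideas: the last path component is the dataset folder name, and values placed in `os.environ` are inherited by any process started from the notebook, which is what lets a `!` shell command expand `$DATASET_NAME`. A minimal standalone sketch of both, using a hypothetical location string in place of `dataset.location`:

```python
import os
import subprocess
import sys

# Hypothetical dataset location; in the notebook this comes from dataset.location
location = os.path.join("content", "datasets", "plant-leaf-classification-5")

# os.path.basename returns the final path component, like split(os.sep)[-1]
dataset_name = os.path.basename(location)
os.environ["DATASET_NAME"] = dataset_name

# Child processes inherit the environment, so the variable is visible to them
child = subprocess.run(
    [sys.executable, "-c", "import os; print(os.environ['DATASET_NAME'])"],
    capture_output=True,
    text=True,
    check=True,
)
print(child.stdout.strip())
```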

### Train On Custom Data 🎉
Here, we use the DATASET_NAME environment variable to pass our dataset to the `--data` parameter.

Note: we're training for 30 epochs here, starting from the pretrained weights. Larger datasets will likely benefit from longer training.

In [10]:
%cd ../yolov5
!python classify/train.py --model yolov5s-cls.pt --data $DATASET_NAME --epochs 30 --img 128 --pretrained weights/yolov5s-cls.pt

/content/yolov5
[34m[1mclassify/train: [0mmodel=yolov5s-cls.pt, data=plant-leaf-classification-5, epochs=30, batch_size=64, imgsz=128, nosave=False, cache=None, device=, workers=8, project=runs/train-cls, name=exp, exist_ok=False, pretrained=weights/yolov5s-cls.pt, optimizer=Adam, lr0=0.001, decay=5e-05, label_smoothing=0.1, cutoff=None, dropout=None, verbose=False, seed=0, local_rank=-1
remote: Enumerating objects: 1, done.[K
remote: Counting objects: 100% (1/1), done.[K
remote: Total 1 (delta 0), reused 0 (delta 0), pack-reused 0[K
Unpacking objects: 100% (1/1), 798 bytes | 798.00 KiB/s, done.
From https://github.com/ultralytics/yolov5
   ff6a9ac..f3ee596  master     -> origin/master
[34m[1mgithub: [0m⚠️ YOLOv5 is out of date by 1 commit. Use 'git pull' or 'git clone https://github.com/ultralytics/yolov5' to update.
YOLOv5 🚀 v7.0-153-gff6a9ac Python-3.9.16 torch-2.0.0+cu118 CPU

[34m[1mTensorBoard: [0mStart with 'tensorboard --logdir runs/train-cls', view at http://localh

### Validate Your Custom Model

Repeat step 2 from above to test and validate your custom model.

In [12]:
!python classify/val.py --weights runs/train-cls/exp/weights/best.pt --data ../datasets/$DATASET_NAME

[34m[1mclassify/val: [0mdata=../datasets/plant-leaf-classification-5, weights=['runs/train-cls/exp/weights/best.pt'], batch_size=128, imgsz=224, device=, workers=8, verbose=True, project=runs/val-cls, name=exp, exist_ok=False, half=False, dnn=False
YOLOv5 🚀 v7.0-153-gff6a9ac Python-3.9.16 torch-2.0.0+cu118 CPU

Fusing layers... 
Model summary: 117 layers, 4170531 parameters, 0 gradients, 10.4 GFLOPs
testing: 100% 3/3 [00:51<00:00, 17.32s/it]
                   Class      Images    top1_acc    top5_acc
                     all         300       0.997           1
              lemongrass         100        0.99           1
                    neem         100           1           1
                   tulsi         100           1           1
Speed: 0.0ms pre-process, 60.4ms inference, 0.0ms post-process per image at shape (1, 3, 224, 224)
Results saved to runs/val-cls/exp


### Infer With Your Custom Model

In [13]:
#Get the path of an image from the test or validation set
if os.path.exists(os.path.join(dataset.location, "test")):
    split_path = os.path.join(dataset.location, "test")
else:
    # Fall back to the validation split when there is no test split
    split_path = os.path.join(dataset.location, "valid")
example_class = os.listdir(split_path)[0]
example_image_name = os.listdir(os.path.join(split_path, example_class))[0]
example_image_path = os.path.join(split_path, example_class, example_image_name)
os.environ["TEST_IMAGE_PATH"] = example_image_path

print(f"Inferring on an example of the class '{example_class}'")

#Infer
!python classify/predict.py --weights runs/train-cls/exp/weights/best.pt --source $TEST_IMAGE_PATH

Inferring on an example of the class 'tulsi'
[34m[1mclassify/predict: [0mweights=['runs/train-cls/exp/weights/best.pt'], source=/content/datasets/plant-leaf-classification-5/test/tulsi/-96-_jpg.rf.ff5b81a98c8e631749a71746e31828ff.jpg, data=data/coco128.yaml, imgsz=[224, 224], device=, view_img=False, save_txt=False, nosave=False, augment=False, visualize=False, update=False, project=runs/predict-cls, name=exp, exist_ok=False, half=False, dnn=False, vid_stride=1
YOLOv5 🚀 v7.0-153-gff6a9ac Python-3.9.16 torch-2.0.0+cu118 CPU

Fusing layers... 
Model summary: 117 layers, 4170531 parameters, 0 gradients, 10.4 GFLOPs
image 1/1 /content/datasets/plant-leaf-classification-5/test/tulsi/-96-_jpg.rf.ff5b81a98c8e631749a71746e31828ff.jpg: 224x224 tulsi 0.77, neem 0.12, lemongrass 0.11, 25.3ms
Speed: 0.1ms pre-process, 25.3ms inference, 0.3ms NMS per image at shape (1, 3, 224, 224)
Results saved to runs/predict-cls/exp3


The inference results show roughly 25 ms of inference time and the predicted probability for each class.

## (OPTIONAL) Improve Our Model with Active Learning

Now that we've trained our model once, we will want to continue to improve its performance. Improvement is largely dependent on improving our dataset.

We can programmatically upload example failure images back to our custom dataset based on conditions (like seeing an underrepresented class or a low confidence score) using the same `pip` package.

In [None]:
# # Upload example image
# project.upload(image_path)


In [None]:
# # Example upload code 
# min_conf = float("inf")
# for pred in results:
#     if pred["score"] < min_conf:
#         min_conf = pred["score"]
# if min_conf < 0.4:
#     project.upload(image_path)
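The commented example above flags an image when its lowest score is under 0.4. For a classifier whose scores sum to 1 across a few classes, the lowest score will almost always be under that value, so a variation worth considering is to check the top-class score instead. The sketch below shows that variation. The `needs_review` helper, the prediction dictionaries, and the file names are hypothetical, and the upload call is left as a comment.

```python
def needs_review(predictions, threshold=0.4):
    """Return True when the best class score falls below the threshold."""
    if not predictions:
        return True  # no prediction at all is also worth a look
    top_score = max(pred["score"] for pred in predictions)
    return top_score < threshold


# Hypothetical predictions for two images
confident = [{"class": "tulsi", "score": 0.77}, {"class": "neem", "score": 0.12}]
uncertain = [{"class": "neem", "score": 0.36}, {"class": "tulsi", "score": 0.34}]

for image_path, preds in [("a.jpg", confident), ("b.jpg", uncertain)]:
    if needs_review(preds):
        # In the notebook this is where project.upload(image_path) would go
        print(f"would upload {image_path}")
```

The threshold is a tuning choice: a higher value sends more images back for labeling, a lower value sends only the most uncertain ones.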

# (BONUS) YOLOv5 classify/predict.py Accepts Several Input Methods
- Webcam: `python classify/predict.py --weights yolov5s-cls.pt --source 0`
- Image: `python classify/predict.py --weights yolov5s-cls.pt --source img.jpg`
- Video: `python classify/predict.py --weights yolov5s-cls.pt --source vid.mp4`
- Directory: `python classify/predict.py --weights yolov5s-cls.pt --source path/`
- Glob: `python classify/predict.py --weights yolov5s-cls.pt --source 'path/*.jpg'`
- YouTube: `python classify/predict.py --weights yolov5s-cls.pt --source 'https://youtu.be/Zgi9g1ksQHc'`
- RTSP, RTMP, HTTP stream: `python classify/predict.py --weights yolov5s-cls.pt --source 'rtsp://example.com/media.mp4'`

In [14]:
!python classify/predict.py --weights yolov5s-cls.pt --source 0

[34m[1mclassify/predict: [0mweights=['yolov5s-cls.pt'], source=0, data=data/coco128.yaml, imgsz=[224, 224], device=, view_img=False, save_txt=False, nosave=False, augment=False, visualize=False, update=False, project=runs/predict-cls, name=exp, exist_ok=False, half=False, dnn=False, vid_stride=1
YOLOv5 🚀 v7.0-153-gff6a9ac Python-3.9.16 torch-2.0.0+cu118 CPU

Fusing layers... 
Model summary: 117 layers, 5447688 parameters, 0 gradients, 11.4 GFLOPs

[ WARN:0@2.523] global cap_v4l.cpp:982 open VIDEOIO(V4L2:/dev/video0): can't open camera by index
[ERROR:0@2.524] global obsensor_uvc_stream_channel.cpp:156 getStreamChannelGroup Camera index out of range
Traceback (most recent call last):
  File "/content/yolov5/classify/predict.py", line 226, in <module>
    main(opt)
  File "/content/yolov5/classify/predict.py", line 221, in main
    run(**vars(opt))
  File "/usr/local/lib/python3.9/dist-packages/torch/utils/_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)

### Directory Example

In [None]:
#Directory infer
# os.path.dirname keeps the leading "/" of an absolute path, so no manual prefix is needed
os.environ["TEST_CLASS_PATH"] = test_class_path = os.path.dirname(os.environ["TEST_IMAGE_PATH"])
print(f"Inferring on all images from the directory {os.environ['TEST_CLASS_PATH']}")
!python classify/predict.py --weights runs/train-cls/exp/weights/best.pt --source $TEST_CLASS_PATH/

### YouTube Example

In [None]:
#YouTube infer
!python classify/predict.py --weights runs/train-cls/exp/weights/best.pt --source 'https://www.youtube.com/watch?v=7AlYA4ItA74'