Classification with Ultralytics

This tutorial covers image classification using the Ultralytics YOLOv8 framework, which provides accurate and efficient classification capabilities.

Introduction to Image Classification

Image classification is a fundamental computer vision task that involves assigning a label or category to an entire image. Unlike object detection, which identifies and localizes multiple objects within an image, classification provides a single prediction for the entire image, answering the question "What is in this image?"

Key aspects of image classification:

Assigns a single label to the entire image
Typically outputs a probability distribution across all possible classes
Forms the foundation for many computer vision applications
Can be used for binary classification (two classes) or multi-class classification (multiple classes)

YOLOv8 for Classification

While YOLO (You Only Look Once) is primarily known for object detection, Ultralytics YOLOv8 also offers specialized classification models that are designed for image classification tasks. These models are based on efficient architectures and are trained on large datasets like ImageNet.

YOLOv8 classification models:

YOLOv8n-cls: Nano classification model
YOLOv8s-cls: Small classification model
YOLOv8m-cls: Medium classification model
YOLOv8l-cls: Large classification model
YOLOv8x-cls: Extra large classification model

Implementation

Basic Usage

from ultralytics import YOLO
import cv2
import numpy as np

# Load a pretrained YOLOv8 classification model
model = YOLO('yolov8n-cls.pt')  # 'n' for nano, other options: 's', 'm', 'l', 'x'

# Run inference on an image
results = model('path/to/image.jpg')

# Process results
for result in results:
    # Get the original image
    original_img = result.orig_img
    
    # Get the probs tensor
    probs = result.probs
    
    if probs is not None:
        # Get the top 5 class indices and their probabilities
        top5_indices = probs.top5
        top5_probs = probs.top5conf
        
        # Get class names
        class_names = model.names
        
        # Create a copy of the original image for display
        display_img = original_img.copy()
        
        # Add classification results to the image
        y_offset = 30
        for i in range(len(top5_indices)):
            class_idx = top5_indices[i]
            prob = top5_probs[i].item()
            class_name = class_names[class_idx]
            
            text = f"{class_name}: {prob:.2f}"
            cv2.putText(display_img, text, (10, y_offset), 
                        cv2.FONT_HERSHEY_SIMPLEX, 0.7, (0, 255, 0), 2)
            y_offset += 30
        
        # Display the result
        cv2.imshow("Classification Result", display_img)
        cv2.waitKey(0)
        cv2.destroyAllWindows()

Visualizing Results

import cv2
from ultralytics import YOLO
import numpy as np

# Load the model
model = YOLO('yolov8n-cls.pt')

# Load image
image = cv2.imread('path/to/image.jpg')

# Run inference
results = model(image)

# Create a copy of the image for display
display_img = image.copy()

# Get the probs from the first result
probs = results[0].probs

# Get the top 5 class indices and their probabilities
top5_indices = probs.top5
top5_probs = probs.top5conf

# Get class names
class_names = model.names

# Add classification results to the image
y_offset = 30
for i in range(len(top5_indices)):
    class_idx = top5_indices[i]
    prob = top5_probs[i].item()
    class_name = class_names[class_idx]
    
    text = f"{class_name}: {prob:.2f}"
    cv2.putText(display_img, text, (10, y_offset), 
                cv2.FONT_HERSHEY_SIMPLEX, 0.7, (0, 255, 0), 2)
    y_offset += 30

# Display the image
cv2.imshow("YOLOv8 Classification", display_img)
cv2.waitKey(0)
cv2.destroyAllWindows()

Real-time Classification

import cv2
from ultralytics import YOLO

# Load the model
model = YOLO('yolov8n-cls.pt')

# Open the video capture
cap = cv2.VideoCapture(0)  # Use 0 for webcam

while cap.isOpened():
    # Read a frame from the video
    success, frame = cap.read()
    
    if success:
        # Run YOLOv8 inference on the frame
        results = model(frame)
        
        # Get the probs from the first result
        probs = results[0].probs
        
        # Get the top 3 class indices and their probabilities
        top3_indices = probs.top5[:3]  # Get only top 3
        top3_probs = probs.top5conf[:3]  # Get only top 3
        
        # Get class names
        class_names = model.names
        
        # Create a copy of the frame for display
        display_frame = frame.copy()
        
        # Add classification results to the frame
        y_offset = 30
        for i in range(len(top3_indices)):
            class_idx = top3_indices[i]
            prob = top3_probs[i].item()
            class_name = class_names[class_idx]
            
            text = f"{class_name}: {prob:.2f}"
            cv2.putText(display_frame, text, (10, y_offset), 
                        cv2.FONT_HERSHEY_SIMPLEX, 0.7, (0, 255, 0), 2)
            y_offset += 30
        
        # Display the annotated frame
        cv2.imshow("YOLOv8 Classification", display_frame)
        
        # Break the loop if 'q' is pressed
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
    else:
        # Break the loop if the end of the video is reached
        break

# Release the video capture object and close the display window
cap.release()
cv2.destroyAllWindows()

Batch Processing

from ultralytics import YOLO
import os
import cv2
import numpy as np
from pathlib import Path

# Load a pretrained YOLOv8 classification model
model = YOLO('yolov8n-cls.pt')

# Define input and output directories
input_dir = 'path/to/input/images'
output_dir = 'path/to/output/images'

# Create output directory if it doesn't exist
os.makedirs(output_dir, exist_ok=True)

# Get all image files
image_extensions = ['.jpg', '.jpeg', '.png', '.bmp']
image_files = [f for f in os.listdir(input_dir) if any(f.lower().endswith(ext) for ext in image_extensions)]

# Process images in batches
batch_size = 4
for i in range(0, len(image_files), batch_size):
    batch_files = image_files[i:i+batch_size]
    batch_paths = [os.path.join(input_dir, f) for f in batch_files]
    
    # Run inference on batch
    results = model(batch_paths)
    
    # Process each result
    for j, result in enumerate(results):
        # Get the original image
        original_img = result.orig_img
        
        # Get the probs tensor
        probs = result.probs
        
        if probs is not None:
            # Get the top 3 class indices and their probabilities
            top3_indices = probs.top5[:3]
            top3_probs = probs.top5conf[:3]
            
            # Get class names
            class_names = model.names
            
            # Create a copy of the original image for display
            display_img = original_img.copy()
            
            # Add classification results to the image
            y_offset = 30
            for k in range(len(top3_indices)):
                class_idx = top3_indices[k]
                prob = top3_probs[k].item()
                class_name = class_names[class_idx]
                
                text = f"{class_name}: {prob:.2f}"
                cv2.putText(display_img, text, (10, y_offset), 
                            cv2.FONT_HERSHEY_SIMPLEX, 0.7, (0, 255, 0), 2)
                y_offset += 30
            
            # Save the result
            output_path = os.path.join(output_dir, f"classified_{batch_files[j]}")
            cv2.imwrite(output_path, display_img)
            
            print(f"Processed {batch_files[j]}: Top class is {class_names[top3_indices[0]]} with probability {top3_probs[0]:.2f}")

Custom Training

Training a custom YOLOv8 classification model requires a dataset with labeled images organized in a specific folder structure.

from ultralytics import YOLO

# Load a model
model = YOLO('yolov8n-cls.pt')  # load a pretrained classification model

# Train the model on a custom dataset
results = model.train(
    data='path/to/dataset',
    epochs=100,
    imgsz=224,
    batch=16,
    name='yolov8n_cls_custom'
)

Dataset Preparation

For classification, your dataset should be organized in the following structure:

dataset/
├── train/
│   ├── class1/
│   │   ├── image1.jpg
│   │   ├── image2.jpg
│   │   └── ...
│   ├── class2/
│   │   ├── image1.jpg
│   │   ├── image2.jpg
│   │   └── ...
│   └── ...
├── val/
│   ├── class1/
│   │   ├── image1.jpg
│   │   ├── image2.jpg
│   │   └── ...
│   ├── class2/
│   │   ├── image1.jpg
│   │   ├── image2.jpg
│   │   └── ...
│   └── ...
└── test/ (optional)
    ├── class1/
    │   ├── image1.jpg
    │   ├── image2.jpg
    │   └── ...
    ├── class2/
    │   ├── image1.jpg
    │   ├── image2.jpg
    │   └── ...
    └── ...

Each class should have its own folder, and images belonging to that class should be placed in the corresponding folder.

Best Practices

Model Selection
- Choose the appropriate classification model size based on your requirements
- Larger models provide better accuracy but are slower
Data Preparation
- Ensure balanced class distribution
- Include diverse examples for each class
- Use data augmentation to improve robustness
- Remove duplicate or very similar images
Training Tips
- Start with a pre-trained classification model
- Use appropriate batch size based on available GPU memory
- Monitor accuracy and loss during training
- Use early stopping to save the best model
- Consider learning rate scheduling
Inference Optimization
- Adjust confidence thresholds for optimal results
- Use batch processing for multiple images
- Consider model quantization for deployment
- Use hardware acceleration for real-time applications

Applications

Content Categorization
- Image sorting and organization
- Content filtering
- Media asset management
- Automatic tagging
Medical Imaging
- Disease classification
- Medical image categorization
- Diagnostic assistance
- Pathology image analysis
Industrial Inspection
- Product quality control
- Defect classification
- Material identification
- Manufacturing process monitoring
Agriculture
- Crop disease identification
- Plant species classification
- Fruit ripeness assessment
- Weed detection
Retail
- Product recognition
- Visual search
- Inventory management
- Customer behavior analysis

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
LICENSE		LICENSE
README.md		README.md
classification.py		classification.py
classification_simple.py		classification_simple.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Classification with Ultralytics

Table of Contents

Introduction to Image Classification

YOLOv8 for Classification

Implementation

Basic Usage

Visualizing Results

Real-time Classification

Batch Processing

Custom Training

Dataset Preparation

Best Practices

Applications

Further Reading

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

alideploy/opencv-python-tutorials

Folders and files

Latest commit

History

Repository files navigation

Classification with Ultralytics

Table of Contents

Introduction to Image Classification

YOLOv8 for Classification

Implementation

Basic Usage

Visualizing Results

Real-time Classification

Batch Processing

Custom Training

Dataset Preparation

Best Practices

Applications

Further Reading

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages