Object Detection Project

A collection of object detection and image classification projects using deep learning models, primarily built with YOLO (You Only Look Once) and other computer vision frameworks.

Overview

This repository contains multiple computer vision projects including object detection, image classification, and facial emotion recognition. Each project focuses on detecting or classifying specific objects or categories in images using deep learning models such as YOLO11 and ResNet50.

Project Types:

Image Classification: Categorizing entire images (e.g., Mushroom Type Prediction, Emotion Detection)
Object Detection: Locating and identifying multiple objects in images (future projects)
Facial Analysis: Recognizing emotions and facial features (Emotion Detection)

What is Object Detection?

Object detection is a computer vision task that involves:

Localization: Finding the location of objects in an image (bounding boxes)
Classification: Identifying what type of objects are present
Confidence Scoring: Providing probability scores for each detection

Unlike image classification (which identifies what's in an entire image), object detection can identify and locate multiple objects within a single image.
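To make the distinction concrete, the sketch below models the two kinds of output as plain Python data classes. The names `Detection`, `ClassificationResult`, and `filter_by_confidence` are hypothetical and are not taken from this repository or from any library; the 0.5 threshold and the sample values are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class ClassificationResult:
    """One label for the whole image (hypothetical structure)."""
    label: str
    confidence: float  # probability in [0, 1]


@dataclass
class Detection:
    """One located object (hypothetical structure)."""
    box: Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels
    label: str
    confidence: float  # probability in [0, 1]


def filter_by_confidence(detections: List[Detection], threshold: float = 0.5) -> List[Detection]:
    """Keep only detections whose confidence meets the threshold."""
    return [d for d in detections if d.confidence >= threshold]


# A classifier returns a single result; a detector returns a list of detections.
whole_image = ClassificationResult(label="mushroom", confidence=0.91)
objects = [
    Detection(box=(10, 20, 110, 220), label="person", confidence=0.88),
    Detection(box=(150, 40, 300, 200), label="car", confidence=0.35),
]
confident = filter_by_confidence(objects)
print(whole_image.label, [d.label for d in confident])
```

The detector output is a list because a single image can contain any number of objects, including none.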

Project Structure

Object_Detection/
│
├── README.md                          # This file - general project documentation
│
├── Mushroom_Type_Prediction/          # Mushroom classification project
│   ├── README.md                      # Project-specific documentation
│   ├── main.py                        # Main prediction script
│   ├── mushroom-prediction.ipynb      # Training notebook
│   ├── Models/                        # Trained model files
│   │   └── mushroom.pt
│   └── images/                        # Input images folder
│
├── Emotion_Detection/                 # Facial emotion recognition project
│   ├── predict.py                     # Prediction script
│   ├── train.py                       # Training script
│   ├── evaluate_model.py              # Model evaluation script
│   ├── prepare_data.py                # Data preparation utility
│   ├── prepare_kaggle.py              # Kaggle dataset preparation
│   ├── requirements.txt               # Project dependencies
│   ├── models/                        # Trained model files
│   │   └── emotion_model_best.pth
│   ├── Data/                          # Dataset folder
│   │   ├── train/                     # Training images
│   │   ├── valid/                     # Validation images
│   │   └── test/                      # Test images
│   └── results/                       # Evaluation results and plots
│
└── [Future Projects]/                 # Additional projects will be added here

Projects

1. Mushroom Type Prediction

Type: Image Classification
Model: YOLO11 Classification Model
Purpose: Classify mushroom images into different species

Location: Mushroom_Type_Prediction/
Model File: Models/mushroom.pt
Usage: See Mushroom_Type_Prediction/README.md

Features:

Processes multiple images from a folder
Outputs predicted mushroom species with confidence scores
Supports multiple image formats (JPG, PNG, BMP, TIFF, WebP)

2. Emotion Detection

Type: Facial Emotion Recognition / Image Classification
Model: ResNet50 (PyTorch)
Purpose: Detect and classify facial emotions from images

Location: Emotion_Detection/
Model File: models/emotion_model_best.pth
Emotion Classes: Angry, Fear, Happy, Sad, Surprise

Features:

Single image emotion prediction with confidence scores
Complete training pipeline with data augmentation
Model evaluation with confusion matrix and classification reports
Data preparation utilities for organizing datasets
Visualization of results and performance metrics
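Classification networks commonly output one raw score (logit) per class, and a confidence score is typically obtained by applying softmax. The sketch below shows that step in plain Python. It is an illustration only: this README does not confirm how predict.py computes confidence, the class order in `EMOTION_CLASSES` is an assumption, and the logits are made up.

```python
import math
from typing import List, Tuple

# Class order is an assumption; the real order depends on how the dataset was loaded.
EMOTION_CLASSES = ["Angry", "Fear", "Happy", "Sad", "Surprise"]


def softmax(logits: List[float]) -> List[float]:
    """Convert raw scores to probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_prediction(logits: List[float]) -> Tuple[str, float]:
    """Return the most likely class and its confidence."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return EMOTION_CLASSES[best], probs[best]


label, confidence = top_prediction([0.2, -1.0, 3.1, 0.4, 1.2])  # made-up logits
print(f"{label}: {confidence:.2f}")
```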

Scripts:

predict.py: Predict emotion from a single image
train.py: Train the ResNet50 model on emotion dataset
evaluate_model.py: Evaluate model performance on test set
prepare_data.py: Prepare and organize dataset structure
prepare_kaggle.py: Prepare Kaggle datasets for training

Usage:

cd Emotion_Detection
# Install dependencies
pip install -r requirements.txt

# Train model
python train.py

# Predict emotion from image
python predict.py path/to/image.jpg

# Evaluate model
python evaluate_model.py

More projects will be added here as they are developed

Getting Started

Prerequisites

Python 3.8 or higher
pip (Python package manager)
CUDA-capable GPU (optional, but recommended for faster training and inference)

Installation

Clone the repository:

git clone <repository-url>
cd Object_Detection

Install core dependencies:

# For YOLO-based projects (Mushroom Type Prediction)
pip install ultralytics

# For PyTorch-based projects (Emotion Detection)
pip install torch torchvision

Note: Each project may have specific requirements. Check individual project folders for requirements.txt files:

Mushroom_Type_Prediction/: Uses Ultralytics YOLO
Emotion_Detection/: Uses PyTorch, ResNet50 - see requirements.txt
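After installing, it can help to confirm that the core packages are importable before running a project. The following is a minimal sketch using only the standard library; `missing_packages` is a hypothetical helper, not part of this repository.

```python
from importlib.util import find_spec
from typing import Iterable, List


def missing_packages(module_names: Iterable[str]) -> List[str]:
    """Return the top-level modules that cannot be found in this environment."""
    return [name for name in module_names if find_spec(name) is None]


# Import names of the core dependencies installed above.
core_modules = ["ultralytics", "torch", "torchvision"]
missing = missing_packages(core_modules)
print("Missing:", ", ".join(missing) if missing else "none")
```

`find_spec` only locates a module without importing it, so the check stays fast even for large packages such as torch.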

Object Detection Concepts

Key Terms

Bounding Box: A rectangular box that outlines the location of an object in an image
Confidence Score: A probability value (0-1) indicating how certain the model is about a detection
Class: The category or type of object being detected (e.g., "person", "car", "mushroom")
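IoU (Intersection over Union): The area of overlap between a predicted box and a ground-truth box divided by the area of their union (0 = no overlap, 1 = identical boxes); used to decide whether a detection counts as correct
NMS (Non-Maximum Suppression): A technique that removes duplicate detections by keeping the highest-scoring box and discarding boxes whose IoU with it exceeds a threshold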
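IoU and NMS are easiest to understand from a small implementation. The sketch below is a plain-Python, class-agnostic version for illustration; it is not the code used by Ultralytics or by the projects in this repository. The box format `(x1, y1, x2, y2)`, the function names, and the 0.5 threshold are assumptions.

```python
from typing import List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), with x1 < x2 and y1 < y2


def iou(a: Box, b: Box) -> float:
    """Intersection over Union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    intersection = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float], iou_threshold: float = 0.5) -> List[int]:
    """Greedy NMS: return indices of kept boxes, highest score first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    kept: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in kept):
            kept.append(i)
    return kept


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(nms(boxes, scores))  # the second box overlaps the first and is suppressed
```

In practice NMS is usually applied per class, so overlapping boxes with different labels are not suppressed against each other.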

Detection vs Classification

Feature            Object Detection            Image Classification
Output             Bounding boxes + classes    Single class label
Multiple Objects   Yes                         No
Location Info      Yes                         No
Use Case           Finding objects in scenes   Categorizing entire images

Model Types

YOLO (You Only Look Once)

YOLO is a popular real-time object detection system that processes entire images in a single pass.

Variants:

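YOLOv8: Ultralytics release supporting detection, classification, segmentation, and pose tasks
YOLO11: A later Ultralytics release (written without the "v"); used by Mushroom Type Prediction
Nano models (e.g., yolov8n, yolo11n): Smallest model size, suited to mobile/edge devices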

Modes:

Detection: Detects objects with bounding boxes
Classification: Classifies entire images
Segmentation: Pixel-level object segmentation
Pose Estimation: Detects human poses

Other Models

ResNet: Deep residual networks for image classification (used in Emotion Detection)
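R-CNN Family: Two-stage, region-based CNN detectors (typically slower than single-stage models, often with strong accuracy)
SSD: Single Shot Detector (single-stage; balances speed and accuracy)
RetinaNet: Single-stage detector that combines a feature pyramid network with focal loss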
CNN Architectures: Various convolutional neural networks for classification tasks

Requirements

Core Dependencies

YOLO Projects (Mushroom Type Prediction):

ultralytics>=8.0.0
torch>=1.8.0
torchvision>=0.9.0
opencv-python>=4.6.0
pillow>=7.1.2
numpy>=1.23.0

PyTorch Projects (Emotion Detection):

torch>=2.0.0
torchvision>=0.15.0
numpy>=1.25.2,<2.0.0
pillow>=9.0.0
matplotlib>=3.5.0
scikit-learn>=1.0.0
pandas>=1.3.0
seaborn>=0.11.0
scipy>=1.10.0
tqdm>=4.65.0

Optional Dependencies

matplotlib>=3.3.0    # For visualization
pandas>=1.1.4        # For data handling
tqdm>=4.64.0         # For progress bars

Usage

Running a Project

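Each project has its own entry-point script: main.py for Mushroom Type Prediction, predict.py for Emotion Detection. For projects that follow the main.py convention, navigate to the project directory and run: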

cd [Project_Name]
python main.py

General Workflow

Prepare Data: Place images in the designated images/ folder
Load Model: The script automatically loads the trained model
Run Inference: Process images and get predictions
View Results: Check console output or saved result files

Examples

Mushroom Classification:

cd Mushroom_Type_Prediction
# Add images to the images/ folder
python main.py

Emotion Detection:

cd Emotion_Detection
# Predict emotion from an image
python predict.py path/to/image.jpg

# Or train a new model
python train.py

Adding New Projects

To add a new object detection project:

Create a new folder: [Project_Name]/

Add project structure:

[Project_Name]/
├── README.md          # Project documentation
├── main.py            # Main script
├── Models/            # Model files
└── images/            # Input images

Update this README.md to include the new project
Follow the existing code structure and documentation style

Project Template

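# main.py template
from pathlib import Path

from ultralytics import YOLO

# Image extensions to pick up from the images/ folder
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".tiff", ".webp"}


def main():
    # Load the trained model
    model = YOLO("Models/model.pt")

    # Process every supported image in the input folder
    images_folder = Path("images")
    for image_path in sorted(images_folder.iterdir()):
        if image_path.suffix.lower() not in IMAGE_EXTENSIONS:
            continue
        results = model(str(image_path))
        # Process results (the call returns a list of result objects)
        print(f"Results for {image_path.name}: {len(results)} result object(s)")


if __name__ == "__main__":
    main()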

Model Training

For training new models, refer to individual project notebooks or training scripts. General training steps:

Data Preparation: Organize images into train/validation/test splits (Emotion Detection uses train/, valid/, and test/ folders)
Annotation: Label objects (for detection) or organize by class (for classification)
Training: Run training script with appropriate hyperparameters
Validation: Evaluate model performance on validation set
Export: Save trained model for inference
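The data preparation step can be sketched as a reproducible random split. This is an illustrative example, not the logic of prepare_data.py or prepare_kaggle.py; the `split_dataset` helper, the 70/15/15 ratios, the seed, and the file names are all assumptions.

```python
import random
from typing import Dict, List, Sequence


def split_dataset(
    items: Sequence[str], train: float = 0.7, valid: float = 0.15, seed: int = 42
) -> Dict[str, List[str]]:
    """Shuffle items and split them into train/valid/test lists."""
    if not 0 < train + valid < 1:
        raise ValueError("train + valid must leave room for a test split")
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split reproducible
    n_train = int(len(shuffled) * train)
    n_valid = int(len(shuffled) * valid)
    return {
        "train": shuffled[:n_train],
        "valid": shuffled[n_train:n_train + n_valid],
        "test": shuffled[n_train + n_valid:],
    }


files = [f"img_{i:03d}.jpg" for i in range(100)]  # placeholder file names
splits = split_dataset(files)
print({name: len(paths) for name, paths in splits.items()})
```

For classification datasets, splitting each class folder separately keeps the class proportions similar across the three splits.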

Performance Metrics

Common metrics used in object detection:

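mAP (mean Average Precision): Average Precision (area under the precision-recall curve) averaged over all classes, often reported at one or more IoU thresholds
Precision: Ratio of correct detections to total detections, TP / (TP + FP)
Recall: Ratio of detected objects to total ground-truth objects, TP / (TP + FN)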
F1-Score: Harmonic mean of precision and recall
Inference Speed: FPS (frames per second) for real-time applications
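Precision, recall, and F1 follow directly from counts of true positives (TP), false positives (FP), and false negatives (FN). The sketch below computes them for made-up counts; the helper name `precision_recall_f1` is hypothetical.

```python
from typing import Tuple


def precision_recall_f1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Compute precision, recall, and F1 from detection counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return precision, recall, f1


# Made-up counts: 8 correct detections, 2 false alarms, 4 missed objects.
p, r, f1 = precision_recall_f1(tp=8, fp=2, fn=4)
print(f"precision={p:.3f} recall={r:.3f} f1={f1:.3f}")
```

For detection, whether a prediction counts as a true positive usually depends on its IoU with a ground-truth box of the same class exceeding a chosen threshold.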

Troubleshooting

Common Issues

Model not found: Ensure the model file exists in the project's model folder (Models/ for Mushroom Type Prediction, models/ for Emotion Detection)
No images found: Check that images are in the correct folder with supported formats
CUDA errors: Install CUDA-compatible PyTorch or use CPU mode
Memory errors: Reduce batch size or image resolution
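The first two issues can be caught before any model is loaded. The following is a minimal preflight sketch using only the standard library; `preflight` is a hypothetical helper, the paths are the template defaults, and the extension set is an assumption based on the formats listed for Mushroom Type Prediction.

```python
from pathlib import Path
from typing import List

# Assumed set of supported extensions; adjust to match the project.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".tiff", ".webp"}


def preflight(model_path: str, images_dir: str) -> List[str]:
    """Return a list of human-readable problems; an empty list means ready to run."""
    problems: List[str] = []
    if not Path(model_path).is_file():
        problems.append(f"Model not found: {model_path}")
    folder = Path(images_dir)
    if not folder.is_dir():
        problems.append(f"Images folder not found: {images_dir}")
    elif not any(p.suffix.lower() in SUPPORTED_EXTENSIONS for p in folder.iterdir()):
        problems.append(f"No supported images in: {images_dir}")
    return problems


for problem in preflight("Models/model.pt", "images"):
    print(problem)
```

Comparing lowercased suffixes means files such as photo.JPG are still picked up.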

Getting Help

Check individual project README files
Review model documentation: Ultralytics Docs
Check error messages for specific guidance

Contributing

When contributing:

Follow the existing project structure
Add comprehensive documentation
Include example usage
Test with sample images
Update this README with new projects

Acknowledgments

Ultralytics for YOLO framework
Open source computer vision community

Resources

Last Updated: 10-12-2025
Maintained by: Yahya
