This project is a from-scratch implementation of the YOLOv1 (You Only Look Once) object detection paper using PyTorch. I implemented the entire pipeline (architecture, loss function, dataset parsing, and model training) to understand how YOLO works at its core.
- 🧩 ResNet-18 Backbone (for feature extraction)
- ⚙️ YOLOv1 Loss Function (custom-built using MSELoss)
- 📦 Custom Dataset Loader for PASCAL VOC 2007 + 2012
- 🕒 15+ Hours of Training with Checkpointing & Logging
- 🧮 Complete Web App using Flask for image detection demo
- Optimizer: Adam
- Learning Rate Scheduler: StepLR
- Loss Function: MSE with λ_coord = 5, λ_noobj = 0.5
- Dataset: PASCAL VOC 2007 + 2012 (XML annotations parsed)
- Framework: PyTorch
- Web Framework: Flask
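As a reference for how these pieces wire together in PyTorch, here is a minimal sketch of the training setup. The tiny placeholder network and the hyperparameter values (`lr`, `step_size`, `gamma`) are illustrative assumptions, not the exact ones used in this repo:

```python
import torch
import torch.nn as nn
from torch.optim import Adam
from torch.optim.lr_scheduler import StepLR

# Placeholder network standing in for the real ResNet-18 + YOLO head.
model = nn.Conv2d(3, 30, kernel_size=1)
optimizer = Adam(model.parameters(), lr=1e-4)
scheduler = StepLR(optimizer, step_size=10, gamma=0.1)  # decay the lr every 10 epochs

for epoch in range(2):  # tiny loop with random data, just to show the wiring
    images = torch.randn(4, 3, 64, 64)
    targets = torch.randn(4, 30, 64, 64)
    # Plain MSE here; the custom YOLOv1 loss (sketched further below)
    # replaces this in the real training loop.
    loss = nn.functional.mse_loss(model(images), targets)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    scheduler.step()
```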
The YOLOv1 loss function combines:
- Localization loss (for bounding box coordinates)
- Confidence loss (for objectness scores)
- Classification loss (for class probabilities)
I replicated the official YOLOv1 loss equation and implemented it using torch.nn.MSELoss with custom weighting factors:
λ_coord = 5.0
λ_noobj = 0.5
This weighting ensures that bounding box coordinate errors are penalized more heavily, while grid cells without objects contribute less to the loss.
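As a sketch of how this looks in code, here is a simplified version of the loss: it predicts one box per grid cell instead of the paper's two (so the IoU-based "responsible box" selection is omitted), and the tensor layout is an assumption about this repo's target encoding, not its actual definition:

```python
import torch
import torch.nn as nn

class YoloV1Loss(nn.Module):
    """Simplified YOLOv1 loss sketch (one box per cell, S x S grid, C classes).

    Assumed tensor layout: (N, S, S, C + 5), with the last dimension being
    [class probs..., x, y, w, h, confidence].
    """

    def __init__(self, S=7, C=20, lambda_coord=5.0, lambda_noobj=0.5):
        super().__init__()
        self.S, self.C = S, C
        self.lambda_coord = lambda_coord
        self.lambda_noobj = lambda_noobj
        self.mse = nn.MSELoss(reduction="sum")

    def forward(self, pred, target):
        C = self.C
        obj = target[..., C + 4:C + 5]   # 1 where the cell contains an object
        noobj = 1.0 - obj

        # Localization loss: x, y directly; sqrt of w, h as in the paper.
        # sign/abs/epsilon guard against negative or zero raw predictions.
        xy_loss = self.mse(obj * pred[..., C:C + 2], obj * target[..., C:C + 2])
        wh_loss = self.mse(
            obj * torch.sign(pred[..., C + 2:C + 4])
                * (pred[..., C + 2:C + 4].abs() + 1e-6).sqrt(),
            obj * target[..., C + 2:C + 4].sqrt(),
        )

        # Confidence loss, split into object / no-object cells.
        conf_pred = pred[..., C + 4:C + 5]
        conf_tgt = target[..., C + 4:C + 5]
        obj_conf_loss = self.mse(obj * conf_pred, obj * conf_tgt)
        noobj_conf_loss = self.mse(noobj * conf_pred, noobj * conf_tgt)

        # Classification loss, only on cells that contain an object.
        class_loss = self.mse(obj * pred[..., :C], obj * target[..., :C])

        return (self.lambda_coord * (xy_loss + wh_loss)
                + obj_conf_loss
                + self.lambda_noobj * noobj_conf_loss
                + class_loss)
```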
The YOLOv1 head is built on top of a ResNet-18 backbone pre-trained on ImageNet. It outputs a grid structure that predicts bounding boxes and class probabilities in a single forward pass, enabling real-time object detection without region proposals.
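The sketch below shows one way to attach a YOLO-style grid head to a pre-trained ResNet-18. S, B, and C follow the paper's defaults (7×7 grid, 2 boxes, 20 VOC classes); the exact head layers here are an assumption for illustration, not this repo's definition:

```python
import torch.nn as nn
from torchvision.models import resnet18

class YoloV1ResNet(nn.Module):
    """Sketch: YOLOv1-style grid head on an ImageNet-pretrained ResNet-18."""

    def __init__(self, S=7, B=2, C=20):
        super().__init__()
        backbone = resnet18(weights="IMAGENET1K_V1")
        # Drop the average pool and FC layers; keep the conv feature maps.
        self.features = nn.Sequential(*list(backbone.children())[:-2])
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool2d((S, S)),               # force an S x S spatial grid
            nn.Conv2d(512, B * 5 + C, kernel_size=1),   # per-cell box + class predictions
        )

    def forward(self, x):
        x = self.head(self.features(x))   # (N, B*5+C, S, S)
        return x.permute(0, 2, 3, 1)      # (N, S, S, B*5+C)
```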
🧪 Web App Demo: click here for the live app
The Flask web app allows users to upload images and view detection results instantly. Due to Hugging Face Spaces limitations, the live webcam detection feature is disabled — but you can find a recorded demo video of live detections on my LinkedIn.
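The upload-and-detect flow boils down to a small Flask route along these lines. This is a sketch only: `run_detection` is a hypothetical stand-in for the actual model inference, and the real app's routes and templates may differ:

```python
from flask import Flask, request, render_template_string
from PIL import Image

app = Flask(__name__)

def run_detection(image):
    # Placeholder for the actual model inference; returns (label, score) pairs.
    return [("person", 0.92)]

@app.route("/", methods=["GET", "POST"])
def index():
    if request.method == "POST":
        image = Image.open(request.files["image"].stream).convert("RGB")
        results = run_detection(image)
        return render_template_string(
            "{% for label, score in results %}{{ label }}: {{ score }}<br>{% endfor %}",
            results=results,
        )
    # Minimal upload form for the GET request.
    return ('<form method="post" enctype="multipart/form-data">'
            '<input type="file" name="image"><input type="submit"></form>')

if __name__ == "__main__":
    app.run()
```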
👨 Person 🚗 Car 🐱 Cat 🐶 Dog 🚌 Bus 🚲 Bicycle
This project became a cornerstone of my AI research journey. I now have a deep understanding of YOLO's architecture, loss design, and real-time detection principles, and it marks my growth from reading research papers to building real implementations.