Multi-Object Tracking Full-Stack System

Overview

This is a full-stack multi-object tracking system built with React (frontend) and Flask (backend), utilizing YOLOv5 for object detection and a combination of multiple tracking metrics including the Hungarian algorithm for optimal association. The processed videos and related information are stored in Amazon S3, while MongoDB is used as the primary database to store metadata about the uploaded files.

Key Features

Real-time object detection using YOLOv5
Multi-metric tracking system combining:
- IoU (Intersection over Union)
- Sanchez-Matilla distance
- Yu exponential cost function
- Deep feature matching using Siamese networks
Hungarian algorithm for optimal detection-track association
Track management with age-based filtering
Full-stack integration with React frontend, Flask backend, MongoDB database, and AWS S3 for file storage
Unique file ID generation upon upload, which allows retrieval of tracking statistics and processed videos

System Architecture

Frontend (React + TypeScript)

Built with Vite + TypeScript for fast and modular development
Provides a user-friendly UI to:
- Upload files
- Retrieve tracking statistics
- View processed videos stored in Amazon S3

Backend (Flask + Python)

Handles file uploads and processing
Stores file metadata in MongoDB
Generates and returns a unique file ID for each uploaded file
Provides APIs to fetch tracking results and serve processed videos from S3

Storage (AWS S3 + MongoDB)

Amazon S3 stores processed video files
MongoDB stores metadata including file ID, file name, and S3 links

Project Structure

├── dataset/
│   ├── images/
│   ├── nvidia_ai_challenge_images/
│   └── surveillance_videos/
├── models/
│   ├── coco.names
│   ├── model640.pt
│   └── yolov5s.pt
├── object_tracking.py
├── object_tracking_api.py
├── siamese_net.py
├── requirements.txt
├── object-tracking-frontend/
│   ├── src/
│   │   ├── components/
│   │   ├── pages/
│   │   ├── assets/
│   │   ├── App.tsx
│   │   ├── Navbar.tsx
│   │   ├── UploadComponent.tsx
│   │   ├── FetchResults.tsx
│   │   ├── ViewVideo.tsx
│   ├── package.json
│   ├── vite.config.ts
│   ├── tsconfig.json
│   └── README.md
└── README.md

Installation

Clone the repository:

git clone https://github.com/yourusername/object-tracking.git
cd object-tracking

Setup the backend:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

Setup the frontend:

cd object-tracking-frontend
npm install
npm run dev

Usage

Uploading a File

Navigate to the Upload File page.
Upload a video file.
A unique file ID will be generated and stored in the database.

Querying a File ID

The Home page displays all stored file IDs.
The user can copy a file ID and use it to fetch tracking statistics.

Viewing Processed Video

The processed video is stored in S3, and the link is retrieved from MongoDB.
Enter a File ID in the View Processed Video page to watch the processed result.

Screenshots

Uploading a File

Querying Tracking Statistics

Viewing Processed Video

Home Page with Stored Files

Copy File ID Feature

Processed Video Example

AWS S3 Bucket

Local MongoDB

Performance Metrics

Processing Speed: ~20-30 FPS (hardware-dependent)
Detection Accuracy: >90% mAP@0.5 (YOLOv5s)
Tracking Robustness: Effectively handles occlusions and object interactions

Technical Considerations

Memory Management

Efficient frame processing with OpenCV
Batch processing for feature extraction
GPU acceleration supported for both detection and feature extraction

Optimization

Vectorized operations for cost matrix computation
Hungarian algorithm for efficient tracking
Parallel processing of detection and feature extraction

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Question in interview.

How will you handle failure? OR What will be the system behavior if backend goes down or ML part of script is down ? Ans: Currently system architecture is tightly coupled. Video uploaded -> sent to backend -> sent to ML part -> results recived -> pushed to MongoDB & S3. Anything of this goes down, complete application is down. So it is tightly coupled. To make it robust, Use AWS SQS and lambda. between All stages of application so if any of part goes down, application is not down and data is not lost.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
SCREENSHOTS		SCREENSHOTS
models		models
object-tracking-frontend		object-tracking-frontend
.gitignore		.gitignore
README.md		README.md
object_tracking.py		object_tracking.py
object_tracking_api.py		object_tracking_api.py
old_working_code.py		old_working_code.py
output.txt		output.txt
output_demo.mp4		output_demo.mp4
output_video.mp4		output_video.mp4
requirements.txt		requirements.txt
siamese_net.py		siamese_net.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Object Tracking Full-Stack System

Overview

Key Features

System Architecture

Frontend (React + TypeScript)

Backend (Flask + Python)

Storage (AWS S3 + MongoDB)

Project Structure

Installation

Usage

Uploading a File

Querying a File ID

Viewing Processed Video

Screenshots

Uploading a File

Querying Tracking Statistics

Viewing Processed Video

Home Page with Stored Files

Copy File ID Feature

Processed Video Example

AWS S3 Bucket

Local MongoDB

Performance Metrics

Technical Considerations

Memory Management

Optimization

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Multi-Object Tracking Full-Stack System

Overview

Key Features

System Architecture

Frontend (React + TypeScript)

Backend (Flask + Python)

Storage (AWS S3 + MongoDB)

Project Structure

Installation

Usage

Uploading a File

Querying a File ID

Viewing Processed Video

Screenshots

Uploading a File

Querying Tracking Statistics

Viewing Processed Video

Home Page with Stored Files

Copy File ID Feature

Processed Video Example

AWS S3 Bucket

Local MongoDB

Performance Metrics

Technical Considerations

Memory Management

Optimization

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages