AI-Powered Object-Detection Service

A Next.js application that uses AI to detect and classify objects in images. Upload an image and get instant results showing what objects are detected and how many of each type.

🌟 Features

AI Object Detection: Powered by Hugging Face's Transformers.js library using the DETR ResNet-50 model
Real-time Processing: Instant object detection with confidence scoring
Image Storage: Automatic upload and storage of submitted images using UploadThing
Confidence Filtering: Only displays objects detected with >80% confidence
Object Counting: Automatically counts multiple instances of the same object

🏗️ Architecture

The application follows a containerized architecture deployed on AWS:

Component Flow

Frontend (Next.js): User interface for image upload and result display
Web Server: Handles requests and serves the pages rendered from the app's TSX components
AI Model: Xenova/detr-resnet-50 for object detection
UploadThing: Handles file uploads and temporary storage
AWS ECR: Stores Docker images
AWS ECS: Manages and orchestrates containers

🚀 Getting Started

Prerequisites

Node.js 18+
npm or yarn
UploadThing account and API keys
AWS account (for deployment)

Installation

Clone the repository:

git clone https://github.com/aleung910/Object_Detect_App.git
cd Object_Detect_App

Install dependencies:

npm install
# or
yarn install

Set up environment variables: Create a .env.local file in the root directory:

UPLOADTHING_SECRET=your_uploadthing_secret
UPLOADTHING_APP_ID=your_app_id
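
A missing key tends to surface only when the first upload fails. As a minimal sketch, a startup check could confirm that both variables are present. The helper name `requireEnv` is hypothetical and not part of the repository:

```typescript
// Hypothetical helper, not part of the repository: fail fast when a
// required environment variable is missing or empty.
type Env = Record<string, string | undefined>;

const REQUIRED_KEYS: readonly string[] = ["UPLOADTHING_SECRET", "UPLOADTHING_APP_ID"];

function requireEnv(env: Env, keys: readonly string[] = REQUIRED_KEYS): Record<string, string> {
  // Collect every missing key so the error reports all of them at once.
  const missing = keys.filter((key) => !env[key]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  const result: Record<string, string> = {};
  for (const key of keys) {
    result[key] = env[key] as string;
  }
  return result;
}

// Example (in a Node.js context): const config = requireEnv(process.env);
```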

Run the development server:

npm run dev
# or
yarn dev

Open http://localhost:3000 in your browser

📖 Usage

Navigate to the home page
Click the "Upload Object Image" button
Select an image file (.png or .jpg)
Click upload and wait for processing
View detected objects and their counts
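
Since the steps above mention only .png and .jpg files, a client could check the file name before uploading. The sketch below is illustrative only: `isSupportedImage` is a hypothetical helper, and the repository may validate uploads differently (for example through UploadThing's own configuration).

```typescript
// Hypothetical helper: accept only the extensions named in the usage steps.
const SUPPORTED_EXTENSIONS: readonly string[] = [".png", ".jpg"];

function isSupportedImage(fileName: string): boolean {
  const lower = fileName.toLowerCase();
  return SUPPORTED_EXTENSIONS.some((ext) => lower.endsWith(ext));
}
```

For example, `isSupportedImage("Kitchen.PNG")` is true, while `isSupportedImage("notes.txt")` is false.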

Technology Stack

Frontend

Next.js 14: React framework with App Router
TypeScript: Type-safe development
Tailwind CSS: Utility-first styling
Shadcn/ui: Component library

AI/ML

Transformers.js: Hugging Face library for running ML models in JavaScript (browser or Node.js)
DETR ResNet-50: Object detection model
Pipeline API: Simplified model inference
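
To illustrate how these pieces fit together, here is a sketch of calling an object-detection pipeline. The `Detector` type and the `detectObjects` wrapper are assumptions made for illustration. The commented lines show how a detector would typically be obtained from Transformers.js, assuming the `@xenova/transformers` package; the repository may wire this up differently.

```typescript
// One detection as returned by an object-detection pipeline:
// a label, a confidence score in [0, 1], and a bounding box.
interface Detection {
  label: string;
  score: number;
  box: { xmin: number; ymin: number; xmax: number; ymax: number };
}

// Hypothetical abstraction: anything that maps an image URL to detections.
type Detector = (imageUrl: string) => Promise<Detection[]>;

// With Transformers.js, a detector would typically be created like this
// (assumes the @xenova/transformers package is installed):
//
//   import { pipeline } from "@xenova/transformers";
//   const pipe = await pipeline("object-detection", "Xenova/detr-resnet-50");
//   const detector: Detector = (url) => pipe(url) as Promise<Detection[]>;

// Run the detector and return detections sorted from most to least confident.
async function detectObjects(detector: Detector, imageUrl: string): Promise<Detection[]> {
  const detections = await detector(imageUrl);
  // Copy before sorting so the detector's own array is left untouched.
  return [...detections].sort((a, b) => b.score - a.score);
}
```

Passing the detector in as a parameter keeps the wrapper easy to exercise with a stub, without downloading the model.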

Backend/Infrastructure

UploadThing: File upload handling
AWS ECR: Docker image registry
AWS ECS: Container orchestration
Docker: Containerization

📁 Project Structure

Object_Detect_App/
├── app/
│   ├── api/
│   │   └── detect-objects/
│   │       └── route.ts          # Object detection API endpoint
│   ├── image-classification/
│   │   └── page.tsx              # Main upload/detection page
│   ├── styles/
│   │   └── global.css            # Global styles
│   └── page.tsx                  # Home page
├── components/
│   └── ui/                       # Reusable UI components
├── utils/
│   └── uploadthing.ts            # UploadThing configuration
├── public/
│   ├── diagram.png               # Architecture diagram
│   └── paperTexture.jpg          # Background texture
└── package.json

🔧 API Reference

POST /api/detect-objects

Detects objects in an uploaded image.

Request:

Content-Type: multipart/form-data
Body: FormData with files field

Response:

{
  "url": "https://uploadthing.com/...",
  "label": "{\"person\":2,\"chair\":4}"
}
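
Note that `label` is a JSON-encoded string rather than a nested object, so a client has to parse it a second time. The following sketch shows one way to decode the response shown above. The `DetectionResponse` type and the `parseLabelCounts` helper are illustrative names, not part of the repository.

```typescript
// Shape of the response body shown above.
interface DetectionResponse {
  url: string;
  label: string; // JSON-encoded map of object label -> count
}

// Hypothetical helper: decode the `label` string into a typed map,
// rejecting anything that is not a plain label -> number object.
function parseLabelCounts(response: DetectionResponse): Record<string, number> {
  const parsed: unknown = JSON.parse(response.label);
  if (typeof parsed !== "object" || parsed === null || Array.isArray(parsed)) {
    throw new Error("Expected `label` to encode an object");
  }
  const counts: Record<string, number> = {};
  for (const [name, value] of Object.entries(parsed as Record<string, unknown>)) {
    if (typeof value !== "number") {
      throw new Error(`Count for "${name}" is not a number`);
    }
    counts[name] = value;
  }
  return counts;
}

const exampleResponse: DetectionResponse = {
  url: "https://uploadthing.com/...",
  label: "{\"person\":2,\"chair\":4}",
};
const exampleCounts = parseLabelCounts(exampleResponse);
// exampleCounts.person is 2 and exampleCounts.chair is 4
```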

🎯 Key Features Explained

Confidence Filtering

Only objects detected with >80% confidence are included in results:

outPut.forEach(({ score, label }: any) => {
  if (score > 0.80) {
    // Count and display object
  }
});

Object Counting

Automatically aggregates multiple detections of the same object type:

if (countObj[label]) {
  countObj[label]++;
} else {
  countObj[label] = 1;
}
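
Put together, the two fragments above amount to a single pass over the model output. A self-contained sketch follows. The function name `countConfidentObjects`, the `Detection` type, and the `threshold` parameter are assumptions for illustration; the repository's actual route handler may be organized differently.

```typescript
interface Detection {
  label: string;
  score: number; // confidence in the range [0, 1]
}

// Count detections per label, keeping only those strictly above the threshold.
function countConfidentObjects(
  detections: Detection[],
  threshold = 0.8,
): Record<string, number> {
  const counts: Record<string, number> = {};
  for (const { label, score } of detections) {
    if (score > threshold) {
      counts[label] = (counts[label] ?? 0) + 1;
    }
  }
  return counts;
}

const sample: Detection[] = [
  { label: "person", score: 0.99 },
  { label: "person", score: 0.91 },
  { label: "chair", score: 0.85 },
  { label: "dog", score: 0.42 }, // dropped: below the threshold
];

console.log(JSON.stringify(countConfidentObjects(sample)));
// prints {"person":2,"chair":1}
```

Serializing the result with `JSON.stringify` yields the same kind of string that the API returns in its `label` field.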

🐳 Docker Deployment

Build and run the Docker container:

# Build image (the repository's file is named DockerFile, so pass it explicitly)
docker build -f DockerFile -t object-detect-app .

# Run container
docker run -p 3000:3000 object-detect-app

🌐 AWS Deployment

Push Docker image to ECR
Create ECS task definition
Deploy to ECS cluster
Configure load balancer and domain

📝 License

This project is open source and available under the MIT License.

🙏 Acknowledgments

Hugging Face for Transformers.js
UploadThing for file upload handling
Xenova for the DETR ResNet-50 model
Shadcn for UI components
