🧠 AI-Powered Medical Document Classifier

Hackathon Submission | Full-stack AI system that classifies medical documents in real-time using a fine-tuned Small Language Model (SLM), with live RAM monitoring and full PostgreSQL persistence.

📌 Features

4-Class Document Classification: Handwritten Prescription, Printed Prescription, Printed Lab Report, Medical Scans (X-Ray/MRI)
Adaptive Binarization: OpenCV-based preprocessing strips paper noise — the AI sees only ink strokes, drastically improving handwriting detection accuracy
Real-Time RAM Monitoring: Tracks memory footprint (MB) per classification using psutil
Full PostgreSQL Persistence: Every classification stores the filename, label, confidence, inference time, RAM usage, and the raw image binary in the database
History Dashboard: View the last 10 classifications with all metrics in the UI

🏗️ Architecture

PROJECTS/
├── backend/                  # FastAPI + PyTorch inference engine
│   ├── app/
│   │   ├── main.py           # API endpoints (/classify, /history)
│   │   ├── engine.py         # ML model + Adaptive Binarization preprocessing
│   │   ├── models.py         # SQLAlchemy ORM (PostgreSQL table)
│   │   └── database.py       # DB connection & session management
│   ├── train.py              # Model fine-tuning script
│   ├── download_data.py      # Training data downloader
│   ├── check_db.py           # Manual DB inspector tool
│   ├── requirements.txt
│   └── Dockerfile
├── frontend/                 # React + Vite dashboard
│   └── src/app/
│       ├── App.tsx           # Main app logic & state
│       └── components/
│           ├── AnalysisResults.tsx  # Classification result cards
│           └── HistoryTable.tsx     # Historical records table
├── docker-compose.yml        # One-command full stack deployment
└── README.md

🤖 Model Justification

Model: EfficientNetV2-S (Small) — a Convolutional Neural Network fine-tuned via transfer learning on a custom medical document dataset.

Why EfficientNetV2-S?

Memory Efficient: The quantized CPU version consumes ~300–500 MB RAM per inference, well within the constraints of a standard server. This is measured and displayed live on the dashboard via psutil.
Fast Inference: Achieves ~150–200ms latency on CPU — fast enough for real-time demo without a GPU.
Transfer Learning Ready: Pre-trained on ImageNet, it generalizes well to document classification with minimal data (~30 images per class).
Adaptive Binarization Pre-processing: Before the image reaches the model, OpenCV's Otsu thresholding converts it to a high-contrast black-and-white "ink map". This forces the model to focus purely on stroke patterns (jagged handwriting vs. uniform printed fonts), eliminating background noise and dramatically improving accuracy for handwritten documents.

Approximate Resource Usage:

Metric	Value
Model Size	~82 MB (weights file)
RAM per Inference	~330–500 MB (RSS)
Inference Latency	150–250 ms (CPU)
GPU Required	❌ No

⚙️ Local Setup

Prerequisites

Python 3.10+
Node.js 18+
PostgreSQL 15 running locally

1. Clone the Repository

git clone https://github.com/YOUR_USERNAME/evodoc.git
cd evodoc

2. Backend Setup

cd backend

# Create and activate virtual environment
python -m venv venv
venv\Scripts\activate      # Windows
# source venv/bin/activate  # Mac/Linux

# Install dependencies
pip install -r requirements.txt

# Configure environment
cp .env.example .env
# Edit .env and set your PostgreSQL password if different

# Run the server
python -m uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

3. Frontend Setup

cd frontend
npm install
npm run dev

Open http://localhost:5173 in your browser.

🐳 Docker Deployment (One Command)

Make sure Docker Desktop is running, then from the project root:

docker-compose up --build

This starts PostgreSQL, the FastAPI backend, and the React frontend automatically.

Service	URL
Frontend	http://localhost:5173
Backend API	http://localhost:8000
API Docs	http://localhost:8000/docs

🗄️ Database Verification

After classifying an image, verify it's stored in PostgreSQL:

cd backend
python check_db.py

Expected output:

============================================================
  🩺 EVODOC - PostgreSQL Database Inspector
============================================================

  ID    Filename                       Result                     Conf%    RAM(MB)    Image Size     Time
  -----------------------------------------------------------------------------------------------------------------------
  1     prescription.jpg               Handwritten Prescription   91.2%    334.1      215.6 KB       2026-03-29 13:45:22

📡 API Reference

`POST /classify`

Upload a medical document image for classification.

Request: multipart/form-data with file field

Response:

{
  "prediction": "Handwritten Prescription",
  "confidence": 0.912,
  "latency": 183,
  "ram_mb": 334.1
}

`GET /history`

Returns the last 10 classification records from PostgreSQL.

🔧 Environment Variables

Copy .env.example to .env and fill in your values:

DATABASE_URL=postgresql://postgres:YOUR_PASSWORD@localhost:5432/evodoc_db
FRONTEND_URL=http://localhost:5173

📦 Dependencies

Backend (`requirements.txt`)

Package	Purpose
`fastapi`	REST API framework
`uvicorn`	ASGI server
`torch` + `torchvision`	ML inference (CPU)
`opencv-python`	Adaptive Binarization preprocessing
`pillow`	Image loading
`sqlalchemy`	ORM for PostgreSQL
`psycopg2-binary`	PostgreSQL driver
`psutil`	RAM usage monitoring

Frontend

Package	Purpose
`react` + `vite`	UI framework
`typescript`	Type safety

👥 Team

Built for the FULL STACK Intern Hackathon — March 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 AI-Powered Medical Document Classifier

📌 Features

🏗️ Architecture

🤖 Model Justification

⚙️ Local Setup

Prerequisites

1. Clone the Repository

2. Backend Setup

3. Frontend Setup

🐳 Docker Deployment (One Command)

🗄️ Database Verification

📡 API Reference

`POST /classify`

`GET /history`

🔧 Environment Variables

📦 Dependencies

Backend (`requirements.txt`)

Frontend

👥 Team

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml

Folders and files

Latest commit

History

Repository files navigation

🧠 AI-Powered Medical Document Classifier

📌 Features

🏗️ Architecture

🤖 Model Justification

⚙️ Local Setup

Prerequisites

1. Clone the Repository

2. Backend Setup

3. Frontend Setup

🐳 Docker Deployment (One Command)

🗄️ Database Verification

📡 API Reference

POST /classify

GET /history

🔧 Environment Variables

📦 Dependencies

Backend (requirements.txt)

Frontend

👥 Team

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`POST /classify`

`GET /history`

Backend (`requirements.txt`)

Packages