Multimodal AI for dermatology — aligning skin lesion images with clinical symptom text
to support faster, more reliable diagnosis.
Multimodal AI · RAG · Edge Deployment | 4,010 training images | MSE 0.0025
Dermatology has a data problem. Clinicians rely on visual inspection and symptom history, but manual diagnosis is slow, inconsistent, and constrained by specialist availability. Most AI approaches treat image and text data in isolation — missing the richer signal that emerges when both are aligned.
HealthLens treats this as a business problem with a technical solution: build a production-ready diagnostic support system that reasons over both modalities simultaneously, delivers explainable outputs, and can scale without requiring constant human annotation.
| Training Images | Alignment MSE | Input Modalities | Pipeline |
|---|---|---|---|
| 4,010 | 0.0025 | Image + text (2-in-1) | End-to-end, deployed |
```
User Input ──► Image + symptom description
      │
      ▼
Preprocessing ──► Normalization + tokenization
      │
      ▼
ALIGN Encoders ──► Image vector + text vector (shared embedding space)
      │
      ▼
Cosine Similarity ──► Closest clinical match identified
      │
      ▼
RAG Retrieval ──► Clinical description fetched from Qdrant
      │
      ▼
Output ──► Disease label + confidence score + clinical context
```
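The matching step at the heart of this pipeline can be sketched as follows. The embeddings here are toy placeholders; in the real system both the query vector and the reference vectors come from the ALIGN encoders, and the label set is far larger.

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy reference embeddings standing in for ALIGN text vectors
# of clinical condition descriptions.
reference = {
    "eczema": np.array([0.9, 0.1, 0.0]),
    "psoriasis": np.array([0.1, 0.9, 0.2]),
}

# Toy query standing in for the ALIGN image vector of the user's photo.
query = np.array([0.85, 0.15, 0.05])

# The closest clinical match is the label with the highest similarity.
best = max(reference, key=lambda label: cosine_similarity(query, reference[label]))
print(best)  # → eczema
```

Because cosine similarity compares direction rather than magnitude, it is robust to differences in embedding scale between the image and text encoders.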
| Layer | Choice | Rationale |
|---|---|---|
| Multimodal model | kakaobrain/align-base | Shared embedding space for image + text |
| Similarity | Cosine similarity | Direction-invariant, fast at inference |
| Explainability | RAG + Qdrant | Clinically sourced descriptions, not black-box |
| Backend | FastAPI | Async, lightweight, production-ready |
| Frontend | Streamlit | Fast iteration for clinician-facing UI |
| Augmentation | Torchvision | Flips, color jitter, cropping on 4,010 images |
**What's the operational problem?**
Dermatology diagnosis is a bottleneck — time-intensive, specialist-gated, and difficult to audit. An AI-assisted pipeline reduces time-to-decision and creates a reproducible, auditable record.
**What does better look like?**
Higher precision on unseen samples, outputs a clinician can interrogate, and a system that improves as labeled data grows — without full retraining.
**Is this deployable?**
Yes. FastAPI + Qdrant (Docker) + Streamlit = a self-contained stack that runs locally or on cloud infrastructure with minimal configuration.
```
├── models/               # Trained .pth model checkpoints
├── data-info/            # Clinical disease descriptions (JSON)
├── scripts/              # Inference and utility logic
├── notebooks/            # EDA and training notebooks
├── docker-compose.yaml   # Qdrant vector DB setup
└── requirements.txt      # Dependencies
```