Med-Chat: Hybrid Medical Knowledge Graph with Vector Search

🏥 RAG-based Medical Chat Assistant leveraging Neo4j Knowledge Graph + Supabase pgvector for semantic search, powered by Groq LLM and HuggingFace embeddings.

🏗️ System Architecture

┌─────────────────────────────────────────────────────────────────┐
│                    FRONTEND (React + Vite)                      │
│              Port 5000 - Chat & Graph Visualization             │
│  - Chat Interface (RAG QA)                                      │
│  - Neo4j Graph Visualization (interactive force graph)          │
│  - Dark/Light Theme                                             │
└────────────────────────────┬──────────────────────────────────┘
                             │ REST API (JSON)
                             ↓
┌─────────────────────────────────────────────────────────────────┐
│              BACKEND (Node.js + Express)                        │
│                    Port 3001 - Hybrid Engine                    │
├──────────────────────────────────────────────────────────────────┤
│ 🧠 RAG Pipeline                                                 │
│  ├─ User Question                                               │
│  ├─ Neo4j Graph Query (Cypher)                                  │
│  ├─ Semantic Vector Search (if enabled)                         │
│  ├─ Context Enrichment                                          │
│  └─ Groq LLM Response Generation                                │
└─────┬────────────────────────────┬─────────────────────┬────────┘
      │                            │                     │
      ↓                            ↓                     ↓
  ┌─────────────┐        ┌──────────────────┐    ┌──────────────┐
  │  Neo4j DB   │        │  Supabase        │    │  Groq API    │
  │  (Graph)    │        │  pgvector        │    │  (LLM)       │
  │  Medical    │        │  (Semantic)      │    │              │
  │  Knowledge  │        │  Embeddings      │    │              │
  └─────────────┘        └──────────────────┘    └──────────────┘
                                │
                                ↓
                    ┌──────────────────────┐
                    │  HuggingFace API     │
                    │  (Free Embeddings)   │
                    │  all-MiniLM-L6-v2    │
                    └──────────────────────┘

📊 Tech Stack

Layer	Technology	Purpose
Frontend	React 18 + Vite + React Router	Chat UI + Graph Visualization
Graph Rendering	react-force-graph-2d	Interactive 2D force-directed graph
Backend API	Node.js + Express	REST endpoints & RAG engine
Graph DB	Neo4j 5.14	Medical knowledge graph
Vector DB	Supabase PostgreSQL + pgvector	Semantic search embeddings
LLM	Groq (openai/gpt-oss-120b)	Fast response generation
Embeddings	HuggingFace (all-MiniLM-L6-v2)	Free semantic embeddings (384-dim)
Styling	CSS3 + CSS Variables	Dark/Light themes

🚀 Quick Start

Prerequisites

Node.js 16+ installed
Neo4j cloud/local instance running
Supabase PostgreSQL instance (pgvector pre-enabled)
Groq API key
HuggingFace API token (free tier)

Terminal 1: Start Backend

cd backend
npm install
npm run dev    # Uses nodemon for auto-reload (server.new.js)
# Runs on http://localhost:3001

Terminal 2: Start Frontend

cd frontend
npm install
npm run dev
# Runs on http://localhost:5001 (or next available port)

Visit http://localhost:5001 in your browser

🔑 Environment Configuration

Create backend/.env:

# Neo4j
NEO4J_URI=neo4j+s://your-instance-uri.databases.neo4j.io
NEO4J_USER=neo4j
NEO4J_PASSWORD=your-password

# Supabase (PostgreSQL with pgvector)
SUPABASE_URL=https://your-project.supabase.co
SUPABASE_DB_URL=postgres://user:password@db.your-project.supabase.co:5432/postgres

# Groq LLM
GROQ_API_KEY=your-groq-api-key
GROQ_MODEL=openai/gpt-oss-120b

# HuggingFace (Free embeddings)
HUGGINGFACE_API_KEY=your-hf-api-token

# Feature Toggle
ENABLE_EMBEDDINGS=yes  # Set to 'no' to disable vector search

📚 Database Schema

Neo4j Medical Knowledge Graph

Nodes:
├─ Patient (patient_id, name, age, gender, blood_type, email, phone)
├─ Disease (disease_id, name, icd_code, description)
├─ Drug (drug_id, name, category, dosage, frequency)
├─ Symptom (symptom_id, name, severity)
├─ Allergen (allergen_id, name, reaction, severity)
└─ LabResult (lab_id, test_name, value, unit, date, status)

Relationships:
├─ Patient -[HAS_DISEASE]-> Disease
├─ Disease -[PRESENTS_WITH]-> Symptom
├─ Disease -[TREATED_BY]-> Drug
├─ Drug -[TREATS]-> Disease
├─ Patient -[CURRENTLY_TAKING]-> Drug
├─ Patient -[ALLERGIC_TO]-> Allergen
├─ Patient -[HAS_LAB_RESULT]-> LabResult
└─ Drug -[INTERACTS_WITH]-> Drug

Supabase Vector Table

CREATE TABLE IF NOT EXISTS embeddings (
  id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
  content TEXT NOT NULL,
  embedding vector(384),
  metadata JSONB,
  created_at TIMESTAMP DEFAULT NOW()
);
CREATE INDEX ON embeddings USING ivfflat (embedding vector_cosine_ops);

🔌 API Endpoints

Core Chat Endpoint

POST /api/chat
Content-Type: application/json

{
  "question": "What diseases does P001 have?",
  "role": "doctor",
  "user_id": "D001",
  "patient_id": "P001"
}

Response:
{
  "success": true,
  "answer": "Patient P001 (John Doe) has...",
  "source": "neo4j",
  "query_type": "patient_diseases",
  "patient_id": "P001",
  "records_retrieved": 2
}

Semantic Vector Search

POST /api/search
Content-Type: application/json

{
  "query": "medications for diabetes",
  "type": "drug",
  "limit": 5
}

Response (if ENABLE_EMBEDDINGS=yes):
{
  "success": true,
  "query": "medications for diabetes",
  "type": "drug",
  "results": [...],
  "count": 5
}

Graph Visualization Data

GET /api/graph

Response:
{
  "success": true,
  "graph": {
    "nodes": [...],      // 33 nodes (patients, diseases, drugs, etc)
    "links": [...]       // 41 relationships with types
  },
  "nodeTypes": {
    "Patient": 3,
    "Disease": 5,
    "Drug": 7,
    "Symptom": 8,
    "Allergen": 4,
    "LabResult": 6
  },
  "stats": {
    "totalNodes": 33,
    "totalEdges": 41
  }
}

Health Status

GET /api/health

Response:
{
  "status": "ok",
  "timestamp": "2026-01-21T...",
  "services": {
    "llm": { "provider": "groq", "model": "..." },
    "database": { "type": "neo4j", "connected": true },
    "embeddings": { "enabled": true, "provider": "huggingface" },
    "vector_db": { "type": "supabase/pgvector" }
  }
}

💾 Database Seeding

# Seed Neo4j with medical data (3 patients, 5 diseases, 7 drugs, etc)
cd backend && npm run seed:neo4j

# Seed Supabase with vector embeddings (if ENABLE_EMBEDDINGS=yes)
cd backend && npm run seed:vectors

# Seed both at once
cd backend && npm run seed:all

Sample data loaded:

3 Patients (P001-P003)
5 Diseases with ICD codes
7 Medications with dosages
8 Symptoms with severity
4 Allergens with reactions
6 Lab results with dates

🎨 Frontend Features

💬 Chat Interface

Real-time Q&A about patient medical records
Role-based access (Doctor/Patient)
Patient selection for doctors
Quick query buttons (Diseases, Medications, Lab Results, etc)
Dark/Light theme toggle
Server status indicator
Message history with metadata

🕸️ Graph Visualization Page (`/visualize`)

Interactive Force-Directed Graph
- Drag nodes to reposition
- Scroll to zoom in/out
- Click and drag to pan
- Auto-fit to view (press F)
- Keyboard shortcuts (+/- to zoom)
Node Filters
- Toggle visibility by type (Patient, Disease, Drug, Symptom, etc)
- Live count of each node type
- Color-coded by category
Relationship Legend
- HAS_DISEASE (red)
- CURRENTLY_TAKING (green)
- TREATS (light green)
- PRESENTS_WITH (yellow)
- ALLERGIC_TO (orange)
- HAS_LAB_RESULT (purple)
- And more...
Node Details Panel
- Click any node to see properties
- Shows all metadata
- Connection count
- Auto-zoom on selection

🔧 Feature Toggle: Enable/Disable Embeddings

Control vector search via environment variable:

ENABLE_EMBEDDINGS=yes   # Enables Supabase pgvector + HuggingFace embeddings
ENABLE_EMBEDDINGS=no    # Uses only Neo4j graph queries (faster)

When enabled:

/search endpoint performs semantic vector search
/treatments endpoint uses embedding similarity
Chat enriches context with vector results
Higher quality responses

When disabled:

/search returns error
/treatments returns error
Chat uses only graph data
Faster inference, no API costs

📈 Performance & Optimization

Metric	Value
Embedding Cache	Dynamic
LLM Model	Groq 120B parameters
LLM Latency	< 1s per response
Vector Search Latency	< 100ms
Graph Query Latency	< 500ms

🚢 Deployment

Render.com (Recommended for Free Tier)

Backend Deployment:

Connect GitHub repo to Render
Create Web Service (Node.js)
Set environment variables
Deploy from backend/ directory
Runs on free dyno (0.5 CPU, 512MB RAM)

Frontend Deployment:

Build: npm run build (creates dist/)
Deploy to Vercel, Netlify, or Render Static Site
Update API_URL to production backend URL

🎯 Use Cases

Clinical Decision Support - Query patient history instantly
Drug Interaction Checking - Find potential medication conflicts
Treatment Recommendations - Suggest therapies based on symptoms
Patient Education - Explain diseases in plain language
Data Exploration - Visualize complex patient relationships

📊 Graph Statistics

Total Nodes: 33
├─ Patients: 3
├─ Diseases: 5
├─ Drugs: 7
├─ Symptoms: 8
├─ Allergens: 4
└─ Lab Results: 6

Total Relationships: 41
├─ HAS_DISEASE: 6
├─ CURRENTLY_TAKING: 10
├─ TREATS: 7
├─ PRESENTS_WITH: 8
├─ ALLERGIC_TO: 4
└─ HAS_LAB_RESULT: 6

🔐 Security

Environment variables for sensitive credentials
Role-based access control (Doctor/Patient)
Patient ID isolation in queries
Input validation on all endpoints
CORS enabled for cross-origin requests
No sensitive data logged

📝 Example Queries

"What diseases does P001 have?"
"Show me P002's medications"
"What are P003's lab results?"
"List all allergies for P001"
"What symptoms does Type 2 Diabetes cause?"
"Which drugs treat Hypertension?"
"Show me P001's profile information"
"What are the drug interactions for P003?"

🛠️ Development

Backend File Structure (Modular Architecture)

backend/
├─ src/
│  ├─ server.new.js              # Entry point - Initialize DB & start server
│  ├─ app.js                     # Express app configuration with middleware
│  │
│  ├─ config/
│  │  └─ index.js                # Centralized config from environment variables
│  │
│  ├─ db/                        # Database Layer
│  │  ├─ index.js                # Exports neo4j & supabase modules
│  │  ├─ neo4j/
│  │  │  ├─ driver.js            # Connection, verifyConnectivity, runQuery
│  │  │  ├─ queries.js           # All Cypher queries
│  │  │  └─ index.js             # Re-exports driver + queries
│  │  └─ supabase/
│  │     ├─ driver.js            # pgvector connection & operations
│  │     └─ index.js             # Re-exports
│  │
│  ├─ services/                  # Business Logic Layer
│  │  ├─ index.js                # Exports all services
│  │  ├─ chat.service.js         # RAG pipeline orchestration
│  │  ├─ llm.service.js          # Groq LLM API wrapper
│  │  ├─ entity.service.js       # Extract patient ID & query type
│  │  ├─ embedding.service.js    # HuggingFace embeddings with caching
│  │  └─ vector.service.js       # Semantic search + Neo4j enrichment
│  │
│  ├─ api/                       # HTTP Layer
│  │  ├─ index.js                # Exports routes & middleware
│  │  ├─ controllers/            # HTTP request handlers
│  │  │  ├─ chat.controller.js
│  │  │  ├─ graph.controller.js
│  │  │  ├─ search.controller.js
│  │  │  ├─ health.controller.js
│  │  │  └─ index.js
│  │  ├─ routes/                 # Express route definitions
│  │  │  ├─ index.js             # Aggregates all routes
│  │  │  ├─ chat.routes.js
│  │  │  ├─ graph.routes.js
│  │  │  ├─ search.routes.js
│  │  │  └─ health.routes.js
│  │  └─ middleware/             # Request processing chain
│  │     ├─ index.js             # Exports all middleware
│  │     ├─ error.middleware.js  # Global error handler
│  │     ├─ validation.middleware.js # Input validation
│  │     └─ logger.middleware.js # Request logging
│  │
│  └─ scripts/                   # Database seeding utilities
│     ├─ seed-neo4j.js           # Populate with test data
│     └─ seed-vectors.js         # Generate embeddings
│
├─ .env                          # Environment variables
└─ package.json                  # Dependencies & scripts

Frontend File Structure (Modular Architecture)

frontend/
├─ src/
│  ├─ main.jsx                  # Entry point - renders App
│  ├─ App.jsx                   # Root component
│  ├─ index.css                 # Global styles
│  │
│  ├─ api/                      # API Client Layer
│  │  ├─ index.js               # Central API exports
│  │  ├─ config.js              # API base URL & request wrapper
│  │  ├─ chat.api.js            # sendChatMessage, getChatHistory
│  │  ├─ graph.api.js           # getGraphData, getGraphStats
│  │  └─ health.api.js          # checkServerHealth, getServerStatus
│  │
│  ├─ hooks/                    # Custom React Hooks (State Management)
│  │  ├─ index.js               # Central hooks exports
│  │  ├─ useChat.js             # Chat state: messages, loading, error
│  │  ├─ useTheme.js            # Theme state: darkMode, toggleTheme
│  │  ├─ useAuth.js             # Auth state: role, userId, patientId
│  │  └─ useServerStatus.js     # Server status polling every 10s
│  │
│  ├─ components/               # Reusable UI Components
│  │  ├─ index.js               # Central component exports
│  │  ├─ GraphVisualization.jsx # Knowledge graph visualization
│  │  ├─ chat/                  # Chat-specific components
│  │  │  ├─ ChatMessage.jsx     # Display user/AI messages
│  │  │  ├─ ChatInput.jsx       # Input form with send button
│  │  │  ├─ LoadingMessage.jsx  # Loading state indicator
│  │  │  ├─ QuickQueries.jsx    # Quick query buttons
│  │  │  └─ index.js
│  │  ├─ common/                # Reusable utility components
│  │  │  ├─ ThemeToggle.jsx     # Dark/light mode toggle
│  │  │  ├─ StatusIndicator.jsx # Server status badge
│  │  │  └─ index.js
│  │  └─ layout/                # Page layout components
│  │     ├─ Sidebar.jsx         # Navigation sidebar
│  │     ├─ ChatHeader.jsx      # Chat page header
│  │     └─ index.js
│  │
│  ├─ pages/                    # Page-level Components
│  │  ├─ index.js               # Central page exports
│  │  ├─ ChatPage.jsx           # Main chat interface
│  │  └─ VisualizePage.jsx      # Graph visualization page
│  │
│  └─ utils/                    # Constants & Utilities
│     ├─ constants.js           # PATIENTS, ROLES, QUERY_TYPES, COLORS
│     └─ index.js
│
├─ index.html                   # HTML template
├─ vite.config.js               # Vite configuration
├─ .env                         # Environment variables
└─ package.json                 # Dependencies & scripts

Data Flow Architecture

┌─────────────────────────────────────────────────────────────────┐
│                      FRONTEND                                    │
├─────────────────────────────────────────────────────────────────┤
│  Pages (ChatPage, VisualizePage)                                │
│     ↓ uses                                                      │
│  Hooks (useChat, useAuth, useTheme, useServerStatus)            │
│     ↓ calls                                                     │
│  API Layer (chat.api, graph.api, health.api)                    │
│     ↓ HTTP requests                                             │
└─────────────────────────────────────────────────────────────────┘
                            │
                            ↓
┌─────────────────────────────────────────────────────────────────┐
│                      BACKEND                                     │
├─────────────────────────────────────────────────────────────────┤
│  Routes (chat, graph, search, health)                           │
│     ↓ delegates to                                              │
│  Controllers (handle HTTP req/res)                              │
│     ↓ calls                                                     │
│  Services (business logic)                                      │
│     ├─ chat.service (RAG orchestration)                         │
│     ├─ llm.service (Groq API)                                   │
│     ├─ entity.service (extraction)                              │
│     ├─ embedding.service (HuggingFace)                          │
│     └─ vector.service (semantic search)                         │
│           ↓ uses                                                │
│  DB Layer (neo4j, supabase)                                     │
│     ↓ queries                                                   │
└─────────────────────────────────────────────────────────────────┘
                            │
              ┌─────────────┴─────────────┐
              ↓                           ↓
     ┌─────────────┐            ┌──────────────────┐
     │   Neo4j     │            │    Supabase      │
     │   Graph DB  │            │    pgvector      │
     └─────────────┘            └──────────────────┘

🐛 Troubleshooting

Graph not loading?

Check /health endpoint
Verify Neo4j connection
Check firewall rules

Embeddings disabled?

Check .env file: ENABLE_EMBEDDINGS=yes
Verify Supabase credentials
Check HuggingFace API key

LLM slow response?

Default is Groq free tier (limited RPS)
Check network connectivity
Verify API key validity

📄 License

MIT - Open source for educational purposes

Project Structure

med-chat/
├── backend/                 # Independent Backend Stack
│   ├── src/
│   │   ├── server.js        # Express API (Port 3001)
│   │   ├── chat.js          # RAG handler
│   │   ├── neo4j-driver.js  # Neo4j connection
│   │   ├── graph-queries.js # Cypher queries
│   │   ├── entity-extractor.js
│   │   ├── llm.js           # Groq LLM
│   │   └── seed-neo4j.js    # Test data
│   ├── .env                 # Neo4j + Groq credentials
│   ├── package.json         # Backend dependencies
│   └── README.md
│
├── frontend/                # Independent Frontend Stack
│   ├── index.html
│   ├── css/style.css
│   ├── js/app.js
│   ├── package.json         # Frontend dependencies
│   └── README.md
│
└── README.md (this file)

Test Patients

ID	Name	Conditions
P001	John Doe	Type 2 Diabetes, Hyperlipidemia
P002	Jane Smith	Hypertension, Asthma
P003	Robert Johnson	Diabetes, Hypertension, Arthritis

API Endpoints

POST /chat

{
  "question": "What diseases does P001 have?",
  "role": "doctor",
  "user_id": "D001",
  "patient_id": "P001"
}

Response:

{
  "success": true,
  "answer": "Patient P001 (John Doe) has Type 2 Diabetes and Hyperlipidemia...",
  "source": "neo4j",
  "query_type": "diseases",
  "patient_id": "P001",
  "records_retrieved": 2
}

GET /health

Health check endpoint

Usage

Authenticate - Select role (Doctor/Patient) and enter ID
Query - Ask natural language questions like:
- "What diseases does P001 have?"
- "What medications is P002 taking?"
- "Show me P003's lab results"
Get Answer - Groq LLM generates natural language response from Neo4j graph data

Graph Schema

(Patient)
    ├──HAS_DISEASE──>(Disease)
    │                   ├──PRESENTS_WITH──>(Symptom)
    │                   └──TREATED_BY──>(Drug)
    │
    ├──CURRENTLY_TAKING──>(Drug)
    ├──ALLERGIC_TO──>(Allergen)
    └──HAS_LAB_RESULT──>(LabResult)

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

Med-Chat: Hybrid Medical Knowledge Graph with Vector Search

🏗️ System Architecture

📊 Tech Stack

🚀 Quick Start

Prerequisites

Terminal 1: Start Backend

Terminal 2: Start Frontend

🔑 Environment Configuration

📚 Database Schema

Neo4j Medical Knowledge Graph

Supabase Vector Table

🔌 API Endpoints

Core Chat Endpoint

Semantic Vector Search

Graph Visualization Data

Health Status

💾 Database Seeding

🎨 Frontend Features

💬 Chat Interface

🕸️ Graph Visualization Page (/visualize)

🔧 Feature Toggle: Enable/Disable Embeddings

📈 Performance & Optimization

🚢 Deployment

Render.com (Recommended for Free Tier)

🎯 Use Cases

📊 Graph Statistics

🔐 Security

📝 Example Queries

🛠️ Development

Backend File Structure (Modular Architecture)

Frontend File Structure (Modular Architecture)

Data Flow Architecture

🐛 Troubleshooting

📄 License

Project Structure

Test Patients

API Endpoints

POST /chat

GET /health

Usage

Graph Schema

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

🕸️ Graph Visualization Page (`/visualize`)

Packages