WEPR - Hallucination Detection System

A research implementation of Entropy Production Rate (EPR) for detecting hallucinations in Large Language Models (LLMs). This tool visualizes the uncertainty of the model at the token level, helping to identify potential fabrication or reasoning errors.

🚀 Features

Real-time Entropy Visualization: See the model's uncertainty for every generated token.
Interactive Charts: Explore the entropy flow and token probability distribution.
Risk Scoring: Automatic classification of tokens as Low, Medium, or High risk.
Generation Control: Adjust Temperature, Top-K, and Seed to experiment with model behavior.
Modular Architecture: Clean separation between the Flask backend and the core algorithmic logic.

🛠️ Prerequisites

Python 3.8+
Node.js 16+ (for the frontend)
Ollama (running locally)

📦 Installation & Setup

1. Setup Ollama (The LLM Engine)

Download and install Ollama from ollama.com.
Start the Ollama server (usually runs in the background).
Pull a model (i use cas/ministral-8b-instruct-2410_q4km):
```
ollama pull cas/ministral-8b-instruct-2410_q4km
```
Note: You can configure available models in models_config.py.

2. Setup Backend (Python/Flask)

Clone the repository and navigate to the root:
```
cd wepr
```
Install Python dependencies:
```
pip install flask flask-cors requests
```
Start the backend server:
```
python3 app.py
```
The server will start on http://127.0.0.1:5001.

3. Setup Frontend (React/Vite)

Navigate to the frontend directory:
```
cd frontend
```
Install dependencies:
```
npm install
```
Start the development server:
```
npm run dev
```
The app will open at http://localhost:5173.

🖥️ Usage

Open your browser at http://localhost:5173.
Select a model from the dropdown (top right).
(Optional) Click the Settings icon to adjust Temperature or Top-K.
Type a prompt and hit Enter.
Analyze the results:
- Green tokens: The model is confident.
- Red tokens: The model is uncertain (High Entropy).
- Click on any token to see the Top-20 candidates and their probabilities.

📂 Project Structure

app.py: The Flask API entry point.
core_epr/: Contains the core logic.
- client.py: Handles communication with Ollama.
- entropy.py: Mathematical functions (Shannon Entropy, Normalization).
- processing.py: Data pipeline and risk calculation.
frontend/: The React application.

📚 References

Based on the research paper: "Entropy-based Hallucination Detection" (arXiv:2509.04492).

Note: If you are here Charles, I hope it's clear for you!

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
core_epr		core_epr
core_wepr		core_wepr
frontend		frontend
2509.04492v1.pdf		2509.04492v1.pdf
README.md		README.md
app.py		app.py
models_config.py		models_config.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WEPR - Hallucination Detection System

🚀 Features

🛠️ Prerequisites

📦 Installation & Setup

1. Setup Ollama (The LLM Engine)

2. Setup Backend (Python/Flask)

3. Setup Frontend (React/Vite)

🖥️ Usage

📂 Project Structure

📚 References

About

Uh oh!

Releases

Packages

Languages

ramosleandre/wepr

Folders and files

Latest commit

History

Repository files navigation

WEPR - Hallucination Detection System

🚀 Features

🛠️ Prerequisites

📦 Installation & Setup

1. Setup Ollama (The LLM Engine)

2. Setup Backend (Python/Flask)

3. Setup Frontend (React/Vite)

🖥️ Usage

📂 Project Structure

📚 References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages