Loci

Your personal knowledge map. Loci transforms any document into a visual, explorable knowledge space with automatic flashcard generation and spaced repetition for long-term retention.


What is Loci?

Loci is a learning system that:

  • Ingests anything: PDFs, markdown, text files, handwritten notes (photos), audio transcriptions, images
  • Extracts knowledge: Automatically identifies concepts and their relationships
  • Visualizes spatially: Displays your knowledge as an interactive honeycomb map where related concepts cluster together
  • Strengthens memory: Generates flashcards and schedules reviews using spaced repetition (FSRS algorithm)

Features

Visual Knowledge Map

Navigate your knowledge like a map. Zoom from high-level categories down to individual concepts. Related ideas appear close together thanks to semantic clustering.


Universal File Ingestion

Drop in any file type:

  • Documents (PDF, Markdown, TXT)
  • Handwritten notes (photos processed via LLM vision)
  • Images with text or diagrams
  • Audio transcriptions


Automatic Flashcard Generation

Each concept generates smart flashcards using two formats:

  • Cloze deletions (easy/medium): Fill-in-the-blank with context cues
  • Q&A (hard): Direct questions for deeper active recall

Spaced Repetition (FSRS)

Reviews are scheduled at the optimal moment—just before you'd forget. The system tracks:

  • Stability: How long until the probability of recall drops to 90%
  • Difficulty: How hard the material is for you
  • Retrievability: Current probability of recall
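
To make these three numbers concrete, here is a minimal sketch of a power forgetting curve in the style of FSRS-4.5, where stability is defined as the interval at which recall probability has fallen to 90%. The constants and function are illustrative, not lifted from Loci's code:

```python
# Sketch of an FSRS-4.5-style forgetting curve (illustrative, not Loci's code).
# Retrievability R(t) decays with elapsed days t; stability S is defined
# so that R(S) = 0.9, i.e. recall probability is 90% after S days.

DECAY = -0.5
FACTOR = 19 / 81  # chosen so that R(S) = 0.9 exactly

def retrievability(elapsed_days: float, stability: float) -> float:
    """Probability of recall after `elapsed_days`, given stability in days."""
    return (1 + FACTOR * elapsed_days / stability) ** DECAY

print(retrievability(10, 10))  # 0.9  -- by definition of stability
print(retrievability(30, 10))  # ~0.77 -- memory has decayed further
```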

Semantic Search

Find any concept instantly. Search queries match semantically, not just by keywords.
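
Since the tech stack below lists SQLite with the sqlite-vec extension, a nearest-neighbour query over concept embeddings might look roughly like the sketch below; the table name and schema are invented for illustration.

```python
import sqlite3
import sqlite_vec  # https://github.com/asg017/sqlite-vec
from sqlite_vec import serialize_float32

db = sqlite3.connect("loci.db")
db.enable_load_extension(True)
sqlite_vec.load(db)
db.enable_load_extension(False)

# Hypothetical schema: one row per concept, embedding dimension 1536
# (the size of text-embedding-3-small vectors).
db.execute(
    "CREATE VIRTUAL TABLE IF NOT EXISTS vec_concepts USING vec0(embedding float[1536])"
)

def search(query_embedding: list[float], k: int = 5):
    # KNN query: rows come back ordered by vector distance.
    return db.execute(
        "SELECT rowid, distance FROM vec_concepts "
        "WHERE embedding MATCH ? AND k = ? ORDER BY distance",
        (serialize_float32(query_embedding), k),
    ).fetchall()
```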



How It Works

1. Document Processing

Files are parsed and split into meaningful chunks using LangChain text splitters. Images and handwritten notes are processed through LLM vision capabilities.
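
As a sketch of this step, chunking with a LangChain splitter typically looks like the following; the splitter class is real, but the size and overlap values are assumptions rather than Loci's actual settings.

```python
from langchain_text_splitters import RecursiveCharacterTextSplitter

# Illustrative parameters; Loci's actual chunk size/overlap may differ.
splitter = RecursiveCharacterTextSplitter(
    chunk_size=1000,    # target characters per chunk
    chunk_overlap=100,  # overlap preserves context across chunk boundaries
)

with open("notes.md") as f:
    chunks = splitter.split_text(f.read())

print(f"{len(chunks)} chunks")
```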

2. Concept Extraction

An LLM analyzes each chunk to extract:

  • A clear title
  • A concise summary
  • The core content
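
One plausible way to implement this is with LangChain's structured-output support; the schema and prompt below are illustrative, not Loci's actual code.

```python
from pydantic import BaseModel, Field
from langchain_openai import ChatOpenAI

class Concept(BaseModel):
    """Fields the LLM must fill in for each chunk (illustrative schema)."""
    title: str = Field(description="A clear, short title")
    summary: str = Field(description="A concise one-paragraph summary")
    content: str = Field(description="The core content, cleaned up")

chunk = (
    "UMAP projects high-dimensional vectors to 2D while preserving "
    "neighborhood structure, which makes it useful for visualizing embeddings."
)

llm = ChatOpenAI(model="gpt-4.1-mini").with_structured_output(Concept)
concept = llm.invoke(f"Extract the key concept from this passage:\n\n{chunk}")
print(concept.title)
```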

3. Embedding Generation

Each concept is converted to a high-dimensional vector (embedding) that captures its semantic meaning. Similar concepts have similar embeddings.
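
For example, with the OpenAI embedding model named in the stack (a sketch; error handling omitted):

```python
import math
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

texts = ["spaced repetition", "review scheduling", "hexagonal grids"]
resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
vectors = [d.embedding for d in resp.data]  # one 1536-dim vector per text

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity: closer to 1.0 for semantically similar texts."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

print(cosine(vectors[0], vectors[1]))  # higher: related concepts
print(cosine(vectors[0], vectors[2]))  # lower: unrelated concepts
```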

4. Dimensionality Reduction (UMAP)

Embeddings are projected from high-dimensional space to 2D coordinates using UMAP, preserving semantic relationships. This creates the spatial layout where similar concepts appear near each other.
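
With the umap-learn library this projection is a few lines; the parameters below are common defaults, not Loci's tuned values.

```python
import numpy as np
import umap  # pip install umap-learn

# Stand-in for real concept vectors (200 concepts, 1536 dims each).
embeddings = np.random.rand(200, 1536).astype(np.float32)

# Cosine metric suits text embeddings; n_neighbors trades off
# local detail against global structure.
reducer = umap.UMAP(n_components=2, metric="cosine", n_neighbors=15, random_state=42)
coords_2d = reducer.fit_transform(embeddings)  # shape (200, 2)
```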

5. Clustering (HDBSCAN)

Concepts are grouped into clusters using HDBSCAN, a density-based algorithm that finds natural groupings without requiring a predefined number of clusters.
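
Continuing the `coords_2d` array from the UMAP sketch above, clustering with HDBSCAN might look like this (`min_cluster_size` is an assumption):

```python
import hdbscan  # pip install hdbscan

clusterer = hdbscan.HDBSCAN(min_cluster_size=5)
labels = clusterer.fit_predict(coords_2d)  # -1 marks noise points outside any cluster

# No need to choose the number of clusters up front:
n_clusters = len(set(labels)) - (1 if -1 in labels else 0)
print(f"{n_clusters} clusters found")
```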

6. Honeycomb Visualization

The 2D coordinates are rendered as an interactive honeycomb grid. Each hexagon represents a concept. Opacity reflects learning progress—mastered concepts fade while due reviews stand out.
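
The frontend handles this with the Honeycomb Grid library; purely as a conceptual sketch, snapping a free 2D point to its nearest hexagon can be done with standard axial-coordinate rounding:

```python
import math

def point_to_hex(x: float, y: float, size: float = 1.0) -> tuple[int, int]:
    """Snap a 2D point to the nearest pointy-top hexagon (axial coords)."""
    q = (math.sqrt(3) / 3 * x - y / 3) / size
    r = (2 / 3 * y) / size
    # Cube rounding: round each cube coordinate, then fix the largest error.
    s = -q - r
    rq, rr, rs = round(q), round(r), round(s)
    if abs(rq - q) > abs(rr - r) and abs(rq - q) > abs(rs - s):
        rq = -rr - rs
    elif abs(rr - r) > abs(rs - s):
        rr = -rq - rs
    return rq, rr
```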

7. Flashcard Generation

For each concept, the LLM generates flashcards:

  • Cloze deletions provide context cues for easier learning
  • Q&A format tests deeper recall for challenging material
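
A hedged sketch of this generation step, again using structured output; the schema and field names are assumptions, not Loci's actual code.

```python
from pydantic import BaseModel, Field
from langchain_openai import ChatOpenAI

class QA(BaseModel):
    question: str
    answer: str

class Flashcards(BaseModel):
    """Cards for one concept (illustrative schema)."""
    clozes: list[str] = Field(description="Fill-in-the-blank sentences, gap marked {{...}}")
    qa_cards: list[QA] = Field(description="Direct questions for harder recall")

llm = ChatOpenAI(model="gpt-4.1-mini").with_structured_output(Flashcards)
cards = llm.invoke(
    "Write 2 cloze deletions and 1 Q&A flashcard for this concept:\n"
    "UMAP projects high-dimensional embeddings to 2D while preserving neighborhoods."
)
```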

8. Spaced Repetition (FSRS)

The FSRS algorithm schedules reviews based on your performance:

  • Again: Reset, review soon
  • Hard: Small interval increase
  • Good: Normal interval increase
  • Easy: Large interval increase, decrease difficulty
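
The exact interval updates come from FSRS's fitted model weights; purely to illustrate the shape of the rating logic, a toy scheduler might look like this (the multipliers are invented, not FSRS parameters):

```python
from datetime import datetime, timedelta

# Rating scale matches the API below: 1=Again, 2=Hard, 3=Good, 4=Easy.
AGAIN, HARD, GOOD, EASY = 1, 2, 3, 4

def next_interval(stability_days: float, rating: int) -> float:
    """Toy interval update; real FSRS derives these factors from fitted weights."""
    if rating == AGAIN:
        return 0.5  # relearn: review again within a day
    factor = {HARD: 1.2, GOOD: 2.5, EASY: 4.0}[rating]
    return stability_days * factor

due = datetime.now() + timedelta(days=next_interval(10, GOOD))
print(due)
```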

Getting Started

Prerequisites

  • Python 3.11+
  • Node.js 18+ with pnpm
  • Either:
    • OpenAI (recommended): gpt-4.1-mini + text-embedding-3-small
    • Ollama (local): ministral-3:8b + embeddinggemma

Quick Start

```bash
# Clone and enter the project
git clone <repo-url>
cd loci

# Run the start script
./start.sh
```

The script will guide you through setup and start both servers.

Manual Setup

Backend

```bash
cd backend

# Install dependencies with uv
uv sync

# Set your OpenAI API key
export OPENAI_API_KEY="sk-..."

# Or for local mode with Ollama:
# 1. Install Ollama: https://ollama.ai
# 2. Pull models: ollama pull ministral-3:8b && ollama pull embeddinggemma

# Start the server
uv run python main.py
```

Backend runs at http://localhost:4224

Frontend

```bash
cd frontend

# Install dependencies
pnpm install

# Start dev server
pnpm dev
```

Frontend runs at http://localhost:3000


API Endpoints

Documents

Endpoint         Method  Description
/upload          POST    Upload and process a file
/concepts        GET     List all concepts
/concepts/{id}   GET     Get concept details
/search          GET     Semantic search

Reviews

Endpoint           Method  Description
/review/due        GET     Get flashcards due for review
/review/due/count  GET     Count of due cards
/review/submit     POST    Submit review with rating (1-4)
/review/grade      POST    Grade answer using LLM
/review/stats      GET     Review statistics
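
For example, a review round-trip against the local backend might look like this with requests; the JSON field names are guesses based on the endpoint names, so check the FastAPI docs at /docs for the real payload shapes.

```python
import requests

BASE = "http://localhost:4224"

# Fetch cards that are due today.
due = requests.get(f"{BASE}/review/due").json()

# Submit a rating for the first card (1=Again, 2=Hard, 3=Good, 4=Easy).
# The payload shape is an assumption, not taken from Loci's source.
if due:
    requests.post(f"{BASE}/review/submit", json={"card_id": due[0]["id"], "rating": 3})
```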

Tech Stack

Backend: FastAPI, LangChain, SQLite + sqlite-vec, UMAP, HDBSCAN

Frontend: Nuxt 4, Vue 3, Honeycomb Grid, D3, Panzoom

AI Models:

  • OpenAI: gpt-4.1-mini (VLM) + text-embedding-3-small (embeddings)
  • Ollama: ministral-3:8b (VLM) + embeddinggemma (embeddings)

Algorithm: FSRS (Free Spaced Repetition Scheduler)


License

MIT
