Visualize how transformers think, one attention head at a time.
An interactive, educational web simulation that lets you see inside a transformer model, from tokenization to next-token prediction. No black boxes. No abstract equations. Just a live, explorable pipeline modeled after GPT-2 Small.
Built for students, educators, and anyone curious about how Large Language Models actually work under the hood.
| Feature | Description |
|---|---|
| Token Embedding | Watch input text split into tokens and map to embedding vectors with positional encoding (see the sketch below) |
| Q/K/V Inspector | Examine Query, Key, and Value projections for each attention head |
| Attention Heatmap | Interactive matrix with causal masking; see which tokens attend to which |
| Probability Distribution | Real-time softmax output showing candidate tokens and their probabilities |
| Autoregressive Generation | Generate tokens step by step and observe how each new token reshapes attention |
| Sampling Controls | Tune Temperature (0.3-1.5), Top-k (1-12), and generation length live |
| Multi-Head View | Switch between attention heads to compare learned patterns |
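The Token Embedding step maps each token to a dense vector and adds a positional encoding. A minimal vanilla-JS sketch of that idea (the function names and hashing scheme here are illustrative, not the actual code in app.js):

```js
// Illustrative sketch: hashed token embedding + sinusoidal positional encoding.
// hiddenSize matches the simulation's hidden size of 16.
const hiddenSize = 16;

// Hash a token string into a repeatable pseudo-random embedding vector.
function embedToken(token) {
  let seed = 0;
  for (const ch of token) seed = (seed * 31 + ch.codePointAt(0)) >>> 0;
  return Array.from({ length: hiddenSize }, () => {
    seed = (seed * 1103515245 + 12345) >>> 0; // simple linear congruential step
    return (seed / 0xffffffff) * 2 - 1;       // map to [-1, 1]
  });
}

// Classic sinusoidal positional encoding, added element-wise to the embedding.
function positionalEncoding(pos) {
  return Array.from({ length: hiddenSize }, (_, i) => {
    const angle = pos / Math.pow(10000, (2 * Math.floor(i / 2)) / hiddenSize);
    return i % 2 === 0 ? Math.sin(angle) : Math.cos(angle);
  });
}

function embedWithPosition(token, pos) {
  const emb = embedToken(token);
  const pe = positionalEncoding(pos);
  return emb.map((v, i) => v + pe[i]);
}
```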
**GPT-2 Small (Simulation)**

| Parameter | Value |
|---|---|
| Layers | 12 |
| Attention Heads | 4 (visualized) |
| Hidden Size | 16 |
| Head Dim | 4 |
| Causal Mask | Active |
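In code, the whole spec fits in one small configuration object; a hypothetical example mirroring the table above (field names are illustrative):

```js
// Hypothetical configuration mirroring the spec table above.
const MODEL_CONFIG = {
  name: "GPT-2 Small (Simulation)",
  layers: 12,        // transformer blocks
  numHeads: 4,       // attention heads shown in the visualization
  hiddenSize: 16,    // embedding / hidden dimension
  headDim: 4,        // hiddenSize / numHeads
  causalMask: true,  // tokens may only attend to earlier positions
};
```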
| Technology | Purpose |
|---|---|
| HTML5 | Semantic layout & SVG attention diagrams |
| CSS3 | Custom properties, Grid, Flexbox |
| Vanilla JS | Simulation engine, zero dependencies |
| Google Fonts | Orbitron, Space Grotesk, IBM Plex Mono |
| Vercel | Static hosting & CDN |
No frameworks. No build step. No bundler. Just clean, dependency-free code.
```
simulasiLLM/
├── index.html    # UI layout: toolbar, attention SVG, token stream, panels
├── style.css     # Styling with CSS custom properties
├── app.js        # Engine: tokenizer, attention math, rendering, generation
├── DEPLOY.md     # Deployment guide (Vercel, Cloudflare, Netlify, GH Pages)
└── .github/
    └── workflows/    # CI/CD configuration
```
No install required; just serve the static files:

```bash
git clone https://github.com/romizone/simulasiLLM.git
cd simulasiLLM
python3 -m http.server 8081
```

Open http://127.0.0.1:8081 in your browser.

Alternatively:

```bash
# Node.js
npx serve .

# PHP
php -S localhost:8081
```

See DEPLOY.md for guides on Vercel, Cloudflare Pages, Netlify, and GitHub Pages.
```
Input Text
       │
       ▼
┌─────────────┐
│  Tokenizer  │  Split text into tokens (Unicode-aware regex)
└──────┬──────┘
       │
       ▼
┌─────────────┐
│  Embedding  │  Map tokens → dense vectors + positional encoding
└──────┬──────┘
       │
       ▼
┌─────────────┐
│    Q/K/V    │  Project embeddings into Query, Key, Value spaces
│  Projection │
└──────┬──────┘
       │
       ▼
┌─────────────┐
│  Attention  │  score = (Q · Kᵀ) / √d_k → causal mask → softmax
│   Matrix    │
└──────┬──────┘
       │
       ▼
┌─────────────┐
│  Sampling   │  Apply temperature & top-k filtering
└──────┬──────┘
       │
       ▼
Next Token ──▶ (autoregressive loop)
```
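The first box in the diagram splits the input with a Unicode-aware regex. A tiny illustrative version of that idea (the exact pattern used in app.js may differ):

```js
// Illustrative Unicode-aware tokenizer: runs of letters, runs of digits,
// or single punctuation marks. The 'u' flag enables \p{...} property escapes.
function tokenize(text) {
  return text.match(/\p{L}+|\p{N}+|[^\s\p{L}\p{N}]/gu) ?? [];
}

console.log(tokenize("Transformers attend to context."));
// ["Transformers", "attend", "to", "context", "."]
```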
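The attention box is standard scaled dot-product attention. A self-contained sketch of that math in plain JavaScript, assuming Q and K are arrays of per-token vectors (function names are illustrative):

```js
// Scaled dot-product attention weights with causal masking and row-wise softmax.
// Q and K are arrays of per-token vectors; headDim is the vector length (d_k).
function attentionWeights(Q, K, headDim) {
  const scale = 1 / Math.sqrt(headDim);
  return Q.map((q, i) => {
    // score_ij = (q_i · k_j) / sqrt(d_k); the causal mask blocks j > i
    const scores = K.map((k, j) =>
      j <= i ? scale * q.reduce((s, v, d) => s + v * k[d], 0) : -Infinity
    );
    // numerically stable softmax over each row
    const max = Math.max(...scores);
    const exps = scores.map((s) => Math.exp(s - max));
    const sum = exps.reduce((a, b) => a + b, 0);
    return exps.map((e) => e / sum);
  });
}

// Example: 3 tokens with headDim = 4; each row is lower-triangular and sums to 1.
const Q = [[1, 0, 0, 0], [0, 1, 0, 0], [0.5, 0.5, 0, 0]];
const K = [[1, 0, 0, 0], [0, 1, 0, 0], [1, 1, 0, 0]];
console.log(attentionWeights(Q, K, 4));
```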
This simulation is accompanied by the paper "Simulating the Attention Mechanism in Large Language Models Based on the GPT-2 Architecture" (see the Research Paper link below for the full text).
Topics covered:
- Token processing pipeline (BPE tokenization, embedding, positional encoding)
- Mathematical formulation of scaled dot-product attention
- Causal masking in autoregressive generation
- Temperature scaling and top-k sampling strategies
- GPT-2 Small architecture specifications
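Temperature and top-k work exactly as listed above: temperature rescales the logits before the softmax, and top-k restricts sampling to the k most likely candidates. A minimal illustrative sketch, not the exact code from app.js:

```js
// Sample a token index from raw logits using temperature scaling and top-k filtering.
function sampleNextToken(logits, temperature = 1.0, topK = 5) {
  // 1. Temperature: divide logits, then apply a numerically stable softmax.
  const scaled = logits.map((l) => l / temperature);
  const max = Math.max(...scaled);
  const exps = scaled.map((l) => Math.exp(l - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  const probs = exps.map((e) => e / sum);

  // 2. Top-k: keep only the k highest-probability candidates.
  const ranked = probs
    .map((p, id) => ({ id, p }))
    .sort((a, b) => b.p - a.p)
    .slice(0, topK);
  const kSum = ranked.reduce((s, t) => s + t.p, 0);

  // 3. Sample from the renormalized top-k distribution.
  let r = Math.random() * kSum;
  for (const { id, p } of ranked) {
    r -= p;
    if (r <= 0) return id;
  }
  return ranked[ranked.length - 1].id; // floating-point fallback
}

// Higher temperature flattens the distribution; a smaller topK makes output more deterministic.
```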
| Resource | Link |
|---|---|
| Live Simulation | simulasillm.vercel.app |
| Research Paper | paper-llm-attention.vercel.app |
| Source Code | github.com/romizone/simulasiLLM |
This project is open source and available for educational purposes.
Made with ❤️ by Romi Nur Ismanto