This project is 100% built by AI. Every line of code, every test, every config file, every batch script, and even this README — all written by AI agents (Claude, DeepSeek, Qwen). No human wrote a single line of code. Yes, it's a bit absurd. But it works.
An AI-native 2D game engine prototype. Core innovation: closed-loop AI game testing.
Chinese documentation: README.zh-CN.md
Developer describes requirements in natural language -> AI writes scripts -> sandbox executes headlessly -> AI analyzes results -> AI iterates -> developer reviews deterministic replay.
```
Developer: "Make the player jump over the gap and land on the platform"
        |
        v
LLM Bridge (DeepSeek V3.2 / Qwen3-Max / Custom)
        |  Writes game.onTick() script
        v
Docker Sandbox (headless, network=none, memory-limited)
        |  Runs 180 frames of physics simulation
        v
AI Probe & Analysis
        |  Checks: did the player reach the target? How close? Trajectory events?
        v
AI Iterates (up to 40 rounds)
        |  Adjusts jumpForce, timing, moveSpeed based on precision feedback
        v
Deterministic Replay -> Developer watches the result in the browser
```
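The script the bridge writes is a plain per-frame handler. The README does not show the sandbox's actual API, so the `game.onTick` registration and the `api.move`/`api.jump` helpers below are hypothetical names, used only to illustrate the shape of such a script (with a tiny stand-in runtime so the sketch is runnable):

```typescript
// Minimal stand-in for the sandbox runtime so the script shape is runnable.
// `game.onTick` and the `api` helpers are illustrative names, NOT the
// engine's documented API.
type Tick = (frame: number) => void;
const commands: string[] = [];
const game = {
  handler: undefined as Tick | undefined,
  onTick(fn: Tick) { this.handler = fn; },
};
const api = {
  move: (dir: "left" | "right") => commands.push(`move:${dir}`),
  jump: () => commands.push("jump"),
};

// The kind of script the LLM bridge might write:
// hold right for half a second, then jump once at frame 30.
game.onTick((frame) => {
  if (frame < 30) api.move("right");
  if (frame === 30) api.jump();
});

// The sandbox would drive the handler for 180 frames of simulation.
for (let frame = 0; frame < 180; frame++) game.handler?.(frame);
```

On a failed run, the iteration loop would rewrite the timing constants (the `30` above) based on the trajectory feedback and try again.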
```
GameSenseSandbox/
├── src/
│   ├── engine/               # Pure-logic state engine
│   │   ├── state.ts          - GameState, Entity, Input type definitions
│   │   ├── tick.ts           - Core tick loop (60 FPS fixed timestep)
│   │   ├── collision.ts      - AABB collision detection & resolution
│   │   ├── scene.ts          - Scene config -> initial state builder
│   │   └── rng.ts            - Deterministic PRNG (seedrandom)
│   ├── sandbox/              # Script execution via Docker container
│   │   ├── index.ts          - Sandbox.run() orchestrator
│   │   └── docker.ts         - Docker container lifecycle management
│   ├── replay/               # State recording and replay
│   │   ├── recorder.ts       - Frame-by-frame state recording
│   │   └── player.ts         - Replay playback controller
│   ├── mcp/                  # MCP (Model Context Protocol) server
│   │   ├── index.ts          - MCP module entry
│   │   ├── server.ts         - Tool registration (scene_load, sandbox_run, etc.)
│   │   ├── scene-manager.ts  - Scene state management singleton
│   │   └── probes.ts         - Runtime probe queries
│   ├── adapter/              # LLM bridge layer
│   │   ├── llm-bridge.ts     - Core AI iteration loop with function calling
│   │   ├── models.ts         - LLM provider configurations
│   │   └── config.ts         - Runtime config management
│   ├── viewer/               # HTML5 Canvas frontend
│   │   ├── app.ts            - Main application controller
│   │   ├── canvas.ts         - Canvas renderer (replay + scene editor)
│   │   └── index.html        - Entry point
│   └── scenarios/            # Built-in test scenario definitions
│       └── index.ts          - Scenario registry
├── docker/
│   ├── runner/
│   │   └── runner.js         - Headless physics engine (mirrors src/engine)
│   └── Dockerfile            - Minimal Node.js sandbox image
├── tests/                    # Vitest test suites (132 tests, 16 files)
├── server.mjs                # Production static file server
└── vite.config.ts            # Vite config with API plugin
```
- Deterministic physics: Fixed 60 FPS timestep, seeded PRNG, no `Math.random()` or `Date.now()` in game logic. Same seed + same inputs = identical replay, always.
- Docker isolation: AI-generated scripts run in a network-disabled, memory-limited, read-only Docker container. The host process is never at risk.
- Dual physics engine: `src/engine/collision.ts` (TypeScript, for the viewer) and `docker/runner/runner.js` (JavaScript, for the headless sandbox) implement identical AABB collision & grounding logic.
- Pure functions: Game state is immutable between ticks. Each `tick()` produces a new state object.
- AI iteration loop: The LLM bridge uses OpenAI-compatible function calling to orchestrate scene setup, script writing, sandbox execution, and result analysis in a closed loop.
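The isolation properties above map directly onto standard `docker run` flags. The invocation below is an illustrative sketch only: the image name, memory cap, and mount path are assumptions, not the project's actual values (see `src/sandbox/docker.ts` for the real lifecycle code).

```shell
# Illustrative sketch: image name, memory cap, and mount path are
# assumptions, not the project's actual configuration.
# --network none : the AI-written script has no network access
# --memory 128m  : hard memory cap on the container
# --read-only    : read-only root filesystem
docker run --rm \
  --network none \
  --memory 128m \
  --read-only \
  -v "$PWD/script.js:/sandbox/script.js:ro" \
  gamesense-runner \
  node /sandbox/runner.js
```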
- Coordinate system: Y-axis points UP (positive Y = higher). Origin at bottom-left.
- Gravity: Default 980 px/s^2 downward, applied as `vy -= gravity * dt` each frame.
- Collision: AABB overlap detection with edge-biased normal calculation. Vertical collisions set `isGrounded = true` when an entity lands on top of another.
- Input handling: `applyInput()` sets player velocity from keyboard/AI input. Jump only fires when `isGrounded` is true. Double jump supported (edge-triggered, 85% force).
- Tick order: `applyInput -> applyPendingCommands -> resetFrameState -> applyPhysics -> detectCollisions -> resolveCollisions -> detectTriggerOverlaps`
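The gravity, grounding, and jump rules above can be condensed into a small self-contained sketch. This is not the engine's actual code (field names and the flat-floor resolution are simplifications), but it follows the same rules: gravity applied as `vy -= gravity * dt`, pure state transitions, and jumps gated on `isGrounded`:

```typescript
// Minimal sketch of the physics rules above; NOT the engine's actual code.
// Y points up, gravity pulls vy down, landing sets isGrounded.
interface Body { x: number; y: number; vx: number; vy: number; isGrounded: boolean; }

const GRAVITY = 980; // px/s^2, the README's default
const DT = 1 / 60;   // fixed timestep

function step(b: Body, floorY: number): Body {
  // Pure function: returns a new state object instead of mutating.
  let vy = b.vy - GRAVITY * DT;  // gravity, as in `vy -= gravity * dt`
  let y = b.y + vy * DT;
  let isGrounded = false;
  if (y <= floorY) {             // simplified "landed on top" resolution
    y = floorY;
    vy = 0;
    isGrounded = true;
  }
  return { ...b, y, vy, isGrounded };
}

function jump(b: Body, jumpForce: number): Body {
  // Jump only fires when grounded, mirroring applyInput()'s rule.
  return b.isGrounded ? { ...b, vy: jumpForce, isGrounded: false } : b;
}

// Drive a short simulation: jump off the floor, rise, fall back, land.
let body: Body = { x: 0, y: 0, vx: 0, vy: 0, isGrounded: true };
body = jump(body, 400);
let peak = 0;
for (let frame = 0; frame < 180; frame++) {
  body = step(body, 0);
  peak = Math.max(peak, body.y);
}
```

With a 400 px/s jump force the body rises roughly 80 px before gravity brings it back, which is the kind of number the AI loop tunes against the precision feedback.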
- Play/Pause, step forward/backward, timeline scrubbing
- Speed control (0.25x - 4x)
- Replay archive (last 5 sessions)
- Import/Export replay JSON
- Export scene config from replay
- Unity-style editor with Edit/Play/Pause modes
- Create/move/resize entities (player, platform, obstacle, trigger)
- WASD/arrow keys control in play mode with real-time physics
- Mouse wheel zoom + middle-button pan
- Export/Import scene layout JSON
- Auto-stop when player falls below y=-1000
- Built-in challenge scenes bundled into the viewer
- Click to load into scene editor
- Includes narrow landing chains, vertical/horizontal mixed routes
- Natural language game scene testing via LLM
- Connects to DeepSeek V3.2 / Qwen3-Max / custom OpenAI-compatible endpoints
- AI writes sandbox scripts, runs, probes, and iterates automatically
- Up to 40 iterations with progressive stagnation escalation
- Precision trajectory feedback (takeoff/peak/landing events, closest-approach offset)
- AI obeys exact engine physics rules (gravity, jump limits, AABB)
- Console.log output from sandbox scripts displayed in chat
- Scene changes synced back to editor
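The precision trajectory feedback listed above (takeoff/peak/landing events, closest-approach offset) can be derived from the recorded frames. The sketch below is a hypothetical illustration of that analysis; the actual `probes.ts` API and its field names are not shown in this README:

```typescript
// Hypothetical sketch of trajectory-event extraction from recorded frames.
// The real probes.ts interface may differ; these names are illustrative.
interface Frame { frame: number; x: number; y: number; isGrounded: boolean; }

function analyze(frames: Frame[], target: { x: number; y: number }) {
  let takeoff = -1;
  let landing = -1;
  let peak = { frame: -1, y: -Infinity };
  let closestApproach = Infinity;
  for (let i = 1; i < frames.length; i++) {
    const prev = frames[i - 1];
    const cur = frames[i];
    // Edge-triggered events: grounded -> airborne is takeoff, reverse is landing.
    if (prev.isGrounded && !cur.isGrounded) takeoff = cur.frame;
    if (!prev.isGrounded && cur.isGrounded) landing = cur.frame;
    if (cur.y > peak.y) peak = { frame: cur.frame, y: cur.y };
    closestApproach = Math.min(
      closestApproach,
      Math.hypot(cur.x - target.x, cur.y - target.y),
    );
  }
  return { takeoff, peak, landing, closestApproach };
}
```

Feedback in this shape ("landed 12 px short of the platform") is what lets the LLM adjust `jumpForce` or timing numerically instead of guessing.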
| Provider | Model | API Endpoint |
|---|---|---|
| DeepSeek | deepseek-chat (V3.2) | api.deepseek.com |
| Qwen | qwen3-max | dashscope.aliyuncs.com |
| Custom | user-defined | user-defined |
Note: You can use any OpenAI-compatible service by setting a custom `baseUrl` + `model` in the UI.
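Any OpenAI-compatible service works because the bridge only needs the standard `/chat/completions` request shape. A sketch of assembling such a request follows; the `baseUrl`, model name, and helper function are placeholders, not code from `llm-bridge.ts`:

```typescript
// Sketch of an OpenAI-compatible chat request. The helper name and the
// example baseUrl/model are placeholders, not the project's actual code.
interface ChatConfig { baseUrl: string; model: string; apiKey: string; }

function buildChatRequest(cfg: ChatConfig, userMessage: string) {
  return {
    // Normalize a trailing slash, then append the standard path.
    url: `${cfg.baseUrl.replace(/\/$/, "")}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${cfg.apiKey}`,
      },
      body: JSON.stringify({
        model: cfg.model,
        messages: [{ role: "user", content: userMessage }],
      }),
    },
  };
}

// Usage: const { url, init } = buildChatRequest(cfg, "Make the player jump");
//        const res = await fetch(url, init);
```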
| Component | Technology | Version |
|---|---|---|
| Language | TypeScript (strict) | 5.5+ |
| Runtime | Node.js | >= 20 |
| Sandbox | Docker (network=none, read-only, memory limit) | - |
| Testing | Vitest | 2.0+ |
| Linting | Biome | 1.9+ |
| Bundler | Vite | 6.0+ |
| PRNG | seedrandom | 3.0+ |
| MCP SDK | @modelcontextprotocol/sdk | 1.27+ |
| Schema | Zod | 3.23+ |
- Node.js >= 20 (required)
- Docker (required for AI sandbox execution)
- An API key for at least one supported LLM provider (for AI chat feature)
Double-click start.bat. It will:
- Check Node.js and Docker availability
- Install npm dependencies
- Build TypeScript
- Generate a demo replay
- Build the viewer
- Start the server
```shell
npm install
npm start              # Start Vite dev server at http://localhost:5173
npm run build:viewer   # Build viewer to dist/
node server.mjs        # Serve from dist/
```
Or on Windows, double-click `build-release.bat` to produce a `release/` folder.
```shell
npm install

# Type check
npx tsc --noEmit

# Run all 132 tests
npx vitest run

# Run specific test suites
npx vitest run tests/engine/
npx vitest run tests/sandbox/
npx vitest run tests/integration/

# Lint
npx biome check src/

# Dev server with hot reload
npm run dev
```
- ~10,000 lines of TypeScript/JavaScript
- 52 source files
- 132 tests across 16 test files
- 7 MCP tools
- 4 LLM provider integrations
- Docker must be running for sandbox execution (AI chat / test scenarios that run scripts). The viewer's scene editor and replay features work without Docker.
- API keys are entered in the browser UI and sent only to the configured LLM endpoint. Keys are cleared from the frontend when you click the exit button.
- The physics engine runs at a 60 FPS fixed timestep. All timing calculations assume `dt = 1/60`. Gravity defaults to 980 px/s^2.
- Y-axis points UP. This is important for understanding position values: a player at y=200 is above a platform at y=100.
- Deterministic replay: Given the same seed and inputs, the simulation produces identical results every time. This is guaranteed by using a seeded PRNG and avoiding all non-deterministic APIs.
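The determinism guarantee hinges on the seeded PRNG. The project uses the seedrandom package; the dependency-free mulberry32 generator below is just a small stand-in that illustrates the same property, same seed in, identical sequence out:

```typescript
// Dependency-free illustration of seeded, deterministic random numbers.
// The project itself uses the seedrandom package; mulberry32 is a
// stand-in here with the same key property.
function mulberry32(seed: number): () => number {
  let a = seed >>> 0;
  return () => {
    a = (a + 0x6d2b79f5) >>> 0;
    let t = a;
    t = Math.imul(t ^ (t >>> 15), t | 1);
    t ^= t + Math.imul(t ^ (t >>> 7), t | 61);
    return ((t ^ (t >>> 14)) >>> 0) / 4294967296; // value in [0, 1)
  };
}

// Same seed -> identical sequence, so replays are reproducible.
const a = mulberry32(42);
const b = mulberry32(42);
const runA = [a(), a(), a()];
const runB = [b(), b(), b()];
```

Because every random draw flows from the seed, recording only the seed and the input stream is enough to replay an entire session bit-for-bit.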
MIT
Built entirely by AI agents. The humans just provided the requirements and pressed Enter.