ape together strong. token count small.
Before/After · Install · Levels · Skills · Benchmarks · Evals
A Claude Code skill/plugin and Codex plugin that makes the agent answer in a compressed ape voice – cutting output tokens hard while keeping full technical accuracy. Now with terse commits, one-line code reviews, and a compression tool that cuts ~45% of input tokens every session.
Based on the viral observation that ape-speak dramatically reduces LLM token usage without losing technical substance. So we made it a one-line install.
Same fix. Less word. Brain still big.
Pick your troop level:
Same answer. You pick how many word.
```
┌───────────────────────────────────────┐
│ TOKENS SAVED        ████████   75%    │
│ TECHNICAL ACCURACY  ████████  100%    │
│ SPEED INCREASE      ████████   ~3x    │
│ VIBES               ████████   OOG    │
└───────────────────────────────────────┘
```
- Faster response → less token to generate = speed go brrr
- Easier to read → no wall of text, just the answer
- Same accuracy → all technical info kept, only fluff removed (science say so)
- Save money → fewer output tokens = less cost
- Fun → every code review become comedy
Install as a plugin – includes skills + auto-loading hooks (ape activates every session, mode badge tracks `/ape ultra` etc.):
```shell
claude plugin marketplace add JuliusBrussee/ape
claude plugin install ape@ape
```

Or install as skills:

```shell
npx skills add JuliusBrussee/ape
```

For a specific agent: `npx skills add JuliusBrussee/ape -a cursor`
Note
`npx skills` installs skills only (no hooks). For Claude Code auto-loading hooks, use the plugin install above or run `bash hooks/install.sh`.
- Clone repo → Open Codex in repo → `/plugins` → Search "Ape" → Install
Note
Windows Codex users: Clone repo → VS Code → Codex Settings → Plugins → find Ape under local marketplace → Install → Reload Window. Also set `git config core.symlinks true` before cloning (requires Developer Mode or admin).
Install once. Use in all sessions after that. One troop. That it.
Add an `[APE:ULTRA]` badge to your statusline showing which mode is active. See `hooks/README.md` for the snippet.
Trigger with:
- `/ape` (or `$ape` in Codex)
- "talk like ape"
- "ape mode"
- "less tokens please"
Stop with: "stop ape" or "normal mode"
| Level | Trigger | What it do |
|---|---|---|
| Lite | `/ape lite` | Drop filler, keep grammar. Professional but no fluff |
| Full | `/ape full` | Default ape. Drop articles, fragments, sparse ape tone |
| Ultra | `/ape ultra` | Maximum compression. Telegraphic. Abbreviate hard |
| Micro | `/ape micro` | Answer only. Minimal words. No framing |
Level stick until you change it or session end.
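Rough feel for the levels on one question (illustrative wording only, not captured model output):

```
Q: why my React list re-render every keystroke?

lite:  The rows use array indices as keys, so React remounts them. Use stable IDs.
full:  Index keys. React remount rows each render. Use stable ID.
ultra: idx keys → remount. use stable id.
micro: stable keys.
```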
| Skill | What it do | Trigger |
|---|---|---|
| ape-commit | Terse commit messages. Conventional Commits. ≤50 char subject. Why over what. | `/ape-commit` |
| ape-review | One-line PR comments: `L42: 🔴 bug: user null. Add guard.` No throat-clearing. | `/ape-review` |
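A hypothetical ape-commit message in that shape (illustrative only; the change described is made up):

```
fix(auth): guard null user in middleware

Expired token left req.user null, crashed downstream handlers.
Guard early, return 401. Why: fail fast beats stack trace.
```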
Ape make Claude speak with fewer tokens. Compress make Claude read fewer tokens.
Your CLAUDE.md loads on every session start. Ape Compress rewrites memory files into ape-speak so Claude reads less – without you losing the human-readable original.
/ape:compress CLAUDE.md
- `CLAUDE.md` → compressed (Claude reads this every session → fewer tokens)
- `CLAUDE.original.md` → human-readable backup (you read and edit this)
| File | Original (tokens) | Compressed (tokens) | Saved |
|---|---|---|---|
| claude-md-preferences.md | 706 | 285 | 59.6% |
| project-notes.md | 1145 | 535 | 53.3% |
| claude-md-project.md | 1122 | 687 | 38.8% |
| todo-list.md | 627 | 388 | 38.1% |
| mixed-with-code.md | 888 | 574 | 35.4% |
| Average | 898 | 494 | 45% |
Code blocks, URLs, file paths, commands, headings, dates, version numbers – anything technical passes through untouched. Only prose gets compressed. See the full ape-compress README for details. Security note: Snyk flags this as High Risk due to subprocess/file patterns; it's a false positive.
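The passthrough idea can be sketched in a few lines – a toy illustration, not the plugin's actual implementation (the filler-word list here is invented):

```python
import re

FENCE = "`" * 3  # literal triple-backtick, built up to avoid breaking this fence

# Toy filler list; the real skill uses a much richer rewrite.
FILLER = re.compile(r"\b(?:the|a|an|very|really|just|basically)\b ?", re.IGNORECASE)

def compress_prose(text: str) -> str:
    # Toy "ape" pass: strip filler words from prose only.
    return FILLER.sub("", text)

def ape_compress(markdown: str) -> str:
    # Split on fenced code blocks, keeping the fences as segments;
    # code segments pass through untouched, prose segments get compressed.
    parts = re.split(f"({FENCE}.*?{FENCE})", markdown, flags=re.DOTALL)
    return "".join(p if p.startswith(FENCE) else compress_prose(p) for p in parts)
```

Same principle at full scale: segment first, then compress only the prose segments.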
Historical token counts from an earlier benchmark run (reproduce it yourself):
| Task | Normal (tokens) | Ape (tokens) | Saved |
|---|---|---|---|
| Explain React re-render bug | 1180 | 159 | 87% |
| Fix auth middleware token expiry | 704 | 121 | 83% |
| Set up PostgreSQL connection pool | 2347 | 380 | 84% |
| Explain git rebase vs merge | 702 | 292 | 58% |
| Refactor callback to async/await | 387 | 301 | 22% |
| Architecture: microservices vs monolith | 446 | 310 | 30% |
| Review PR for security issues | 678 | 398 | 41% |
| Docker multi-stage build | 1042 | 290 | 72% |
| Debug PostgreSQL race condition | 1200 | 232 | 81% |
| Implement React error boundary | 3454 | 456 | 87% |
| Average | 1214 | 294 | 65% |
Range: 22%–87% savings across prompts.
Important
Ape only affects output tokens – thinking/reasoning tokens are untouched. Ape no make brain smaller. Ape make mouth smaller. Biggest win is readability and speed; cost savings are a bonus.
A March 2026 paper "Brevity Constraints Reverse Performance Hierarchies in Language Models" found that constraining large models to brief responses improved accuracy by 26 percentage points on certain benchmarks and completely reversed performance hierarchies. Verbose not always better. Sometimes less word = more correct.
Ape not just claim compression. Ape measure it.
The evals/ directory has a three-arm eval harness that measures real token compression against a proper control β not just "verbose vs skill" but "terse vs skill". Because comparing ape to verbose Claude conflate the skill with generic terseness. That cheating. Ape not cheat.
```shell
# Run the eval (needs claude CLI)
uv run python evals/llm_run.py

# Read results (no API key, runs offline)
uv run --with tiktoken python evals/measure.py
```

Snapshots are locally generated artifacts and are not committed. Run the eval when you want fresh numbers. Add a skill, add a prompt; harness picks it up automatically.
If ape save you mass token, mass money β leave mass star. β
- Cavekit – specification-driven development for Claude Code. Ape language → specs → parallel builds → working software.
- Revu – local-first macOS study app with FSRS spaced repetition, decks, exams, and study guides. revu.cards
MIT – free like mass mammoth on open plain.
