context-compiler

Deterministic local-first CLI for inspecting, linting, and compacting prompt/context files.

Quick Value (Start Here)

ctxc compact --text "You are helpful. You are helpful."

Expected compacted result:

You are helpful.

No model calls. No semantic rewriting. Only explicit deterministic transforms.

Install

npm install -g context-compiler-cli

Primary command:

ctxc --help

Compatibility alias:

context-compiler --help

30-Second Quick Start

# Preview deterministic compaction (no file writes)
ctxc compact --text "You are helpful. You are helpful."

# Analyze structure/tokens/warnings
ctxc analyze --text "You are helpful. You are helpful."

# Lint deterministic prompt debt
ctxc lint --text "You are helpful. You are helpful."

# Optimize pipeline preview
ctxc optimize --text "You are helpful. You are helpful." --dry-run --diff

Path and directory shortcuts:

ctxc @prompt.md
ctxc @prompts/
cat prompt.md | ctxc

Commands

Command	Purpose	Writes files by default
`ctxc compact`	Front door: preview deterministic compaction and see resulting text	No (preview-only)
`ctxc analyze`	Inspect structure, token counts, and warnings	No
`ctxc lint`	Detect prompt/context debt with deterministic rules	No
`ctxc optimize`	Advanced pipeline workflow (`--write`, `--check`, transform controls)	No (`--write` required)

compact supports file, directory, --text, and --stdin like other commands.

ctxc compact examples/basic-prompt.md
ctxc compact examples --diff
ctxc compact --text "You are helpful. You are helpful."
echo "You are helpful. You are helpful." | ctxc compact --stdin

compact reuses optimize results and JSON shape; it does not introduce a separate output schema.

optimize is intentionally more operational than compact. Use compact for quick preview and readability; use optimize when you need pipeline controls, check mode, and explicit write workflows.

Tokenizer Selection

Default tokenizer is char. Optional real model-family tokenizer: o200k_base.

Use CLI override (highest precedence):

ctxc compact --text "You are helpful. You are helpful." --tokenizer o200k_base
ctxc analyze examples/basic-prompt.md --tokenizer char

Or use the built-in command to set it without editing JSON:

ctxc config set tokenizer.default o200k_base
ctxc config set tokenizer.default char

This creates or updates context-compiler.config.json in the current directory. Existing config keys are preserved. Use --config <path> to target a specific config file.

Or set it manually in config:

{
  "tokenizer": {
    "default": "o200k_base"
  }
}

Precedence (highest to lowest):

--tokenizer CLI flag
tokenizer.default in context-compiler.config.json
Built-in default (char)

Notes:

char is stable and lightweight.
o200k_base is more faithful for its model family than char fallback.
Neither option is a universal tokenizer for every model/runtime.

Input Modes

Inputs are explicit:

positional path => file or directory
raw text => --text "..."
stdin => --stdin

No guessing between path and raw text.

Deterministic Transform Scope

Current optimize/compact transforms:

remove-exact-duplicates
collapse-repeated-sentences
collapse-formatting-rules
truncate-tool-output
trim-oversized-examples

Protected blocks are preserved exactly:

<!-- context-compiler: protect:start -->
Do not modify this section.
<!-- context-compiler: protect:end -->

Config

By default, config is loaded from context-compiler.config.json in the current directory.

ctxc analyze examples/basic-prompt.md --config examples/context-compiler.config.json
ctxc compact examples/basic-prompt.md --config examples/context-compiler.config.json

Unknown lint rule IDs and unknown transform IDs fail clearly.

When To Use / Not Use

Use when you need:

deterministic, local prompt/context cleanup
explainable token savings
CI-friendly checks (--fail-on, --check, --max-tokens)

Do not use when you need:

semantic rewriting/paraphrasing
model-in-the-loop prompt generation
plugin/integration workflows

Verification And Benchmark

pnpm test
pnpm typecheck
pnpm build
pnpm benchmark

Or:

pnpm verify

Benchmark is local fixture-based measurement. Use it for same-machine comparisons, not universal performance claims.

Development (From Source)

git clone https://github.com/4l1n/context-compiler.git
cd context-compiler
pnpm install
pnpm build
pnpm cc compact --text "You are helpful. You are helpful."

pnpm cc is an in-repo alias. Outside the repo, use installed ctxc.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
apps/cli		apps/cli
examples		examples
packages		packages
scripts		scripts
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
GEMINI.md		GEMINI.md
LICENSE		LICENSE
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.base.json		tsconfig.base.json
turbo.json		turbo.json
vitest.workspace.ts		vitest.workspace.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

context-compiler

Quick Value (Start Here)

Install

30-Second Quick Start

Commands

Tokenizer Selection

Input Modes

Deterministic Transform Scope

Config

When To Use / Not Use

Verification And Benchmark

Development (From Source)

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

context-compiler

Quick Value (Start Here)

Install

30-Second Quick Start

Commands

Tokenizer Selection

Input Modes

Deterministic Transform Scope

Config

When To Use / Not Use

Verification And Benchmark

Development (From Source)

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages