TL;DR: I co-trained a summarizer and a generator to learn a compression scheme for text in the same token space as the base model, so the generator can continue from the compressed context with almost the same quality while using an order of magnitude fewer context tokens. Along the way the model discovers its own compression tricks: aggressive pruning, dense punctuation (lots of semicolons), and even occasionally switching into Mandarin to pack more information per token.
You can read the full blog post here.
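To make the idea concrete, here is a minimal inference-time sketch of what the TL;DR describes, not the repo's actual code: a summarizer emits a much shorter compression of a long context in the same token space, and the generator continues from that compressed prefix instead of the full text. The model name, prompt wording, and 10x token budget are illustrative assumptions, and a single pretrained model stands in for the co-trained summarizer/generator pair.

```python
# Minimal sketch of the compress-then-continue loop described above.
# Assumptions (not from the repo): the model name, the prompt wording,
# and the 10x token budget; one pretrained model stands in for the
# co-trained summarizer/generator pair.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL = "meta-llama/Llama-3.1-8B-Instruct"  # hypothetical base model
tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(MODEL, device_map="auto")

def compress(context: str, ratio: int = 10) -> str:
    """Ask the summarizer to rewrite `context` ~`ratio`x shorter,
    in the same token space the generator reads."""
    budget = max(1, len(tok(context)["input_ids"]) // ratio)
    prompt = (f"Compress the following text into at most {budget} tokens, "
              f"keeping everything needed to continue it:\n\n{context}\n\nCompressed:")
    ids = tok(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**ids, max_new_tokens=budget, do_sample=False)
    return tok.decode(out[0, ids["input_ids"].shape[1]:], skip_special_tokens=True)

def continue_from(compressed: str, max_new_tokens: int = 128) -> str:
    """Generator continues from the compressed context instead of the full one."""
    ids = tok(compressed, return_tensors="pt").to(model.device)
    out = model.generate(**ids, max_new_tokens=max_new_tokens, do_sample=False)
    return tok.decode(out[0, ids["input_ids"].shape[1]:], skip_special_tokens=True)

long_context = "..."  # your long document
print(continue_from(compress(long_context)))
```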
It's super simple.
- Set your `TINKER_API_KEY` and `WANDB_API_KEY` in a `.env` file.
- Run `uv sync`.
- Run `uv run run_train.py`.
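For reference, the `.env` file is plain text in the project root; the variable names come from the first step above, and the values here are placeholders for your own keys:

```
TINKER_API_KEY=your-tinker-api-key
WANDB_API_KEY=your-wandb-api-key
```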