rote

Automatic, dependency-aware memoization for Python research scripts. No interpreter fork, no decorators required.

rote is a pure-Python reimplementation of IncPy (Guo & Engler, ISSTA 2011) on contemporary CPython (≥3.12). Same goal as the original: observe a script at runtime, find the function calls that are pure and long-running, and persist their results across runs. The implementation is new, built on sys.monitoring (PEP 669) and audit hooks (PEP 578), so no patched interpreter is needed.

There's a companion site that walks through the design, the speedups, and where rote diverges from the paper. If you're reading this for the first time, start there.

Why

You change one line in analyze.py, save, re-run. Plain Python re-does the 90 seconds of feature extraction, the 30 seconds of model training, and the 2 seconds of plotting, all to look at one tweaked plot. That re-work is what IncPy was built to remove in 2011. It's still the problem.

Install

pip install rote                  # core
pip install "rote[all]"           # plus pyarrow, numpy, safetensors

Or with uv:

uv add rote                       # or `uv add "rote[all]"`

Local development:

git clone https://github.com/puppyum/rote.git
cd rote
uv venv --python 3.13 && source .venv/bin/activate
uv pip install -e ".[dev,all]"

Requires Python 3.12 or later. Apache-2.0.

Use

Three ways, ordered by how much you have to opt in.

Zero-config, paper-style

Prefix your script invocation:

rote run analyze.py

The CLI AST-wraps every top-level function in your script and in any helper modules it imports. Run the script a second time after a downstream edit; only the changed function re-executes.

Decorator

When you want to be explicit:

import rote

@rote.cache
def build_features(df):
    ...

Inside a notebook or REPL

import rote
with rote.auto():
    result = my_pipeline(data)

In Jupyter, %load_ext rote.jupyter makes every cell a memoization candidate.

What gets cached

A function call is memoized when all of these hold:

It ran for at least min_duration_s (default 1 s). Below that, the cache write costs more than re-running.
No impure I/O happened during the call. Network, subprocess, file appends, exec/eval, and stdlib non-determinism sources (time.time(), random.random(), uuid.uuid4(), os.environ) all disqualify it.
No argument mutated. Arguments are fingerprinted on entry and re-checked on exit.
The function's source, every function it transitively calls, and every file it read are unchanged from the cached version.

If any check fails, the cache misses and the function runs. A cached value that can't be proven safe never gets returned; the tests/correctness/ suite includes 36 perturbation tests and 60 differential tests that fail loudly if a cached value drifts from a fresh run.

The serializer dispatches by type: Arrow IPC for DataFrames, numpy.save for arrays, safetensors for Torch tensors, msgpack for primitives, cloudpickle as a last resort.

Measured performance

Apple Silicon, Python 3.13. Warm-hit timings are medians of 20 iterations; the cross-process and pipeline numbers are medians of 5 runs.

Per-function warm-hit cost against joblib.Memory:

Workload	joblib warm	rote warm	speedup
2 M-term Leibniz	101 µs	49 µs	2.06×
Basel sum	90 µs	35 µs	2.58×
400×400 NumPy QR	226 µs	35 µs	6.40×
200K-char bag-of-words	98 µs	37 µs	2.64×
200×200 matrix inverse	88 µs	69 µs	1.29×

Geomean across the five workloads: 2.59× faster than joblib.Memory.

On the paper-style multi-stage pipeline (parse → aggregate → format), with an edit to the final stage and everything in one process: plain Python re-runs the whole thing in 252 ms; rote skips the upstream stages and finishes the warm run in 5.5 ms, about 46× faster than the cold pipeline. joblib.Memory is faster on the same benchmark (1.8 ms warm) because it keys purely on argument values, where rote content-hashes the intermediate files on every hit so a mtime-preserving edit cannot return a stale result.

The tradeoff at the level you actually live with — edit, save, rerun, fresh Python process each time:

	wall-clock	vs plain
plain Python (whole pipeline)	1.75 s	—
`rote` warm (fresh interpreter)	0.35 s	4.9×
`joblib` warm (fresh interpreter)	0.19 s	9.4×

A persistent stat → content-hash table in the cache store is what keeps rote's file-dep validation cheap across process boundaries: each warm subprocess does a stat() per dependency and reuses the stored hash unless (size, mtime_ns, ctime_ns) change. Joblib still wins here because it skips content validation outright. Full numbers, the correctness/speed tradeoff, and a serializer breakdown live in docs/BENCHMARKS.md.

Test suite: 381 tests pass, including 60 differential and 36 perturbation tests. On the corpus/realistic/ subset (five multi-second scripts), auto-mode eliminates 100% of cold compute on warm re-run. mypy --strict and ruff clean. CI runs Linux, macOS, and Windows on Python 3.12 and 3.13.

Public API

Name	Purpose
`rote.cache`	Decorator. The explicit escape hatch.
`rote.auto()`	Context manager. Every call inside the block is a candidate.
`rote.invalidate(target=None)`	Drop entries. `target` is a function, a qualname string, or `None` for everything.
`rote.clear()`	Wipe all tiers (in-memory + SQLite + blobs).
`rote.configure(**kwargs)`	Override defaults (cache dir, `min_duration_s`, fsync, telemetry, ...).
`rote.stats()`	Hits, misses, time saved, invalidation reasons.
`rote.graph()`	A `networkx.DiGraph` of observed caller → callee edges.
`rote run <script>`	CLI: run a script under auto-mode.
`rote status`	CLI: print stats for the cache in the CWD.
`rote clear`	CLI: wipe the cache in the CWD.

Layout

src/rote/         the package (13 modules, ~4K lines)
tests/            unit / property / integration / correctness suites
docs/             architecture, benchmarks, evaluation, joblib migration
bench/            workload + serializer microbenchmarks
corpus/           30 fast scripts for differential tests, plus a realistic/ subset for coverage
examples/         demos used by the integration tests

Architecture in detail: docs/architecture.md. Benchmarks: docs/BENCHMARKS.md. Recent changes: CHANGELOG.md.

Limitations

Python 3.12+ only. sys.monitoring (PEP 669) is the load-bearing primitive; there's no fallback for older interpreters.
Functions doing real I/O are skipped. Network reads, append-mode file writes, and subprocess calls all disqualify a call. The system is built for compute-heavy steps that take a data file in and return a value out.
First run pays an AST-transform cost. Auto-mode rewrites your script through libcst once per source change; the rewrite is cached on disk after that.
The 1-second default threshold is conservative. Sub-second calls aren't memoized unless you lower it explicitly with rote.configure(min_duration_s=0.05).

License

Apache-2.0. See LICENSE.

Citing IncPy

If you use rote in academic work, cite the original paper:

Guo, P. J., & Engler, D. (2011). Using automatic persistent memoization to
facilitate data analysis scripting. Proceedings of the 2011 International
Symposium on Software Testing and Analysis (ISSTA '11), 287–297.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

rote

Why

Install

Use

Zero-config, paper-style

Decorator

Inside a notebook or REPL

What gets cached

Measured performance

Public API

Layout

Limitations

License

Citing IncPy

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.github/workflows		.github/workflows
bench		bench
corpus		corpus
docs		docs
examples		examples
site		site
src/rote		src/rote
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

rote

Why

Install

Use

Zero-config, paper-style

Decorator

Inside a notebook or REPL

What gets cached

Measured performance

Public API

Layout

Limitations

License

Citing IncPy

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages