LoopedHRM

A research codebase for Two-Level Adaptive Compute — Hierarchical Iterative Solver with Sudoku as the first environment.

This repository is intentionally scaffolded as a skeleton. It captures project structure, conceptual interfaces, and implementation placeholders while preserving the core design goals:

iterative latent refinement (Looped-style)
hierarchical reasoning stages (HRM-style)
learned halting at both refinement and segment levels
adaptive compute allocation based on puzzle difficulty

Project Status

🚧 Early scaffold stage. Modules currently contain interfaces and TODO markers.

Core Research Goal

Learn how much computation to use while solving structured reasoning tasks.

Two compute levels are modeled as interacting modules:

Lower module for local latent refinement (variable refine-step bursts)
Upper module for global update, evaluation, and continuation/stop decisions

The model should learn to stop both levels dynamically and allocate lower/upper compute based on puzzle difficulty.

Repository Layout

.
├── configs/                     # Experiment and model configuration skeletons
├── docs/
│   └── design_doc.md            # Living design document from the proposal
├── notebooks/                   # Analysis notebooks (future)
├── scripts/                     # Training/evaluation entrypoints
├── src/looped_hrm/
│   ├── data/                    # Sudoku generation/loading and batching
│   ├── models/                  # Hierarchical iterative solver components
│   ├── compute/                 # Halting and compute budget policies
│   ├── training/                # Losses, trainer loops, curriculum
│   ├── evaluation/              # Metrics and benchmarking
│   └── utils/                   # Common helpers
└── tests/                       # Unit/integration test skeletons

Quick Start (uv)

uv sync --extra dev
uv run python scripts/train.py --config configs/experiment/sudoku_baseline.yaml

The training pipeline is not yet implemented; current scripts are placeholders.
uv run ... automatically uses the project environment, so manual activation is optional.

Sudoku Dataset Source

This project now expects Sudoku data in the HRM dataset format (generated by dataset/build_sudoku_dataset.py from https://github.com/sapientinc/HRM).

Place the generated data folder under data/ (for example data/sudoku-extreme-full) so looped_hrm.data.sudoku_dataset.SudokuDataset can load train/ and test/ splits. Alternatively all scripts that use a dataset can be pointed to a directory.

Pre-merge Testing Workflow

Use this workflow before merging dataset-loader changes.

Install dependencies (runtime + dev/test):
```
uv sync --extra dev
```
Run unit tests:
```
uv run pytest -q
```

Run synthetic smoke test (no external data required):

uv run python scripts/validate_sudoku_dataset.py --self-test

Validate against a real HRM dataset folder (if available):

uv run python scripts/validate_sudoku_dataset.py --dataset-path data/sudoku-extreme-full


## Development Priorities

1. Implement Sudoku dataset and difficulty buckets.
2. Implement latent state update/refinement module.
3. Implement dual halting heads:
   - refinement halt
   - segment halt
4. Add training objective balancing correctness vs compute penalty.
5. Instrument compute-usage metrics by difficulty.

## Design Source

See `docs/design_doc.md` for the full conceptual plan and constraints.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
configs		configs
docs		docs
scripts		scripts
src/looped_hrm		src/looped_hrm
tests		tests
.gitignore		.gitignore
.gitkeep		.gitkeep
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LoopedHRM

Project Status

Core Research Goal

Repository Layout

Quick Start (uv)

Sudoku Dataset Source

Pre-merge Testing Workflow

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LoopedHRM

Project Status

Core Research Goal

Repository Layout

Quick Start (uv)

Sudoku Dataset Source

Pre-merge Testing Workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages