Official implementation of Generative Modeling from Black-box Corruptions via Self-Consistent Stochastic Interpolants (Chirag Modi, Jiequn Han, Eric Vanden-Eijnden, Joan Bruna — ICLR 2026). [link]
Research code for training generative models (diffusion / stochastic interpolants) from corrupted observations — i.e., learning a clean-data prior when only degraded samples are available. Corruptions include Gaussian noise, random/block masking, Gaussian/motion blur, JPEG compression, and random projections. Targets CIFAR-10, MNIST, CelebA, SDSS DR16 quasar spectra (1-D), and 2-D synthetic distributions (`two_moons`, `checkerboard`).
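For intuition, a "corruption" here is just a black-box map applied to clean samples. A minimal sketch of the additive-Gaussian-noise case, in the spirit of the factories in `src/forward_maps.py` (the function name and signature below are illustrative, not the repo's actual API):

```python
import torch

def gaussian_noise_corruption(sigma: float):
    """Hypothetical factory: returns a black-box map x -> x + sigma * eps."""
    def corrupt(x: torch.Tensor) -> torch.Tensor:
        # eps ~ N(0, I), matching x's shape, dtype, and device
        return x + sigma * torch.randn_like(x)
    return corrupt

corrupt = gaussian_noise_corruption(sigma=0.5)  # level used in the quickstart below
```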
```bash
pip install -r requirements.txt
```

Python 3.9+ and a recent PyTorch (CUDA for serious training; CPU is fine for the 2-D MLP smoke tests) are expected.
A small MLP trained on two_moons with additive Gaussian noise. Runs on CPU in a couple of minutes and exercises the full training loop.
```bash
python scsi_synthetic.py \
    --dataset two_moons \
    --corruption gaussian_noise \
    --corruption_levels 0.5 \
    --n_samples 6000 \
    --train_steps 5000 \
    --learning_rate 3e-3 \
    --suffix test
```

Other settings use driver defaults, including `batch_size=2000`, `fc_width=256`, `fc_depth=3`, `resamples=1`, `alpha=0.9`, `ode_steps=40`, and `t_emb_dim=32`.
Outputs (loss curve, intermediate denoising snapshots, final model) land under `./results/two_moons-gaussian_noise-0.50-test/`. `tests/test_synthetic.py` runs a much smaller configuration as a wiring check.
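To poke at the result afterwards, the checkpoint can be loaded with plain `torch.load`. Note the exact contents (raw `state_dict` vs. a wrapped dict with EMA/optimizer states) and the filename (`model-best.pt`, per the outputs table below) depend on what `src/trainer_si.py` saves, so inspect before assuming:

```python
import torch

ckpt = torch.load(
    "results/two_moons-gaussian_noise-0.50-test/model-best.pt",
    map_location="cpu",
)
# Could be a bare state_dict or a dict holding model/EMA/optimizer states.
print(type(ckpt), list(ckpt.keys())[:5] if isinstance(ckpt, dict) else None)
```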
A tiny U-Net trained on 50%-masked MNIST. Runs in a couple of minutes on a single GPU and exercises the image training pipeline end-to-end.
```bash
python -u scsi_image.py \
    --dataset mnist \
    --corruption random_mask \
    --corruption_levels 0.5 0.0 \
    --train_steps 100 \
    --channels 32 \
    --ode_steps 64 \
    --alpha 0.9 \
    --resamples 2 \
    --learning_rate 3e-4 \
    --save_every 10 \
    --suffix test
```

Other settings use driver defaults; for `random_mask`, omitting a third corruption level means the masked-region noise parameter defaults to 0.0.
Outputs land under `./results/singleview/mnist-random_mask-0.50-0.00-test/`. `tests/test_image.py` runs a much smaller configuration as a wiring check.
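The `corruption_levels` pair above maps to (mask fraction, noise level inside the masked region). A hypothetical stand-alone version of that semantics, not the repo's actual `random_mask` implementation:

```python
import torch

def random_mask_corruption(mask_frac: float, masked_noise: float = 0.0):
    """Hypothetical: drop a random mask_frac of pixels, optionally
    replacing them with Gaussian noise of scale masked_noise."""
    def corrupt(x: torch.Tensor) -> torch.Tensor:
        keep = (torch.rand_like(x) > mask_frac).to(x.dtype)  # 1 = observed pixel
        return keep * x + (1.0 - keep) * masked_noise * torch.randn_like(x)
    return corrupt

corrupt = random_mask_corruption(0.5, 0.0)  # the MNIST quickstart settings
```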
A full CIFAR-10 run with random masking:

```bash
python -u scsi_image.py \
    --dataset cifar10 --corruption random_mask \
    --corruption_levels 0.5 0.0 1.0 \
    --train_steps 50000 --channels 32 --ode_steps 64 \
    --alpha 0.9 --resamples 2 --learning_rate 5e-4 --lr_scheduler
```

CIFAR-10 downloads to `$SCSI_DATA` (default `./data/cifar10/`) on first call. Outputs go to `$SCSI_RESULTS/singleview/...` (or `multiview/...` with `--multiview`).
Distributed (single-node, multi-GPU) training uses `torchrun`:

```bash
torchrun --standalone --nproc_per_node=4 scsi_distributed.py [args...]
```

Two environment variables (also exposed as CLI flags) control where data and outputs live:
| Variable | CLI flag | Default | Purpose |
|---|---|---|---|
| `SCSI_DATA` | `--data_root` | `./data` | Dataset caches (MNIST, CIFAR-10, CelebA, synthetic, QSO) |
| `SCSI_RESULTS` | `--results_root` | `./results` | Training outputs, `model-best.pt`, logs |
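A minimal sketch of the resolution pattern implied by this table; the CLI-flag-over-env-var precedence is our assumption, so see `src/paths.py` for the actual logic:

```python
import os
from pathlib import Path

def resolve_root(cli_value, env_var, default):
    """Assumed precedence: explicit CLI flag > environment variable > default."""
    return Path(cli_value or os.environ.get(env_var, default))

data_root = resolve_root(None, "SCSI_DATA", "./data")
results_root = resolve_root(None, "SCSI_RESULTS", "./results")
```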
All drivers pull data via `get_dataset(name, data_root, seed=42)` from `src/custom_datasets.py`. Supported names:
- `mnist`, `cifar10` — downloaded via `torchvision` on first call.
- `celebA` — not auto-downloaded; place `img_align_celeba/` under `$SCSI_DATA/celebA/`.
- `two_moons`, `checkerboard` — synthetic; cached under `$SCSI_DATA/<name>/seed_<seed>/`.
- `qso` — SDSS DR16 quasar spectra; downloaded via `src/qso_download.py` on first call (small default: 1000 spectra in redshift range 2.75–3.25; configurable via `--max_spectra`, `--z_min`, `--z_max` on `qsos.py`). Requires `astropy`.
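A usage sketch built on the documented entry point above; that it returns an indexable, torch-style dataset of clean samples is our assumption:

```python
import sys
sys.path.append("src")  # or run from a top-level driver, which does this for you

from custom_datasets import get_dataset

ds = get_dataset("two_moons", "./data", seed=42)  # name, data_root, seed
print(len(ds), ds[0])  # assumed: indexable torch-style dataset
```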
To pre-download the QSO cache:
```bash
python scripts/download_qso.py --data-root ./data --max-spectra 1000
```

Top-level drivers add `<repo>/src` to `sys.path` via a `__file__`-relative `sys.path.append`, so they can be invoked from any working directory — no install step needed.
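That bootstrap pattern looks roughly like this (variable names illustrative):

```python
import os
import sys

# Resolve src/ relative to this driver file, so imports work
# regardless of the current working directory.
REPO_ROOT = os.path.dirname(os.path.abspath(__file__))
sys.path.append(os.path.join(REPO_ROOT, "src"))
```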
| Script | Purpose |
|---|---|
| `scsi_synthetic.py` | 2-D synthetic MLP experiments (`two_moons`, `checkerboard`) |
| `scsi_image.py` | Single-GPU training from corrupted images |
| `scsi_distributed.py` | DDP variant via `torchrun` |
| `awgn.py` | Specialization for the additive-Gaussian-noise case |
| `qsos.py` | 1-D quasar-spectra variant; uses `KarrasUnet1D` |
| `train.py`, `sample.py`, `fid_eval.py` | Plain EDM diffusion baseline (clean data only) |
| `fid_eval_interpolants.py`, `fid_eval_awgn.py`, `fid_eval_dps.py` | FID evaluation of trained interpolants / DPS baseline |
| `lpips_eval_interpolants.py`, `lpips_eval_dps.py` | LPIPS / PSNR / SSIM evaluation |
| `clean_interpolants.py` | Produces "cleaned" samples from a trained interpolant |
The SLURM scripts `job-cifar10.sh`, `job-dist.sh`, and `job-sample.sh` serve as canonical invocation examples.
```
src/                      # Library code — not an installable package yet
  custom_datasets.py      # Dataset registry, CorruptedDataset, QSODataset
  forward_maps.py         # Corruption factories (corruption_dict)
  interpolant_utils.py    # SCSInterpolant and variants
  trainer_si.py           # Production training loop (EMA, DDP, checkpointing)
  trainer.py              # Older/simpler loop used by train.py (EDM baseline)
  networks.py, karras_unet*.py, mlps.py
  paths.py                # SCSI_DATA / SCSI_RESULTS resolution
  qso_download.py         # SDSS DR16 spectra downloader
scripts/                  # CLI wrappers (currently just QSO download)
```
Top-level `*.py` files are thin argparse + config wrappers around `src/`.
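The wrapper shape, reconstructed from the quickstart flags above (illustrative only; the real drivers expose many more options):

```python
import argparse

def parse_args():
    p = argparse.ArgumentParser(description="SCSI training driver (sketch)")
    p.add_argument("--dataset", required=True)
    p.add_argument("--corruption", required=True)
    p.add_argument("--corruption_levels", type=float, nargs="+", required=True)
    p.add_argument("--train_steps", type=int, default=5000)
    p.add_argument("--learning_rate", type=float, default=3e-4)
    p.add_argument("--suffix", default="")
    return p.parse_args()
```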
If you use this code, please cite:
```bibtex
@inproceedings{modi2025generative,
  title={Generative Modeling from Black-box Corruptions via Self-Consistent Stochastic Interpolants},
  author={Modi, Chirag and Han, Jiequn and Vanden-Eijnden, Eric and Bruna, Joan},
  booktitle={International Conference on Learning Representations (ICLR)},
  url={https://arxiv.org/abs/2512.10857},
  year={2026}
}
```

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) — see `LICENSE`. This matches the license of the vendored NVIDIA EDM code under `src/` (`networks.py`, `generate.py`, `dnnlib.py`, `torch_utils/`).