EmbedOpt

Robust Inference-Time Steering of Protein Diffusion Models via Embedding Optimization

Optimize trunk embeddings of a frozen protein diffusion model toward any differentiable reward — no retraining required.

Figure: Methodology schematic of EmbedOpt.

Animation: EmbedOpt embedding optimization. Unlike coordinate-space steering (DPS), EmbedOpt shifts the prior itself, yielding smoother, more regularized trajectories.

Overview

EmbedOpt extends Protenix (an AlphaFold3 implementation) with reward-guided sampling at inference time. Rather than modifying diffusion noise or retraining the model, EmbedOpt directly optimizes the trunk embeddings (s_trunk, z_trunk) to maximize a reward — such as cryo-EM density correlation or inter-residue distance satisfaction.

Three sampling strategies are supported:

| Method | Description |
| --- | --- |
| `base` | Unguided sampling; supports loading pre-optimized embeddings |
| `dps` | Diffusion Posterior Sampling: optimizes noisy coordinates using the reward gradient at each diffusion step |
| `embedopt` | Our method: optimizes the embeddings `s_trunk`/`z_trunk` using the reward gradient at each diffusion step |
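To make the difference between the three strategies concrete, here is a minimal toy sketch in NumPy. It is not the EmbedOpt or Protenix API: the linear "denoiser", the quadratic reward, the step sizes, the noise schedule, and the names `run_toy`, `W`, `U` are all assumptions chosen so the gradients can be written by hand. It only illustrates where the reward gradient is applied: to the embedding `s` (`embedopt`), to the noisy coordinates `x_t` (`dps`), or nowhere (`base`).

```python
import numpy as np

def reward(x0_hat, target):
    # Toy reward (assumption): negative squared distance to a target.
    return -float(np.sum((x0_hat - target) ** 2))

def run_toy(method, n_steps=20, seed=0):
    """Toy steering loop. method is 'base', 'dps', or 'embedopt'."""
    rng = np.random.default_rng(seed)
    d, k = 6, 4                               # coordinate and embedding sizes
    W = 0.5 * np.eye(d)                       # frozen weights acting on x_t
    U = rng.normal(size=(d, k))               # frozen weights acting on s
    target = rng.normal(size=d)
    s = rng.normal(size=k)                    # stands in for a trunk embedding
    x_t = rng.normal(size=d)                  # noisy coordinates
    # Conservative step sizes for a quadratic reward (assumption).
    lr_s = 0.25 / np.linalg.norm(U, 2) ** 2
    lr_x = 0.25 / np.linalg.norm(W, 2) ** 2
    sigmas = np.linspace(1.0, 0.0, n_steps + 1)
    history = []                              # (reward before, reward after) per step
    for i in range(n_steps):
        x0_hat = W @ x_t + U @ s              # frozen denoiser prediction
        before = reward(x0_hat, target)
        grad_x0 = -2.0 * (x0_hat - target)    # d(reward)/d(x0_hat)
        if method == "embedopt":
            s = s + lr_s * (U.T @ grad_x0)    # chain rule through U: update embedding
        elif method == "dps":
            x_t = x_t + lr_x * (W.T @ grad_x0)  # chain rule through W: update coords
        x0_hat = W @ x_t + U @ s
        history.append((before, reward(x0_hat, target)))
        # Deterministic denoising step toward the current prediction.
        x_t = x0_hat + (sigmas[i + 1] / sigmas[i]) * (x_t - x0_hat)
    return x_t, s, history
```

Calling `run_toy("embedopt")` returns the final coordinates, the optimized embedding, and the per-step reward before and after the update. In this toy the weights `W` and `U` are never modified, which mirrors the "frozen model" setting described above.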

Installation

pixi install

Prerequisites and details

Install pixi if you haven't already:

curl -fsSL https://pixi.sh/install.sh | bash

Run commands inside the environment without activating a shell:

pixi run python your_script.py

Or activate a persistent shell:

pixi shell

To add dependencies from conda-forge (preferred for compiled/scientific packages):

pixi add numpy openmm

From PyPI:

pixi add --pypi some-package

After adding a dependency, commit both pyproject.toml and pixi.lock so others get the exact same environment.

Examples

Real Cryo-EM Map Tutorial

Step-by-step walkthrough steering EmbedOpt with a real cryo-EM density map (EMD-64136, 3.52 Å, 9UGC_A). The map, reference structure, and sequence are already included in the folder.

→ examples/real_map_tutorial/
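For readers new to density-based rewards, the following is a rough sketch of what a "density correlation" score can look like. It is an illustration under stated assumptions, not the reward implemented in this repository: the isotropic Gaussian atom model, the `sigma` value, the grid layout, and the function names `simulate_density` and `density_correlation` are all hypothetical.

```python
import numpy as np

def simulate_density(coords, grid_axes, sigma=1.5):
    """Sum of isotropic Gaussians centred on each atom (assumed atom model)."""
    gx, gy, gz = np.meshgrid(*grid_axes, indexing="ij")
    grid = np.stack([gx, gy, gz], axis=-1)                    # (X, Y, Z, 3)
    d2 = np.sum((grid[..., None, :] - coords) ** 2, axis=-1)  # (X, Y, Z, N)
    return np.exp(-d2 / (2.0 * sigma ** 2)).sum(axis=-1)      # (X, Y, Z)

def density_correlation(coords, target_map, grid_axes, sigma=1.5):
    """Pearson correlation between a simulated map and a target map."""
    sim = simulate_density(coords, grid_axes, sigma)
    a = sim - sim.mean()
    b = target_map - target_map.mean()
    return float(np.sum(a * b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Usage: a model scores ~1 against its own simulated map, lower when displaced.
rng = np.random.default_rng(0)
axes = (np.linspace(-8.0, 8.0, 17),) * 3
atoms = rng.normal(scale=2.0, size=(10, 3))
target = simulate_density(atoms, axes)
cc_fit = density_correlation(atoms, target, axes)
cc_shifted = density_correlation(atoms + 3.0, target, axes)
```

Because every operation here is smooth in `coords`, a score of this kind can in principle be differentiated with an autodiff framework and used as a steering reward.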

Synthetic Cryo-EM Map Benchmark

Reproduces the synthetic map benchmark from the paper: 77 PDB proteins paired with 5 Å synthetic density maps, comparing embedopt, dps, and base across 8 learning rates. Requires a SLURM cluster and Phenix for map–model validation.

→ examples/synthetic_map_benchmark/

AF Distance Constraint Benchmark

Reproduces the distance-constraint steered diffusion benchmark on the 24-system Distance-AF set, with learning-rate and diffusion-step sweeps. Requires a SLURM cluster.

→ examples/AF_distance_benchmark/
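As a hedged illustration of a differentiable distance-constraint reward, the sketch below penalizes squared deviations from target inter-residue distances and returns the analytic gradient with respect to the coordinates. The functional form, the name `distance_reward`, and the example pairs are assumptions for illustration; the reward used in the benchmark may differ.

```python
import numpy as np

def distance_reward(coords, pairs, target_dists):
    """Return (reward, gradient) for reward = -sum((|x_i - x_j| - d_ij)^2)."""
    i, j = pairs[:, 0], pairs[:, 1]
    diff = coords[i] - coords[j]                  # (P, 3)
    dist = np.linalg.norm(diff, axis=1)           # (P,)
    err = dist - target_dists
    reward = -float(np.sum(err ** 2))
    # d(reward)/d(x_i) = -2 * err * diff / dist; x_j gets the opposite sign.
    g_pair = (-2.0 * err / np.maximum(dist, 1e-8))[:, None] * diff
    grad = np.zeros_like(coords)
    np.add.at(grad, i, g_pair)                    # accumulate, residues may repeat
    np.add.at(grad, j, -g_pair)
    return reward, grad

# Usage: three hypothetical restraints on five residues.
rng = np.random.default_rng(0)
coords = rng.normal(size=(5, 3))
pairs = np.array([[0, 3], [1, 4], [0, 4]])
targets = np.array([3.0, 5.0, 4.0])
r, g = distance_reward(coords, pairs, targets)
```

The gradient `g` has the same shape as `coords`, so it can be passed back through a model to update either noisy coordinates or embeddings.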

Correspondence

Minhuan Li · minhuanli@flatironinstitute.org
Luhuan Wu · luhuanwu0@gmail.com

Citation

If you use EmbedOpt in your work, please cite:

@article{li2026robust,
  title={Robust Inference-Time Steering of Protein Diffusion Models via Embedding Optimization},
  author={Li, Minhuan and Han, Jiequn and Cossio, Pilar and Wu, Luhuan},
  journal={arXiv preprint arXiv:2602.05285},
  year={2026}
}
