fix(rf3): fix issues with 5I09 due to chain breaks and add associated tests by k-chrispens · Pull Request #190 · diff-use/sampleworks

k-chrispens · 2026-03-26T22:59:26Z

test: add tests that 5I09 with RF3 currently fails
fix: RF3 now passes tests on 5I09 + other structures

Summary by CodeRabbit

Bug Fixes
- Improved atomic structure validation to enforce atom count consistency in model processing.
Documentation
- Clarified default EDM sampler configuration parameters, including gamma_min value specifications.

coderabbitai · 2026-03-26T22:59:39Z

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 5ffece08-d666-4b60-83f7-6f9f2e3e37df

📥 Commits

Reviewing files that changed from the base of the PR and between 03fec81 and 292d42b.

📒 Files selected for processing (4)

src/sampleworks/core/samplers/edm.py
tests/integration/test_no_guidance_geometry.py
tests/models/test_rf3_atom_ordering.py
tests/resources/5I09/5I09_single_001_density_input.cif

📝 Walkthrough

Walkthrough

The PR updates EDM sampler documentation to clarify gamma_min defaults to 0.2 rather than AF3's 1.0, refactors RF3 wrapper's atom array construction to source from pipeline output with strict validation, and introduces comprehensive test suites for geometry validation and atom ordering regression testing.

Changes

Cohort / File(s)	Summary
EDM Sampler Configuration `src/sampleworks/core/samplers/edm.py`	Updated docstrings and comments for `EDMSamplerConfig` and `AF3EDMSampler` to explicitly document that `gamma_min` defaults to `0.2` (diverging from AF3's `1.0`); added trailing newline.
RF3 Wrapper Atom Array Logic `src/sampleworks/models/rf3/wrapper.py`	Replaced `model_aa` construction from filtered `inference_input.atom_array` with direct sourcing from `pipeline_output["atom_array"]`; added validation to ensure `atom_array` key exists and enforce exact length match against `num_atoms` derived from feature atom mapping, raising `ValueError` on mismatch.
Test Infrastructure `tests/conftest.py`	Added session-scoped pytest fixture `structure_5i09_density` for parsing density-input CIF structure resource.
Integration Tests `tests/integration/test_no_guidance_geometry.py`	New test module validating that unguided EDM sampling produces valid peptide backbone geometry; parametrized across wrapper types and structure fixtures, samples 20 steps with fixed RNG seed, and asserts ≥90% of consecutive backbone bonds fall within distance bounds [1.1, 1.7]Å.
Regression Tests `tests/models/test_rf3_atom_ordering.py`	New test module with three parametrized RF3 regression tests verifying `model_atom_array` length matches feature atom mapping, contains no `OXT` atoms, and contains no hydrogen atoms.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

#182: Modifies RF3 wrapper atom array handling with similar changes to model_atom_array construction and validation.
#140: Updates RF3 wrapper atom array initialization with NaN-coordinate and occupancy handling alongside atom array construction logic.

Suggested reviewers

marcuscollins

Poem

🐰 A hopping cheer for atoms true and ordered right,
Where gamma_min shines bright at 0.2's light,
Pipeline outputs guide the peptide's way,
Geometry tests dance through every day,
Validation strong, no hydrogen stray—robust paths ahead we lay! ✨

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately describes the primary change: fixing RF3 issues with 5I09 (a structure with chain breaks) and adding associated tests.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch kmc/5i09-fixes

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Copilot

Pull request overview

Fixes an RF3 atom ordering/count mismatch observed on 5I09 (chain breaks / terminal atoms) by aligning RF3Wrapper’s model_atom_array with RF3’s own inference pipeline output, and adds regression + integration tests to guard against future misalignment.

Changes:

Add RF3 regression tests asserting model_atom_array length/order matches atom_to_token_map, and that OXT/hydrogens are absent.
Add a no-guidance sampling integration test that checks basic peptide geometry after a short trajectory across wrappers/structures (including 5I09 density).
Update RF3Wrapper.featurize() to source model_atom_array from RF3 pipeline output and require exact atom-count agreement.

Reviewed changes

Copilot reviewed 5 out of 7 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
`tests/models/test_rf3_atom_ordering.py`	New RF3-specific regression tests for atom count/order and filtered atoms (OXT/H).
`tests/integration/test_no_guidance_geometry.py`	New integration test running short no-guidance sampling and validating backbone bond continuity.
`tests/conftest.py`	Adds `structure_5i09_density` session fixture used by new tests.
`src/sampleworks/models/rf3/wrapper.py`	Uses RF3 pipeline’s `atom_array` as `model_atom_array` and enforces exact match with `atom_to_token_map`.
`src/sampleworks/core/samplers/edm.py`	Clarifies that `gamma_min=0.2` differs from AF3 defaults in code comments/docstring.
`pixi.lock`	Dependency lockfile updates (package/build revisions).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

tests/conftest.py (1)
485-487: Add a docstring for the new shared fixture.

This fixture is part of the shared test surface, so it should carry the repo's standard NumPy-style docstring too.
Proposed docstring
 `@pytest.fixture`(scope="session")
 def structure_5i09_density(resources_dir: Path) -> dict:
+    """Return the parsed 5I09 density-input structure.
+
+    Returns
+    -------
+    dict
+        Parsed Atomworks structure for the 5I09 density fixture.
+    """
     return parse(resources_dir / "5I09" / "5I09_single_001_density_input.cif", ccd_mirror_path=None)
As per coding guidelines, "Always include NumPy-style docstrings for every function and class."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/conftest.py` around lines 485 - 487, The new pytest fixture
structure_5i09_density is missing a NumPy-style docstring; add a concise
NumPy-style docstring directly above the def structure_5i09_density(...) that
describes the fixture purpose (returns parsed density structure for 5I09), its
parameters (resources_dir: Path), the return type (dict), and any important
notes (uses parse with ccd_mirror_path=None and the file
"5I09_single_001_density_input.cif"); ensure the docstring follows the repo's
NumPy conventions and is placed inside the tests/conftest.py next to the fixture
definition.
tests/integration/test_no_guidance_geometry.py (1)
79-88: Prefer asserting geometry through the public/reconciled output path.

conditioning.model_atom_array is wrapper-internal state, so this test will fail on harmless internal reordering or reconciliation changes even when the user-visible sampled structure is still correct. Building the AtomArray from the same reconciled output path the product uses will keep this black-box and more stable.

As per coding guidelines, "Write black-box tests that verify behavior, not implementation. Test public interfaces with realistic inputs and avoid mocking."
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@tests/integration/test_no_guidance_geometry.py` around lines 79 - 88, Don't
inspect wrapper-internal state via features.conditioning.model_atom_array;
instead reconstruct the AtomArray through the same public/reconciled output path
the product uses (i.e. run the same postprocessing/reconciliation on state and
obtain the reconciled AtomArray), then set/assert its coordinates (instead of
directly setting output_aa.coord from state) so the test verifies the public
output; update references that currently use
features.conditioning.model_atom_array and output_aa.coord to use the
reconciled-output factory/function that the wrapper exposes (apply the same
transform to state as the runtime does) while keeping wrapper_type and
structure_fixture in the assertion message.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/sampleworks/core/samplers/edm.py`:
- Line 116: Update the EDMSamplerConfig class docstring to stop claiming
defaults match AF3 and explicitly document that gamma_min is overridden to 0.2
(not AF3's 1.0); locate the EDMSamplerConfig docstring and change its wording to
reflect the non-AF3 default for the gamma_min parameter and mention the
overridden value so the inline note at the gamma_min definition and the class
docstring are consistent.

---

Nitpick comments:
In `@tests/conftest.py`:
- Around line 485-487: The new pytest fixture structure_5i09_density is missing
a NumPy-style docstring; add a concise NumPy-style docstring directly above the
def structure_5i09_density(...) that describes the fixture purpose (returns
parsed density structure for 5I09), its parameters (resources_dir: Path), the
return type (dict), and any important notes (uses parse with
ccd_mirror_path=None and the file "5I09_single_001_density_input.cif"); ensure
the docstring follows the repo's NumPy conventions and is placed inside the
tests/conftest.py next to the fixture definition.

In `@tests/integration/test_no_guidance_geometry.py`:
- Around line 79-88: Don't inspect wrapper-internal state via
features.conditioning.model_atom_array; instead reconstruct the AtomArray
through the same public/reconciled output path the product uses (i.e. run the
same postprocessing/reconciliation on state and obtain the reconciled
AtomArray), then set/assert its coordinates (instead of directly setting
output_aa.coord from state) so the test verifies the public output; update
references that currently use features.conditioning.model_atom_array and
output_aa.coord to use the reconciled-output factory/function that the wrapper
exposes (apply the same transform to state as the runtime does) while keeping
wrapper_type and structure_fixture in the assertion message.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: a0ac8e6a-bc2c-4e66-9190-fe008eda9a05

📥 Commits

Reviewing files that changed from the base of the PR and between 488cd6c and e50dde2.

⛔ Files ignored due to path filters (1)

pixi.lock is excluded by !**/*.lock

📒 Files selected for processing (6)

src/sampleworks/core/samplers/edm.py
src/sampleworks/models/rf3/wrapper.py
tests/conftest.py
tests/integration/test_no_guidance_geometry.py
tests/models/test_rf3_atom_ordering.py
tests/resources/5I09/5I09_single_001_density_input.cif

marcuscollins

Take a look at my comments and see what you think. I think there are improvements we could make still, but this is definitely forward progress. (And I need to add tests for some of my residue numbering operations as well...)

marcuscollins · 2026-03-27T00:09:45Z

+        torch.manual_seed(42)
+        state = wrapper.initialize_from_prior(batch_size=1, features=features)
+
+        for i in range(self.NUM_STEPS):


Is there some way to run this without explicitly including this loop here? It would be better to test the actual code path we use during inference.

This is the inference code path, we do a for loop over the schedule (e.g. src/sampleworks/scalers/pure_guidance.py:108)

marcuscollins · 2026-03-27T00:15:09Z

+        # Generous bounds for a 20-step stochastic sample.
+        # Ideal backbone bonds are 1.33–1.52 Å
+        # We're a little more generous because the short trajectory.
+        con_mask = filter_linear_bond_continuity(bb, min_len=1.1, max_len=1.7)


Could we be more strict, say by looking at only the peptide bond?

You can get strict ranges from RDKit, for instance (see the bond length script)

Yeah, I just figure for speed of testing we might not expect all models to pass the most rigorous geometry checks (but 5I09 was failing this before)

marcuscollins · 2026-03-27T01:07:43Z

+            "structure_6ni6_density",
+        ],
+    )
+    def test_no_oxt_atoms_in_model_atom_array(self, rf3_wrapper, structure_fixture, request):


Shouldn't there be one OXT per chain? Does RF3's pipeline actually remove the final terminal oxygen?

There should, but it is removed by most models (except protenix)

As a reference, I presume this is done to standardize all the residues such that a residue token corresponds to a specific set of atoms always. I think protenix does something a bit fancier as their outputs do have OXT, but I don't know the specifics (only that I had to deal with them requiring it)

marcuscollins · 2026-03-27T01:10:16Z

+(``atom_to_token_map``).
+"""
+
+import numpy as np


This maybe doesn't need to be in here for purposes of this PR, but I feel like these tests are still not stringent enough. (I admit there are basically no tests yet for my CIF patching operations, which pose similar risks). We probably should actually check the alignment and make sure the atoms are correct and match each other.

I'll add an issue for this

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/sampleworks/models/protenix/structure_processing.py`:
- Around line 353-369: The current code collapses gaps by using
enumerate(start=1) after filtering, causing sequence offsets to be lost and
misaligning PTM serialization; instead compute 1-indexed sequence positions
relative to the chain subsequence start (like raw_res_id - min_res_id + 1). In
the branch where valid_positions and chain_id are present, capture the raw
res_ids (from chain_array.res_id at starts[:-1]), compute min_res_id =
min(chain_valid), and build pos_res_pairs by pairing (raw_id - min_res_id + 1)
with the corresponding res_name only for raw_id in chain_valid (use the same
res_ids and res_names variables) so pos_res_pairs preserves original residue
offsets for get_sequences() and detect_modifications().

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 71aac233-8564-41ec-91f2-d6776d9a89ba

📥 Commits

Reviewing files that changed from the base of the PR and between e50dde2 and f439c8e.

📒 Files selected for processing (1)

src/sampleworks/models/protenix/structure_processing.py

coderabbitai · 2026-03-27T16:26:04Z

+                        # Filter by valid_positions (res_id membership) before
+                        # assigning sequential positions, so that the 1 indexed
+                        # indices match character positions in the sequence
+                        # string produced by get_sequences()
                        if valid_positions is not None and chain_id in valid_positions:
                            chain_valid = valid_positions[chain_id]
-                            min_valid = min(chain_valid) if chain_valid else 1
-                            valid_seq_positions = {r - min_valid + 1 for r in chain_valid}
-                            pos_res_pairs = [
-                                (p, r) for p, r in pos_res_pairs if p in valid_seq_positions
-                            ]
+                            if hasattr(chain_array, "res_id"):
+                                res_ids = cast(np.ndarray, chain_array.res_id)[starts[:-1]].tolist()
+                                res_names = [
+                                    rn
+                                    for rn, rid in zip(res_names, res_ids, strict=True)
+                                    if rid in chain_valid
+                                ]
+
+                        # 1 indexed positions that correspond to
+                        # character indices in the sequence string.
+                        pos_res_pairs = list(enumerate(res_names, start=1))


⚠️ Potential issue | 🟠 Major

Preserve sequence offsets when renumbering filtered residues.

After the res_id filter, enumerate(start=1) collapses internal gaps. get_sequences() still exports the subsequence from min(chain_valid) to max(chain_valid), so a chain with valid residues {41, 42, 45} has sequence positions 1, 2, 5, not 1, 2, 3. That makes detect_modifications() serialize PTMs onto the wrong residue for broken chains. This should mirror the raw_res_id - min_res_id + 1 mapping already used later for covalent bonds.

💡 Proposed fix

- if valid_positions is not None and chain_id in valid_positions: - chain_valid = valid_positions[chain_id] - if hasattr(chain_array, "res_id"): - res_ids = cast(np.ndarray, chain_array.res_id)[starts[:-1]].tolist() - res_names = [ - rn - for rn, rid in zip(res_names, res_ids, strict=True) - if rid in chain_valid - ] - - # 1 indexed positions that correspond to - # character indices in the sequence string. - pos_res_pairs = list(enumerate(res_names, start=1)) + if valid_positions is not None and chain_id in valid_positions: + chain_valid = valid_positions[chain_id] + if hasattr(chain_array, "res_id"): + res_ids = cast(np.ndarray, chain_array.res_id)[starts[:-1]].tolist() + min_res_id = min(chain_valid) if chain_valid else 1 + pos_res_pairs = [ + (int(rid) - min_res_id + 1, rn) + for rn, rid in zip(res_names, res_ids, strict=True) + if rid in chain_valid + ] + else: + pos_res_pairs = list(enumerate(res_names, start=1)) + else: + pos_res_pairs = list(enumerate(res_names, start=1))

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/sampleworks/models/protenix/structure_processing.py` around lines 353 - 369, The current code collapses gaps by using enumerate(start=1) after filtering, causing sequence offsets to be lost and misaligning PTM serialization; instead compute 1-indexed sequence positions relative to the chain subsequence start (like raw_res_id - min_res_id + 1). In the branch where valid_positions and chain_id are present, capture the raw res_ids (from chain_array.res_id at starts[:-1]), compute min_res_id = min(chain_valid), and build pos_res_pairs by pairing (raw_id - min_res_id + 1) with the corresponding res_name only for raw_id in chain_valid (use the same res_ids and res_names variables) so pos_res_pairs preserves original residue offsets for get_sequences() and detect_modifications().

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@tests/integration/test_no_guidance_geometry.py`:
- Around line 99-102: Replace the EN DASH character in the comment "Ideal
backbone bonds are 1.33–1.52 Å" with a standard hyphen-minus so the range reads
"1.33-1.52 Å"; locate the comment near the call to filter_linear_bond_continuity
(con_mask = filter_linear_bond_continuity(bb, min_len=1.1, max_len=1.7)) and
edit that comment only to use the ASCII hyphen-minus character.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: d6a4394f-9a52-422f-bd79-13ade0bb424e

📥 Commits

Reviewing files that changed from the base of the PR and between f439c8e and 03fec81.

📒 Files selected for processing (3)

src/sampleworks/core/samplers/edm.py
tests/integration/test_no_guidance_geometry.py
tests/models/test_rf3_atom_ordering.py

✅ Files skipped from review due to trivial changes (1)

src/sampleworks/core/samplers/edm.py

🚧 Files skipped from review as they are similar to previous changes (1)

tests/models/test_rf3_atom_ordering.py

…protein chain I accidentally added a stale density_input CIF

k-chrispens added 2 commits March 26, 2026 22:55

test: add tests that 5I09 with RF3 currently fails

f6bd6a4

fix: RF3 now passes tests on 5I09 + other structures

e50dde2

Copilot AI review requested due to automatic review settings March 26, 2026 22:59

k-chrispens had a problem deploying to gpu-testing March 26, 2026 22:59 — with GitHub Actions Failure

Copilot started reviewing on behalf of k-chrispens March 26, 2026 22:59 View session

k-chrispens requested a review from marcuscollins March 26, 2026 23:00

Copilot AI reviewed Mar 26, 2026

View reviewed changes

Comment thread tests/models/test_rf3_atom_ordering.py Outdated

Comment thread tests/integration/test_no_guidance_geometry.py Outdated

Comment thread src/sampleworks/models/rf3/wrapper.py

Comment thread src/sampleworks/models/rf3/wrapper.py

coderabbitai Bot reviewed Mar 26, 2026

View reviewed changes

Comment thread src/sampleworks/core/samplers/edm.py

marcuscollins approved these changes Mar 27, 2026

View reviewed changes

k-chrispens had a problem deploying to gpu-testing March 27, 2026 16:19 — with GitHub Actions Error

coderabbitai Bot reviewed Mar 27, 2026

View reviewed changes

k-chrispens had a problem deploying to gpu-testing March 27, 2026 16:34 — with GitHub Actions Error

coderabbitai Bot reviewed Mar 27, 2026

View reviewed changes

Comment thread tests/integration/test_no_guidance_geometry.py

fix(tests): replace malformed 5I09 test CIF that merged ligands into …

292d42b

…protein chain I accidentally added a stale density_input CIF

k-chrispens mentioned this pull request Mar 27, 2026

More stringent atom ordering tests in tests/models #191

Open

k-chrispens force-pushed the kmc/5i09-fixes branch from 03fec81 to 292d42b Compare March 27, 2026 17:18

k-chrispens had a problem deploying to gpu-testing March 27, 2026 17:18 — with GitHub Actions Failure

k-chrispens merged commit bc044a1 into main Mar 27, 2026
0 of 4 checks passed

k-chrispens deleted the kmc/5i09-fixes branch March 27, 2026 17:18

Conversation

k-chrispens commented Mar 26, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

marcuscollins left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

k-chrispens commented Mar 26, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Mar 26, 2026 •

edited

Loading