PanDDA2 for XChem by rowanwalker96 · Pull Request #330 · DiamondLightSource/python-dlstbx

rowanwalker96 · 2025-11-13T10:01:00Z

Auto parallel PanDDA2 pipeline for XChem/OpenBind.

Runs downstream of dimple and waits until all upstream autoprocessing jobs and their related dimple jobs have completed before triggering. The 'best' dataset is then taken forward for PanDDA processing, based on a user-defined heuristic (by default I/sigI × completeness × number of unique reflections, as in XCE).
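As a rough illustration of the 'best' dataset selection, the sketch below scores each candidate by the product of I/sigI, completeness and number of unique reflections, which is how the default heuristic appears to be defined. The `DatasetStats` container, its field names and `pick_best` are hypothetical and are not taken from the dlstbx code.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass(frozen=True)
class DatasetStats:
    """Merging statistics for one autoprocessing result (hypothetical container)."""

    name: str
    i_over_sigma: float
    completeness: float  # percent
    unique_reflections: int


def default_score(stats: DatasetStats) -> float:
    """Assumed default heuristic: I/sigI x completeness x unique reflections."""
    return stats.i_over_sigma * stats.completeness * stats.unique_reflections


def pick_best(
    candidates: Sequence[DatasetStats],
    score: Callable[[DatasetStats], float] = default_score,
) -> Optional[DatasetStats]:
    """Return the highest-scoring candidate, or None if there are none."""
    if not candidates:
        return None
    return max(candidates, key=score)
```

Passing a different `score` callable is one way a user-defined heuristic could be swapped in without touching the selection logic.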

The pipeline comprises two stages:

1. PanDDA2 hit identification and auto ligand fitting. This cannot begin until a sufficient number of comparator datasets have been collected. Once the number of datasets collected in the auto_model_building directory reaches the prerun-threshold parameter, a SLURM array job is launched, allocating 1 CPU per job. Any dataset that arrives after this point is launched as a single-CPU job.
2. PanDDA2 postrun for collating event and site info. Only one PanDDA2_post job can run at any given time.
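The launch rule for the first stage can be sketched as a small decision function. This only models the decision described above (wait, launch an array job, or launch a single job); `LaunchPlan` and `plan_launch` are hypothetical names, and the sketch says nothing about how the jobs are actually submitted to SLURM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LaunchPlan:
    """What to submit for the current dataset count (hypothetical type)."""

    kind: str  # "wait", "array" or "single"
    n_tasks: int
    cpus_per_task: int = 1


def plan_launch(n_datasets: int, prerun_threshold: int) -> LaunchPlan:
    """Decide what to launch given how many datasets have been collected."""
    if n_datasets < prerun_threshold:
        # Not enough comparator datasets yet: do nothing.
        return LaunchPlan("wait", 0)
    if n_datasets == prerun_threshold:
        # Threshold reached: one array job with one task per dataset.
        return LaunchPlan("array", n_datasets)
    # Threshold already passed: this late arrival runs on its own.
    return LaunchPlan("single", 1)
```

With the example threshold of 300, the 300th dataset would trigger a 300-task array job and the 301st would run as a single job.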

A custom Rhofit pipeline is also triggered after the PanDDA processing step, for high-quality ligand fitting to the PanDDA event maps. This produces a final autobuild of the protein and ligand in each dataset's modelled_structures directory.

The pipeline reads the ligand information for each dataset from the XChem soakDB .sqlite database, generates ligand restraints and automatically creates the XChem directory structure, so that XCE can read the status of the XChem experiment via its "update datasource from filesystem" method.
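A minimal sketch of the ligand lookup step, using only the standard-library `sqlite3` module. The table name `mainTable` and the columns `CrystalName`, `CompoundCode` and `CompoundSMILES` are assumptions made for illustration and should be checked against the real soakDB schema. `read_ligands` is a hypothetical helper, not a function from this pull request.

```python
import sqlite3
from typing import Dict, Tuple


def read_ligands(conn: sqlite3.Connection) -> Dict[str, Tuple[str, str]]:
    """Map crystal name -> (compound code, SMILES), skipping rows without SMILES.

    Table and column names are assumed, not confirmed by the source.
    """
    query = (
        "SELECT CrystalName, CompoundCode, CompoundSMILES FROM mainTable "
        "WHERE CompoundSMILES IS NOT NULL AND CompoundSMILES != ''"
    )
    return {crystal: (code, smiles) for crystal, code, smiles in conn.execute(query)}
```

The connection would typically come from `sqlite3.connect(path_to_soakdb)`; rows without a SMILES string are skipped here because no restraints could be generated for them.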

User options for the autoprocessing can be specified in a .user.yaml file in the labxchem visit directory, but this file is not required. Currently it has minimal options, which will be fleshed out in due course:

data:
  acronym: A71EV2A

autoprocessing:
  pandda:
    prerun-threshold: 300
    heuristic: 'default'
    export: True

  ligand_build: True
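Because the file is optional, one plausible way to consume it is to overlay whatever the user supplies on a set of defaults. The sketch below does this with a recursive merge over plain dictionaries, such as those a YAML parser would return. The values in `DEFAULTS` are simply copied from the example above and are not confirmed to be the pipeline's real defaults; `merge_options` is a hypothetical helper.

```python
import copy
from typing import Any, Dict, Optional

# Illustrative defaults copied from the example file (assumption).
DEFAULTS: Dict[str, Any] = {
    "autoprocessing": {
        "pandda": {"prerun-threshold": 300, "heuristic": "default", "export": True},
        "ligand_build": True,
    }
}


def merge_options(
    defaults: Dict[str, Any], user: Optional[Dict[str, Any]]
) -> Dict[str, Any]:
    """Return a copy of defaults with user values overlaid, recursing into dicts."""
    merged = copy.deepcopy(defaults)
    for key, value in (user or {}).items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_options(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged
```

A missing or empty file then maps to `merge_options(DEFAULTS, None)`, which returns the defaults unchanged.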

Initially, the pipeline will run only on OpenBind visits (lb42888) to allow for a testing phase.

rowanwalker96 added 16 commits August 11, 2025 16:24

Init trigger

c1ab138

Init wrapper

207b35a

Allow for triggering multiple recipes

0ea98af

Minimal approach

baa2029

OB implementation

c95c72f

Get unique ligand info

debd9d1

Fix up

28d5af0

Flesh out trigger

3595abb

Take forward 'best' dataset

477126d

Submit as array job

de7fa09

Clean up

e4b831c

Integrate with XCE

5f5e9fe

Add pandda postrun

fed828c

Add wrapper for PanDDA postrun

abbc441

Incorporate user yaml

678d84d

Integrate postrun

17f8096

rowanwalker96 marked this pull request as draft November 13, 2025 10:01

rowanwalker96 and others added 10 commits November 13, 2025 10:05

Merge branch 'main' into pandda_i04-1

9b6b3eb

Fix trigger merge conflict

7fac756

Make dedicated XChem trigger service

b3a888d

Add rhofit ligandfit wrapper for pandda

410ba5e

Add Trigger_XChem to setup

2f34daf

Fix new trigger setup

59ecf9f

Finish pandda_rhofit

2190f27

Test pipeline

9fa5e60

Fix rhofit command

e82cef6

Add grade2 restraints

a1a752f

rowanwalker96 marked this pull request as ready for review November 24, 2025 16:41

rowanwalker96 added 2 commits November 25, 2025 10:40

Tidy up

e265d79

Tidy wrapper

e8b7cb3

rowanwalker96 merged commit cece8ab into main Nov 25, 2025

rowanwalker96 deleted the pandda_i04-1 branch November 25, 2025 10:51

rowanwalker96 merged 28 commits into main from pandda_i04-1
