Create a independent PDB sanitize step before entering the haddock3 pipeline #143

joaomcteixeira · 2021-11-24T16:39:13Z

After the discussions in #140 and #142. PR #144 implements what is here discussed. The preprocessing steps happen before the haddock3 pipeline when the original input date is copied to the run_dir/data folder. The aim is also to have a CLI that the user can run and correct the PDBs (or dry-run) before submittting.

Done:

The list follows the execution order defined in the process_pdbs function, order matters:

Todo

Residues in the same chain cannot have repeated numbering
If there is a gap in the sequence, the gap must be maintained (from the above list nothing corrects for this, so it should be as is, to be tested)
All models in an ensemble (MODEL) should be equal, that is, same labels.
Add flag to skip the preprocessing step

Probably good to check what our current 2.4 server machinery is doing in terms of input PDB validation

The text was updated successfully, but these errors were encountered:

rvhonorato · 2021-11-24T16:48:43Z

I've added more points, the server also has checks specific to the moleculetypes

joaomcteixeira · 2021-11-25T14:30:22Z

When a PDB has multiple chains, what should be the behaviour? Keep only one chain or homogenize all chains to the same identifier?

amjjbonvin · 2021-11-25T14:43:11Z

Ideally we should offer different options to the user (I guess part of some config file): 1) Select a specific chain and sanitize 2) Keep all chains, shift the numbering to avoid overlap in residue numbering and assign a unique chainID

joaomcteixeira · 2021-12-03T12:15:31Z

Are ligands always given in independent PDBs, or can they be given in the same PDB together with the protein? And if the latter is the case, should HETATM be all at the end of the file, with TER, same chain?

rvhonorato · 2021-12-03T12:22:01Z

They can be either separated or together with any moleculetype actually, before or after the ATOM. Not sure about the effect of the TER record.

joaomcteixeira · 2021-12-03T12:31:00Z

pdb_tidy will add TER if there is an ATOM/HETATM break. If that can't be the case, I need to add something to tidy.

rvhonorato · 2021-12-03T12:40:20Z

Related? haddocking/pdb-tools#101

amjjbonvin · 2021-12-03T13:52:12Z

Sounds fine to me. And again, the ligand might be part of a PDB of a separate PDB on its own, depends on the docking scenario

…

pdb_tidy will add TER if there is an ATOM/HETATM break. If that can't be the case, I need to add something to tidy.

joaomcteixeira assigned joaomcteixeira, amjjbonvin and rvhonorato Nov 24, 2021

joaomcteixeira added the enhancement Enhancing an existing feature of adding a new one label Nov 24, 2021

joaomcteixeira added this to To Do in Features via automation Nov 24, 2021

joaomcteixeira added feature New feature request and removed enhancement Enhancing an existing feature of adding a new one labels Nov 24, 2021

joaomcteixeira added this to the v3.0.0 stable release milestone Nov 24, 2021

joaomcteixeira moved this from To Do to In Progress in Features Nov 24, 2021

joaomcteixeira mentioned this issue Nov 25, 2021

PDB pre-processing #144

Merged

joaomcteixeira linked a pull request Nov 26, 2021 that will close this issue

PDB pre-processing #144

Merged

rvhonorato removed their assignment Feb 1, 2022

joaomcteixeira mentioned this issue Feb 8, 2022

update pdbtools dependencies #310

Merged

joaomcteixeira mentioned this issue Feb 17, 2022

Error running example configs #329

Closed

joaomcteixeira added this to To do in PDB preprocessing via automation May 10, 2022

joaomcteixeira removed this from In Progress in Features May 10, 2022

joaomcteixeira moved this from To do to In progress in PDB preprocessing May 10, 2022

joaomcteixeira added the documentation Improve docs label May 10, 2022

This was referenced Jul 18, 2022

automatic update .tbl when preprocessing PDBs #495

Closed

Implement pdb_shiftres in preprocessing gear #502

Closed

joaomcteixeira closed this as completed in #144 Jul 21, 2022

PDB preprocessing automation moved this from In progress to Done Jul 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create a independent PDB sanitize step before entering the haddock3 pipeline #143

Create a independent PDB sanitize step before entering the haddock3 pipeline #143

joaomcteixeira commented Nov 24, 2021 •

edited

Loading

rvhonorato commented Nov 24, 2021

joaomcteixeira commented Nov 25, 2021

amjjbonvin commented Nov 25, 2021 via email

joaomcteixeira commented Dec 3, 2021

rvhonorato commented Dec 3, 2021

joaomcteixeira commented Dec 3, 2021

rvhonorato commented Dec 3, 2021

amjjbonvin commented Dec 3, 2021 via email

Create a independent PDB sanitize step before entering the haddock3 pipeline #143

Create a independent PDB sanitize step before entering the haddock3 pipeline #143

Comments

joaomcteixeira commented Nov 24, 2021 • edited Loading

rvhonorato commented Nov 24, 2021

joaomcteixeira commented Nov 25, 2021

amjjbonvin commented Nov 25, 2021 via email

joaomcteixeira commented Dec 3, 2021

rvhonorato commented Dec 3, 2021

joaomcteixeira commented Dec 3, 2021

rvhonorato commented Dec 3, 2021

amjjbonvin commented Dec 3, 2021 via email

joaomcteixeira commented Nov 24, 2021 •

edited

Loading