-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create a independent PDB sanitize step before entering the haddock3 pipeline #143
Comments
I've added more points, the server also has checks specific to the |
When a PDB has multiple chains, what should be the behaviour? Keep only one chain or homogenize all chains to the same identifier? |
Ideally we should offer different options to the user (I guess part of some config file):
1) Select a specific chain and sanitize
2) Keep all chains, shift the numbering to avoid overlap in residue numbering and assign a unique chainID
|
Are ligands always given in independent PDBs, or can they be given in the same PDB together with the protein? And if the latter is the case, should |
They can be either separated or together with any moleculetype actually, before or after the |
|
Related? haddocking/pdb-tools#101 |
Sounds fine to me.
And again, the ligand might be part of a PDB of a separate PDB on its own, depends on the docking scenario
… pdb_tidy will add TER if there is an ATOM/HETATM break. If that can't be the case, I need to add something to tidy.
|
After the discussions in #140 and #142. PR #144 implements what is here discussed. The
preprocessing
steps happen before the haddock3 pipeline when the original input date is copied to therun_dir/data
folder. The aim is also to have aCLI
that the user can run and correct the PDBs (ordry-run
) before submittting.Done:
The list follows the execution order defined in the
process_pdbs
function, order matters:ANISOU
records can be discarded (usepdb_keepcoord
)REMARK lines are discarded
(usepdb_keepcoord
)altloc
with the highest occupancy (waiting for Improvedpdb_selaltloc
pdb-tools#117 to be merged)occupancy
to1.00
(usespdb_occ
)MSE
residues (selenomethionine) toMET
. Also ifMSE
are defined asHETATM
, make themATOM
.HSD
,HSE
,HIE
,HID
toHIS
and convert them also toATOM
if needed.cns/toppar/*.top
files. Automatically retrieves new residues from those files.ATOM
toHETATM
for residues that are expected to beHETATM
. If the user provides.top
file also residues defined there get converted toHETATM
if needed.HETATM
toATOM
for residues that should beATOM
and are defined asHETATM
(these are natural and modified aminoacids)HETATM
. Accepts user.top
file.ATOM
.AARG
,BARG
) (usespdb_fixinsert
)pdb_reatom
starting at 1)pdb_reres
starting at 1)pdb_tidy
(waiting CorrectMODEL
,END
andENDMDL
inpdb_tidy
pdb-tools#119)Todo
MODEL
) should be equal, that is, same labels.preprocessing
stepProbably good to check what our current 2.4 server machinery is doing in terms of input PDB validation
The text was updated successfully, but these errors were encountered: