Summary: The RBD:ACE2 complex (fully glycosylated) was saved as two monomeric structures (see ./01_PDBs/3_refined_ace2_rbd_RBD_ONLY_cleaned.pdb
and ./01_PDBs/3_refined_ace2_rbd_ACE2_ONLY_cleaned.pdb
). These monomeric structures were then read into tleap
for solvation and parametrization, because the GLYCAM_06j-1 forcefield is not available in OpenMM 7.4.2
. The .inpcrd
and .prmtop
files generated by tleap
were then read into OpenMM 7.4.2
and equilibrated for 10 ns. The output files from equilibration were then used to seed Folding@home simulations.
Glycosylation sites and patterns were determined by Elisa Fadda. Watanbe et al. and Shajahan et al. was used to propose the most likely glycosylation pattern in the RBD and ACE2 respectively.
Pre-equilibrated glycan structures, prepared by Aoife M. Harbison, are stored in the ../representative_glycan_structures
directory.
The glycan structures correspond to the most stable conformers obtained from multi microsecond MD simulations of cumulative sampling, see Harbison et al., 2019.
Glycosylation patterns were determined as:
ACE2
- N53 (~2% FA2, ~15% A3)
- N90 (~20% FA2[3/6]G1)
- N103 (~50% FA2)
- N322 (~1% FA2, ~60% other)
- N432 (~25% non-occupied, otherwise FA2G2 ok)
- N546 (15% A2, or A2[3/6]E1)
- N690 (~15% FA2)
RBD
- N343 (FA2G2)
The refined ACE2 model was obtained from a refined 1r42
crystal structure prepared by Tristan Croll. In PyMol, protein termini were capped with ACE and NME residues. All GlcNAc / NAG residues were present and therefore kept as scaffold from which to place the complex glycan structures. Using the Pair Fitting
wizard in PyMol, the base GlcNac / NAG residues in the appropriate glycan structures (../representative_glycan_structures
) were superimposed with the base GlcNaC / NAG residues in the refined 1r42
structure at each glycosylation site. Once aligned, the base GlcNac / NAG residue in the refined 1r42
structure was removed to leave only the complex glycan at each site.
Once all complex glycans were added the structure was checked for steric clashes. All sites had no major clashes, with the exception of the glycan at N53. Here, the base Fucose clashed with side chains of the protein and as a result it was manually rotated in PyMOL to remove severe clashes.
The 1r42
structure was used for hACE2 because:
1r42
is higher resolution (2.20 Å, whereas6m0j
is 2.45 Å)- The electron density map of
1r42
clearly reveals the NAG orientation at each glycosylated asparagine residue, providing a reliable building block on which to construct more complex glycan structures.
The same procedure as detailed above was followed for the refined RBD at N343 with the FA2G2 glycan. No major clashes were observed between the glycan and protein.
Location: ./01_PDBs/0_RBD_ACE2_complex_from_1r42ref_6m0j_FyllGlycos_alignto6m0jRefined_NOCAPS.pdb
./01_PDBs/0_RBD_ACE2_complex_from_1r42ref_6m0j_FyllGlycos_alignto6m0jRefined_NOCAPS.pdb
was constructed by aligning the glycosylated/refined 6m0j
structure RBD (from step 1) and the glycosylated/refined 1r42
structure ACE2 (from step 1) with the 6m0j
CST structure RBD:ACE2 complex. The 6m0j
CST structure was then deleted. The resulting aligned proteins were then written out to file in PyMol.
After visual inspection of the ACE2:RBD interface it was clear there were severe steric clashes between ACE2 and RBD protein chains. In order to refine the structure ISOLDE was used. For an in-depth discussion of steps taken within the ISOLDE package check ./01_PDBs/README.md
.
The refined complex structure (./01_PDBs/2_refined_ace2_rbd_complex.pdb
) was split into the ACE2 and RBD parts. This was done so that preparation with tleap
(see later) could be performed easily. In each file the order of residues was changed so that each had the following order: protein -> glycans -> ions (if present). Then, TER
cards were added between protein chains and glycan groups. Atoms were renumbered using pdb_reatom
(from PDB Tools).
The final structures are saved in ./01_PDBs
as:
3_refined_ace2_rbd_complex_ACE2_ONLY_cleaned.pdb
3_refined_ace2_rbd_complex_RBD_ONLY_cleaned.pdb
Both 3_refined_ace2_rbd_complex_ACE2_ONLY_cleaned.pdb
and 3_refined_ace2_rbd_complex_RBD_ONLY_cleaned.pdb
were then copied to ./02_run_tleap
.
Location: ./02_run_tleap/
Bonding between residues (such as disulphides and glycans etc) needed to be specified for both ACE2 and the RBD. The bonding patterns are explicitly specified in the rbd_ace2_complex_tleap.in
input file.
Within the rbd_ace2_complex_tleap.in
file the solvateBox
command solvates the system and the addIonsRand
command replaces water molecules with Na+ and Cl- ions.
The AMBER tleap
program does not automatically work out the correct number of ions for a given system / system charge. In order for this to be determined the methodology from OpenMM was used, an example of which is outlined below:
from math import floor
numWaters = 61992
numPositive = 0
numNegative = 0
totalCharge = -20
ionicStrength = 0.15
if totalCharge > 0:
numNegative += totalCharge
else:
numPositive -= totalCharge
numIons = (numWaters - numPositive - numNegative) * ionicStrength / (55.4) # Pure water is about 55.4 molar (depending on temperature)
numPairs = int(floor(numIons + 0.5))
numPositive += numPairs
numNegative += numPairs
The numWaters
variable was determined by running the rbd_ace2_complex_tleap.in
script without adding ions first. The above method gave values for numPositive
(i.e. Na+) and numNegative
(i.e. Cl-) that were then used in rbd_ace2_complex_tleap.in
to add the required number of ions.
The tleap
command was run by:
tleap -s -f rbd_ace2_complex_tleap.in
This produced a fully solvated (with ions) system. The .prmtop
and .inpcrd
files could now be used for minimisation and equilibration.
Once prepared the system was equilibrated here: ./03_equil