Getting a consistent set of conformers from EmbedMultipleConfs #9187

DavidACosgrove · 2026-03-19T11:53:47Z

DavidACosgrove
Mar 19, 2026

Hi,

I'm having problems getting a consistent set of conformers from EmbedMultipleConfs. I understand that as a distance geometry algorithm it is subject to the vagaries of random numbers, but I was expecting that if I generated enough conformers I would at least in general get similar ensembles from run to run. This doesn't seem to be the case.

This is a colab notebook showing what I get.
https://colab.research.google.com/drive/1bSV_kz-sVWcDfWAYkZdJg-JFH-iCm9YL?usp=sharing

I start with the PubChem conformation of osimertinib and generate 1000 conformations, pruning at 0.5 RMS. I then find the conformation with the lowest RMS to the input conformation, and also find the largest shape match using the PubChem shape overlay code. I repeat this 5 times, using as a start the conformer that with the best RMS to the previous starting point, apart from in the first round. Thus, after the first round, it is always generating conformations from one it generated on the previous round. As the notebook shows, there isn't a conformation generated that is a close match to the input structure, either by RMS or shape. This is a problem for my use case, which is searching databases of conformations of molecules by shape - the results will vary every time I generate a new database, and I run the risk of compounds not finding themselves because the conformations in the database don't resemble one generated in a different run of the embedding code.

Is there are way of ensuring consistency of conformation ensemble from run to run? Obviously I could set the random number seed each time, but that's just hiding the issue.

Thanks,
Dave

greglandrum · 2026-03-19T13:44:08Z

greglandrum
Mar 19, 2026
Maintainer

It's not an issue. Conformer generation using distance geometry is a stochastic process and the only way to get the same conformer out of it is by specifying the random number seed.

If you generate multiple conformers and do not do RMS pruning, it's possible to generate individual conformers in later runs if you know their IDs. Here's an example of re-generating conformer 487 from 1000:

from rdkit import Chem
from rdkit.Chem import rdDistGeom
m = Chem.AddHs(Chem.MolFromSmiles('C=CC(=O)Nc1cc(Nc2nccc(-c3cn(C)c4ccccc34)n2)c(OC)cc1N(C)CCN(C)C'))
ps = rdDistGeom.KDG()
ps.numThreads = 8
ps.randomSeed = 0xf00d
ps.enableSequentialRandomSeeds = True
rdDistGeom.EmbedMultipleConfs(m,1000,ps)
nmol = Chem.Mol(m)
ps.randomSeed = 0xf00d + 487
rdDistGeom.EmbedMolecule(nmol,ps)
from rdkit.Chem import rdMolAlign
print(rdMolAlign.GetBestRMS(m,nmol,prbId=487))
print(rdMolAlign.GetBestRMS(m,nmol,prbId=486))

The output of this for me is:

0.0
3.1641401703212746

You should get roughly the same result.

Unfortunately, at the moment there's no way to do this if you have done RMS pruning. I think it should be straightforward to add the option to allow it though.

0 replies

DavidACosgrove · 2026-03-19T14:31:15Z

DavidACosgrove
Mar 19, 2026
Author

I don't want to find the exact conformer in the ensemble. I just want to be confident that if I generate 2 ensembles from the same molecule I get roughly the same conformations in each, such that every conformer in ensemble 1 has a conformer in ensemble 2 with a shape tanimoto of >0.95 or so and vice versa. I was hoping that if I generated enough conformations that would be the case, but it appears not to be.

1 reply

greglandrum Mar 19, 2026
Maintainer

I don't want to find the exact conformer in the ensemble. I just want to be confident that if I generate 2 ensembles from the same molecule I get roughly the same conformations in each, such that every conformer in ensemble 1 has a conformer in ensemble 2 with a shape tanimoto of >0.95 or so and vice versa. I was hoping that if I generated enough conformations that would be the case, but it appears not to be.

The only way to guarantee this would be to use molecules that aren't at all flexible (I know this isn't an option!) or use a fixed random seed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Getting a consistent set of conformers from EmbedMultipleConfs #9187

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Getting a consistent set of conformers from EmbedMultipleConfs #9187

Uh oh!

DavidACosgrove Mar 19, 2026

Replies: 2 comments · 1 reply

Uh oh!

greglandrum Mar 19, 2026 Maintainer

Uh oh!

DavidACosgrove Mar 19, 2026 Author

Uh oh!

greglandrum Mar 19, 2026 Maintainer

DavidACosgrove
Mar 19, 2026

Replies: 2 comments 1 reply

greglandrum
Mar 19, 2026
Maintainer

DavidACosgrove
Mar 19, 2026
Author

greglandrum Mar 19, 2026
Maintainer