Sanitizing SMILES removes chirality information #94

tianyu-lu · 2022-05-17T15:57:12Z

On this line of sample_space(), chirality information of origin_smiles is removed. The output is then unsuitable as input to a chirality-aware ML model, e.g. to distinguish L vs. D amino acids which are important in models of binding affinity. Could the option to skip this sanitization step be provided to the user?

PS: Great code base and beautiful visualizations! We're finding it very useful in explaining our Gaussian Process models. The future of SAR ←→ ML looks exciting.

The text was updated successfully, but these errors were encountered:

whitead · 2022-05-17T17:58:40Z

Hi @tianyu-lu. Yup good find. I think we need to split that function to be either canonicalize or sanitize. Will fix this shortly.

whitead · 2022-05-18T18:44:42Z

@tianyu-lu check-out the latest pre-release on pip, which should fix your problem.

whitead mentioned this issue May 17, 2022

Stopped stripping chirality in sanitize #97

Merged

whitead closed this as completed in #97 May 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sanitizing SMILES removes chirality information #94

Sanitizing SMILES removes chirality information #94

tianyu-lu commented May 17, 2022

whitead commented May 17, 2022

whitead commented May 18, 2022

Sanitizing SMILES removes chirality information #94

Sanitizing SMILES removes chirality information #94

Comments

tianyu-lu commented May 17, 2022

whitead commented May 17, 2022

whitead commented May 18, 2022