SMILES reading benchmark

I call this a benchmark, but it is primarily a means to identify ambiguities in the spec and highlight cornercases, with the goal of ensuring that SMILES are transferred between different programs without any loss or corruption of information.

This benchmark focuses exclusively on whether different programs can agree on the identity of a molecule as read from a SMILES string, particularly an aromatic SMILES string. For each benchmark SMILES string, the result is the number of implicit hydrogens on each atom of the molecule (in the order in which they appear in the SMILES string).

If you are interested in looking at the results, unzip some of the entries in the results folder and diff them. Note that it only makes sense to compare entries where they are reading the same SMILES string, i.e. ProgramA-reading-ProgramC.txt and ProgramB-reading-ProgramC.txt.

If you are interested in adding results for a new toolkit, unzip everything (keeping the originals*) and create a new script in the scripts directory that does the conversion, first from the benchmark Kekule SMILES to aromatic SMILES, and then counting the number of implicit hydrogens in each of the aromatic SMILES strings.

Note: "gunzip -k" will do this, but is not available on some platforms. Otherwise, "for d in *.gz; do gunzip -c $d > ${d%.smi.gz}.smi; done".

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
1-benchmarks		1-benchmarks
2-aromaticsmiles		2-aromaticsmiles
3-results		3-results
4-stereosmiles		4-stereosmiles
5-results		5-results
scripts		scripts
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

1-benchmarks

1-benchmarks

2-aromaticsmiles

2-aromaticsmiles

3-results

3-results

4-stereosmiles

4-stereosmiles

5-results

5-results

scripts

scripts

LICENSE

LICENSE

README.md

README.md

Repository files navigation

SMILES reading benchmark

About

Releases

Packages

Languages

License

rnaimehaom/smilesreading

Folders and files

Latest commit

History

Repository files navigation

SMILES reading benchmark

About

Resources

License

Stars

Watchers

Forks

Languages