GVAE4Smiles

Grammar Variational Autoencoder for Smiles.

Based off: https://github.com/kanojikajino/grammarVAE Paper: https://arxiv.org/abs/1703.01925

Now includes code to cache one-hot-encodings to disk via h5py and the corresponding data generator. This is needed to cope with training on large datasets e.g. six million SMILES strings sampled from ZincDB. [Not included here as too big for github].

Any SMILES strings need to be filtered to conform to the MAX_LENGTH criterion and the SMILES-Grammar used.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
data		data
.gitignore		.gitignore
EncodeSimilarityAndSLogP.ipynb		EncodeSimilarityAndSLogP.ipynb
GVAE.py		GVAE.py
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
dataTools.py		dataTools.py
doGVAE_CLR.py		doGVAE_CLR.py
doGVAE_FindLR.py		doGVAE_FindLR.py
doGVAE_genHPO.py		doGVAE_genHPO.py
doGVAE_genHPOneRun.py		doGVAE_genHPOneRun.py
doGVAEencode_from.py		doGVAEencode_from.py
environ_win10.yml		environ_win10.yml
filterBadSmiles.py		filterBadSmiles.py
findLR_CLR.py		findLR_CLR.py
generateData.py		generateData.py
smilesG.py		smilesG.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GVAE4Smiles

About

Releases

Packages

Contributors 2

Languages

License

dbkgroup/GVAE4Smiles

Folders and files

Latest commit

History

Repository files navigation

GVAE4Smiles

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages