Few-shot Learning for Low-Data Drug Discovery

This repository contains implementations of the following machine learning models:

Random Forests
Graph Convolutional Networks
Siamese Networks
Matching Networks
Prototypical Networks
Relation Networks

The last three networks (Matching, Prototypical and Relation Networks) also include our implementation of the iterative refinement LSTM from the paper Low Data Drug Discovery with One-Shot Learning.

The Jupyter notebooks are run on Google Colab with Google Drive mounted. Before uploading the repository to Google Drive, run python create_dirs.py. The script creates an empty directory for each technique, which stores the outputs of the corresponding Colab notebook. The experiments that use ECFP rather than GCNs can be run on Tox21 data with the Prototypical Nets Tox21 ECFP.ipynb notebook.
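As a rough illustration of the directory-creation step, the sketch below creates one empty output directory per technique. It is not the contents of create_dirs.py: the function name create_output_dirs and the directory names in TECHNIQUES are assumptions made for this example, and the real script may use different names or a different layout.

```python
from pathlib import Path

# Hypothetical directory names, one per technique listed above.
# The names actually used by create_dirs.py may differ.
TECHNIQUES = [
    "random_forest",
    "gcn",
    "siamese_nets",
    "matching_nets",
    "prototypical_nets",
    "relation_nets",
]


def create_output_dirs(root, names=TECHNIQUES):
    """Create one (initially empty) output directory per technique under root."""
    created = []
    for name in names:
        path = Path(root) / name
        # exist_ok=True makes the call safe to repeat on an existing checkout.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Calling create_output_dirs(".") from the repository root would create the directories next to the notebooks. Running it a second time leaves existing directories untouched.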

Tox21

The dataset was obtained in CSV format from the DeepChem AWS bucket. Accessed from: https://deepchemdata.s3-us-west-1.amazonaws.com/datasets/tox21.csv.gz. Last accessed: 08 Nov 2021.
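A minimal sketch of how a gzip-compressed CSV such as this one could be downloaded and read, using only the Python standard library. The helper names download and read_gzipped_csv are assumptions made for this example and are not taken from the notebooks, which may load the data differently (for instance with Pandas or DeepChem).

```python
import csv
import gzip
import urllib.request

# URL quoted in this README.
TOX21_URL = "https://deepchemdata.s3-us-west-1.amazonaws.com/datasets/tox21.csv.gz"


def download(url, dest):
    """Download url to the local path dest and return dest (needs network access)."""
    urllib.request.urlretrieve(url, dest)
    return dest


def read_gzipped_csv(path):
    """Read a gzip-compressed CSV file into a list of dicts keyed by column name."""
    # mode="rt" decompresses on the fly and yields text lines for the csv module.
    with gzip.open(path, mode="rt", newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))
```

A typical call would be rows = read_gzipped_csv(download(TOX21_URL, "tox21.csv.gz")). The same helper should work for any other gzip-compressed CSV file.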

MUV

The dataset was obtained in CSV format from the DeepChem AWS bucket. Accessed from: https://deepchemdata.s3-us-west-1.amazonaws.com/datasets/muv.csv.gz. Last accessed: 08 Nov 2021.

Database of Useful (Docking) Decoys — Enhanced (DUD-E)

The data for the GPCR subset was obtained directly from the DUD-E website. Accessed from: http://dude.docking.org/subsets. Last accessed: 08 Nov 2021. The actives and decoys for the targets within the DUD-E subsets are provided as separate SMILES files. These files are loaded using the Pandas library and aggregated into a single CSV file containing all the actives and decoys for the GPCR subset.
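The aggregation step can be sketched as follows. This is not the code in dude_create_csv.py: to stay dependency-free, the sketch uses the standard library csv module instead of Pandas. It also assumes a hypothetical layout with one directory per target, each holding actives_final.ism and decoys_final.ism files whose lines start with a SMILES string followed by whitespace-separated identifiers. The function names and output column names are likewise assumptions.

```python
import csv
from pathlib import Path


def read_smiles(path):
    """Return the first whitespace-separated field (the SMILES) of each non-empty line."""
    smiles = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            fields = line.split()
            if fields:
                smiles.append(fields[0])
    return smiles


def aggregate_subset(subset_dir, out_csv):
    """Write one CSV row per molecule: SMILES, target name, 1 (active) or 0 (decoy)."""
    rows = []
    # Assumed layout: subset_dir/<target>/actives_final.ism and decoys_final.ism
    for target_dir in sorted(Path(subset_dir).iterdir()):
        if not target_dir.is_dir():
            continue
        for filename, label in (("actives_final.ism", 1), ("decoys_final.ism", 0)):
            path = target_dir / filename
            if path.exists():
                rows.extend((s, target_dir.name, label) for s in read_smiles(path))
    with open(out_csv, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["smiles", "target", "active"])
        writer.writerows(rows)
    return len(rows)
```

Keeping the target name in each row makes it possible to split the aggregated file by target later, for example when holding out some targets for testing.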

