Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Examples #10

Merged
merged 21 commits into from
Aug 17, 2021
Merged

Examples #10

merged 21 commits into from
Aug 17, 2021

Conversation

t-kimber
Copy link
Contributor

@t-kimber t-kimber commented May 21, 2021

The aim of this PR is to have scripts that are running for featurization schemes and associated machine learning models on the latest ChEMBL (v.28) data set.

Featurizers

Ligand-based

To run these featurizers, type:
(experiments-binding-affinity) $ bash tests/test_featurization_ligand_only.sh

  • Morgan length 1024, radius 2
  • Molecular graph
  • One-hot SMILES

Kinase-informed

Using the full kinase sequence

To run these featurizers, type:
(experiments-binding-affinity) $ bash tests/test_featurization_kinase_informed.sh

  • morgan + hash
  • morgan + composition full sequence
  • smiles + sequence full sequence

Models

Ligand-based

To run these models, type:
(experiments-binding-affinity) $ bash tests/test_model_ligand_only.sh

  • Morgan & Fully connected neural network
  • One-hot SMILES & Convolutional neural network
  • Graph & Graph neural network

Kinase-informed

To run these models, type:
(experiments-binding-affinity) $ bash tests/test_model_kinase_informed.sh

  • morgan + hash & Fully connected neural network
  • morgan + composition & Fully connected neural network
  • smiles + sequence & Convolutional neural network

⚠️ If you get FileExistsError, use the flag --overwrite.

Status

  • Featurization Running

  • Models Running

  • Ready to go

@t-kimber t-kimber requested a review from jaimergp as a code owner May 21, 2021 09:48
@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@t-kimber t-kimber removed the request for review from jaimergp May 21, 2021 12:43
@t-kimber t-kimber removed the request for review from jchodera August 16, 2021 15:02
@schallerdavid
Copy link
Contributor

Think about switching from ChEMBL subset to Kinase subset, e.g. ABL1+EGFR+AURA.

@schallerdavid
Copy link
Contributor

Merge before fixing everything and create separate PRs afterwards.

@t-kimber t-kimber requested review from AndreaVolkamer and schallerdavid and removed request for schallerdavid August 17, 2021 12:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants