Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Set up an automated ML task to predict drug -> drug target edges #43

Open
3 tasks
justaddcoffee opened this issue Nov 11, 2021 · 3 comments
Open
3 tasks
Labels
enhancement New feature or request

Comments

@justaddcoffee
Copy link
Contributor

justaddcoffee commented Nov 11, 2021

Describe the desired behavior

It would be useful for Drug Central to have inferred drug -> drug target edges, which we could formulate as a graph ML link prediction task.

We maybe could/should do this in an automated way, on each build of KG-IDG.

A possible roadmap:

  • Write Jupyter notebook for link prediction task
  • Write some tooling to run this code from Jenkins
  • Write some code to push the results to KGHub

Additional context

Per convo with Tudor et al on IDG just now

@justaddcoffee justaddcoffee added the enhancement New feature or request label Nov 11, 2021
@justaddcoffee
Copy link
Contributor Author

Discussing this more with Harry - some thoughts.

We can use NEAT for this (i.e. spin up a Google Cloud instance from Jenkins, pass off a YAML file to the instance describing the graph ML task, wait for it to complete and upload artifacts to S3).

This strategy has the advantage of:

  • helping us get NEAT updated
  • helping us flesh out the general use case of getting automatic embedding and graph ML working with KGHub and other projects (Monarch, eco-kg, others)

@caufieldjh
Copy link
Collaborator

As per IDG meeting on Jan 20, following drug categories would be good for evaluation:

  • Those targeting GPCRs
  • antihistaminics (H1 blockers)
  • antipsychotics (multi-target)
  • antivirals (no human targets)
  • antiestrogens (mostly ERalpha)

@caufieldjh
Copy link
Collaborator

I don't think we include any viral proteins right now, so we'd expect the antivirals to have few/no human targets, especially because we aren't doing any inference based off sequence or structure.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants