Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Coding: Creating Mapping Testing Pipeline #19

Open
3 tasks done
callahantiff opened this issue Feb 2, 2020 · 2 comments
Open
3 tasks done

Coding: Creating Mapping Testing Pipeline #19

callahantiff opened this issue Feb 2, 2020 · 2 comments
Assignees
Labels
coding release v2.0 work related to v2.0

Comments

@callahantiff
Copy link
Owner

callahantiff commented Feb 2, 2020

GOALS: Create a pipeline that can be used to extend pediatric clinical-concept mappings to a new data source.

Workflow Update: @SteeleRobert has agreed to help with creating this pipeline.


Background:
We have created a large set of mappings from clinical diagnoses (n=29,128), medications (n=9,175 unique medications or 1,693 unique ingredients), and measurements (n=2,703 unique measurement results) to open biomedical ontologies.


TODO:


General Guidelines:

  • Build scripts in an object-oriented framework
    • Sketch architecture before building code
    • General parsing class with subclasses by clinical type (i.e. conditions, measurements, medications)
  • Test-driven development
  • Needs to be written using keras and TensorFlow
  • Inputs: a list of clinical codes, a list of ontologies

NEXT STEPS:

  • Discuss this issue
  • Agree on roles and authorship
  • Discuss plan for moving forward, starting with discussions of architecture, prior to beginning coding

@SteeleRobert - are you good with this plan?

@callahantiff callahantiff self-assigned this Feb 2, 2020
@callahantiff callahantiff added this to Needed Coding in Coding Tasks via automation Feb 2, 2020
@SteeleRobert
Copy link
Collaborator

I was looking to this paper for information on multilabel classification. https://pdfs.semanticscholar.org/686d/f0720dc54c476dcb4d69584ef3c530d85f7f.pdf
I think where we can improve upon this is by trying to use the ontologies to improve clustering and ranking.

@callahantiff
Copy link
Owner Author

I was looking to this paper for information on multilabel classification. https://pdfs.semanticscholar.org/686d/f0720dc54c476dcb4d69584ef3c530d85f7f.pdf
I think where we can improve upon this is by trying to use the ontologies to improve clustering and ranking.

This looks really promising! I will make sure to read it prior to our meeting Friday so we can discuss it. Great find!

@callahantiff callahantiff added the release v2.0 work related to v2.0 label Sep 20, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
coding release v2.0 work related to v2.0
Projects
Coding Tasks
  
Needed Coding
Development

No branches or pull requests

2 participants