Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Boomer pipeline prototype implementation for Mondo ICD10CM mapping #9

Closed
matentzn opened this issue Feb 24, 2022 · 4 comments
Closed
Assignees

Comments

@matentzn
Copy link
Contributor

Implement goal in the Makefile that takes as an input a set of mappings, then

  1. generates ptables
  2. runs boomer
  3. Outputs a message that indicates "the next step" (i.e. which clique to review)
@matentzn
Copy link
Contributor Author

@cmungall can you just roughly jot down the method, e.g. something like:

  1. collect all mappings
  2. (unify to skos? or can the input be mix of skos and owl)
  3. generate individual ptables for each input table
    • (how to provide global priors on mapping set level?)
    • Or: merge everything first into one massive SSSOM file?
  4. run boomer
  5. Look at X (what is X) make a fix, run again

Thanks.

@cmungall
Copy link

cmungall commented Mar 3, 2022

  1. Generate OWL for each source
    • can we flat
    • should use subClassOf where appropriate (I think this only applies to ORDO - transform the part-ofs to is-a)
  2. Aggregate all mappings as SSSOM files
  3. Select subset of ontologies mappings for any given experiment (this may be simply everything to start with)
  4. Merge selected OWL ontologies using robot
  5. Fill a default confidence value for each SSSOM file
    • we can iterate on more sophisticated strategies, just choose 0.8 as a default for now
  6. Extract a ptable from all selected SSSOM files.
    • just use the default sssom-py method for now, no parameter tweaking
    • eventually boomer will take sssom natively accept SSSOM format INCATools/boomer#47
    • I don't think sssompy will make the prefixes table for boomer but this would be good functionality to have
  7. Feed the ptable and the merged ontology to boomer
    • use defaults for now
  8. Explore output

The input to boomer should be a ptable and the combined OWL files of all ontologies. We can iterate on methods to set priors but just to get started, something like 0.85/0.5/0.05/0.5.

@matentzn
Copy link
Contributor Author

matentzn commented Mar 4, 2022

@matentzn matentzn changed the title Implement boomer pipeline Boomer pipeline prototype implementation for Mondo ICD10CM mapping Mar 18, 2022
@matentzn
Copy link
Contributor Author

matentzn commented May 3, 2022

This ticket is now a bit too general, I consider this done - we will replace this now moving forward by more specific issues

@matentzn matentzn closed this as completed May 3, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants