Text mining tumor antigens for cancer immunotherapy research

jakelever/tumorantigens

Tumor Antigens

This project used Kindred to extract mentions of tumor antigens from PubMed abstracts and PubMed Central papers.

The relevant files are described below:

  • wordlistLoader.py - Loads the list of gene names and prepares it for fast searching against sentences
  • findSentences.py - Finds sentences that mention "tumor antigen" (or a spelling variant) and a gene name
  • prepareForLearning.py - Parses the sentences and vectorizes them using Kindred
  • activelyLearn.py - Uses active learning to find ambiguous sentences and request annotation. Continuously builds the final knowledge base as more sentences are annotated.
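
The sentence-filtering and active-learning steps described above can be illustrated with a minimal, standard-library-only sketch. It is not the code in this repository and does not use Kindred: the function names, the regular expression for spelling variants, the example gene list, and the classifier scores are all assumptions made for illustration. The active-learning step is shown as simple uncertainty sampling (pick the sentences whose predicted probability is closest to 0.5), which may differ from what activelyLearn.py actually does.

```python
import re

# Hypothetical pattern: the real list of spellings in findSentences.py is not shown here.
# Matches e.g. "tumor antigen", "tumour antigens", "tumor-associated antigen".
TUMOR_ANTIGEN_PATTERN = re.compile(
    r"\btumou?r[\s-]+(?:associated[\s-]+)?antigens?\b", re.IGNORECASE
)


def build_gene_lookup(gene_names):
    """Lower-case the gene names into a set for constant-time membership checks."""
    return {name.lower() for name in gene_names}


def find_candidate_sentences(sentences, gene_lookup):
    """Yield (sentence, genes) for sentences with an antigen phrase and a gene name."""
    for sentence in sentences:
        if not TUMOR_ANTIGEN_PATTERN.search(sentence):
            continue
        # Naive tokenization: runs of letters, digits, and hyphens.
        tokens = re.findall(r"[A-Za-z0-9-]+", sentence)
        genes = sorted({t for t in tokens if t.lower() in gene_lookup})
        if genes:
            yield sentence, genes


def most_ambiguous(scored_sentences, k=1):
    """Uncertainty sampling: return the k items whose score is closest to 0.5."""
    return sorted(scored_sentences, key=lambda pair: abs(pair[1] - 0.5))[:k]


# Illustrative inputs only; not data from this project.
gene_lookup = build_gene_lookup(["MAGEA3", "ERBB2", "TP53"])
sentences = [
    "MAGEA3 is a tumor antigen expressed in melanoma.",
    "ERBB2 amplification was observed in breast cancer.",
    "The tumour-associated antigen ERBB2 is a vaccine target.",
]
candidates = list(find_candidate_sentences(sentences, gene_lookup))

# Made-up classifier probabilities for three sentence IDs.
scores = [("s1", 0.95), ("s2", 0.52), ("s3", 0.10)]
to_annotate = most_ambiguous(scores, k=1)
```

In this sketch the second sentence is dropped because it contains a gene name but no "tumor antigen" phrase, and the sentence scored 0.52 would be sent for annotation first because the hypothetical classifier is least certain about it.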
