EliIE (Eligiblity Criteria Information Extraction)
A parser designed for free text clinical trial eligibility criteria (CTEC).
Parsing free text CTEC and formalizing into OMOP CDM v5 table
The parser was trained on 250 clinical trials on Alzheimer's. The annotation guidelines is in folder Supple Materials.
Developed in Dr. Chunhua Weng's lab in Department of Biomedical Informatics at Columbia
Author: Tian Kang
Affiliation: Department of Biomedical Informatics, Columbia University
Contact Email: email@example.com
Last update: June 20, 2016 (add Negation detection in NER step)
Citation:EliIE: An open-source information extraction system for clinical trial eligibility criteria
- Entity recogntion
- Attribute recognition
- Clinical relation identification
- Data standardization
First download all codes and decompress
- set the parameter lists to your task-based ones
- run "
sh wrapper_for_parsing.sh" and parsing results will be generated in XML files.
(See example output directly running "
sh wrapper_for_parsing.sh" without changing )
- NER step: run
python NamedEntityRecognition.py $1:<input directory> $2:<input text name> $3:<output directory>
- Clinical Relation: run
python Relation.py $3:<output directory> $2:<input text name>
python NamedEntityRecognition.py Tempfile test.txt Tempfile
python Relation.py Tempfile test.txt
The example output would be Tempfile/test_NER.xml and Tempfile/test_Parsed.xml)
This parser assumes MetaMap is installed and requires that the MetaMap support services are running. If you have MetaMap installed in
$MM, these can be started as:
metamap_tag.sh; follow the guidance to change the MetaMap root dir and start running
Easy installation following the instruction:
Functions Under Developing
- Stadardize entities and attributes concepts using OHDSI standards
- Convert the final format into JSON
- Extend use case to more diseases