This repository has been archived by the owner on Jan 13, 2023. It is now read-only.

Home


Honghan Wu edited this page Nov 17, 2017 · 21 revisions

Welcome to the SemEHR wiki!

Run SemEHR pipeline

A typical SemEHR process contains the following steps:

1. Query a database to get the documents for processing.
2. Run NLP processing (e.g., using bio-yodie to annotate UMLS concepts).
3. Index the contextualised concepts into an Elasticsearch instance.
4. Do patient-centric indexing to integrate all of a patient's documents and annotations.
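
To make the flow of the four steps concrete, here is a minimal in-memory sketch. It is not SemEHR's code: every function name, the `LEXICON` dictionary, and the concept IDs are hypothetical placeholders, the database is replaced by a list of dicts, the NLP step by naive substring matching, and the Elasticsearch index by plain Python dicts.

```python
from collections import defaultdict

# Hypothetical toy lexicon: surface term -> placeholder concept ID (not real UMLS CUIs).
LEXICON = {"diabetes": "CONCEPT_DIABETES", "asthma": "CONCEPT_ASTHMA"}


def fetch_documents(rows):
    """Step 1: stand-in for the database query; yields (doc_id, patient_id, text)."""
    for row in rows:
        yield row["doc_id"], row["patient_id"], row["text"]


def annotate(text, lexicon=LEXICON):
    """Step 2: naive substring matching, standing in for a real NLP annotator."""
    lowered = text.lower()
    return sorted(concept for term, concept in lexicon.items() if term in lowered)


def index_documents(docs):
    """Step 3: build one record per document (a real run would send these to Elasticsearch)."""
    return {
        doc_id: {"patient_id": patient_id, "concepts": annotate(text)}
        for doc_id, patient_id, text in docs
    }


def index_patients(doc_index):
    """Step 4: group document IDs and concepts by patient."""
    patients = defaultdict(lambda: {"docs": [], "concepts": set()})
    for doc_id, record in doc_index.items():
        entry = patients[record["patient_id"]]
        entry["docs"].append(doc_id)
        entry["concepts"].update(record["concepts"])
    return dict(patients)


def run_pipeline(rows):
    """Chain the four steps in order."""
    return index_patients(index_documents(fetch_documents(rows)))


if __name__ == "__main__":
    sample_rows = [
        {"doc_id": "d1", "patient_id": "p1", "text": "History of diabetes."},
        {"doc_id": "d2", "patient_id": "p1", "text": "Asthma reviewed."},
    ]
    print(run_pipeline(sample_rows))
```

The substring matcher ignores context such as negation ("no diabetes"), which is exactly what the contextualised annotations produced by a real NLP step are meant to capture.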

The easiest way to run the process is:

0. (Do this only ONCE) Initialise the SemEHR index using the mapping file.
1. Set up the database view from which SemEHR will pull documents.
2. Edit the process configuration file using this template.
3. Run the script: python semehr_processor.py PATH_TO_YOUR_CONFIGURATION
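
If the run step is launched from another Python program (for example a scheduled job), a small wrapper can fail early when the configuration path is wrong. This is only a sketch: `build_command` and `run_processor` are hypothetical helper names, and it assumes `semehr_processor.py` is in the current working directory.

```python
import subprocess
import sys
from pathlib import Path


def build_command(config_path, script="semehr_processor.py"):
    """Return the argv list for running the processor with the given configuration."""
    config = Path(config_path)
    if not config.is_file():
        raise FileNotFoundError(f"configuration file not found: {config}")
    # Use the current interpreter so the script runs in the same environment.
    return [sys.executable, script, str(config)]


def run_processor(config_path):
    """Run the processor; raises CalledProcessError on a non-zero exit code."""
    subprocess.run(build_command(config_path), check=True)
```

Calling `run_processor("PATH_TO_YOUR_CONFIGURATION")` is then equivalent to the command line shown in the last step.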

Useful Links

SQL Server connection setup on Linux

Troubleshooting

If no concepts are indexed for patients, double-check the index mapping to make sure the mappings match those defined in the script.
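
One way to do this check is to fetch the live mapping through Elasticsearch's `GET /<index>/_mapping` endpoint and compare it with the mapping you expect. The sketch below is an assumption-laden illustration, not part of SemEHR: the function names are hypothetical, and the Elasticsearch URL, index name, and expected mapping must come from your own setup.

```python
import json
from urllib.request import urlopen


def fetch_mapping(es_url, index):
    """Fetch the live mapping via GET /<index>/_mapping and parse the JSON reply."""
    with urlopen(f"{es_url}/{index}/_mapping") as response:
        return json.load(response)


def mapping_differences(expected, actual, path=""):
    """List paths where `actual` lacks or contradicts what `expected` defines.

    Keys that exist only in `actual` are ignored, since Elasticsearch may add
    fields dynamically.
    """
    problems = []
    for key, value in expected.items():
        here = f"{path}/{key}"
        if not isinstance(actual, dict) or key not in actual:
            problems.append(f"missing: {here}")
        elif isinstance(value, dict):
            # Recurse into nested mapping sections.
            problems.extend(mapping_differences(value, actual[key], here))
        elif actual[key] != value:
            problems.append(
                f"differs: {here} (expected {value!r}, found {actual[key]!r})"
            )
    return problems
```

The reply from `_mapping` is wrapped under the index name, and its inner layout depends on the Elasticsearch version, so pass the matching sub-dictionary of the reply as `actual`. An empty result list means everything in the expected mapping is present and identical.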
