NorSynthClinical

Data and system for family history extraction from a synthetic corpus of Norwegian clinical text. The paper desccribing this work, entitled Iterative development of family history annotation guidelines using a synthetic corpus of clinical text, was presented at LOUHI workshop which is collocated with EMNLP 2018.

The co-authors of the paper are Pål Brekke, Øystein Nytrø, Lilja Øvrelid. The work is funded by BIGMED project.

Requirements

Scikit-learn

Code and data for experiments

The results reported in the paper can be replicated by

Train and test SVM 5-fold cross-validation for entity recognition.

python3 svm_ner.py all_sentences.vert.parse.entity 5

Train and test SVM 5-fold cross-validation for relation extraction.

python3 uio2rel.py pal_annotate all_sentences.vert.parse 5

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
20_Paper_final.pdf		20_Paper_final.pdf
README.md		README.md
all_sentences.vert.entity		all_sentences.vert.entity
all_sentences.vert.parse		all_sentences.vert.parse
all_sentences.vert.parse.entity		all_sentences.vert.parse.entity
interannotator_agreement.py		interannotator_agreement.py
ner_predicted.txt		ner_predicted.txt
svm_ner.py		svm_ner.py
synthetic_data.zip		synthetic_data.zip
uio2rel.py		uio2rel.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NorSynthClinical

Requirements

Code and data for experiments

About

Releases

Packages

Languages

ltgoslo/NorSynthClinical

Folders and files

Latest commit

History

Repository files navigation

NorSynthClinical

Requirements

Code and data for experiments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages