Skip to content

Annotation Best Practices

Huan He edited this page Jul 8, 2022 · 7 revisions

The creation of annotation corpora is a critical task in natural language processing which involves significant human effort. We summarized our experience of corpus annotation process in this repo Annotation Best Practice.

Moreover, we have published a work-in-progress book chapter Chapter 9: Best practices of annotating clinical texts for information extraction tasks to introduce more details.

Corpus annotation

The above figure comes from https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/s12911-020-1072-9/figures/1

More detailed description can be found in our paper:

Clone this wiki locally