Skip to content
PGxCorpus, a manually annotated corpus, designed for the extraction of pharmacogenomic relations from text.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
baseline_experiment
LICENSE.txt
PGxCorpus.tar
PGxCorpus_pubtator.tar
README.md
annotation_guidelines.pdf

README.md

PGxCorpus

PGxCorpus is a manually annotated corpus, designed for the extraction of pharmacogenomic realtions from text. It is composed of 945 sentences mannually annotated, issued from 911 distinct PubMed abstracts. Annotation has been achieved by 11 annotators, including 5 senior annotators. Each sentence has been seen independently by 2 annotators, in a first phase, and by a third senior annotator, in a second phase.

Annotation guidelines

The annotation guidelines were provided to the annotators to reduced the heterogeneity in the annotation task.

Source code of a baseline experiment

The source code of the baseline experiment reported in [1], is available in ./baseline_experiment/

In preparation.

License

PGxCorpus is under Creative Commons BY NC 4.0.

References

  1. in preparation

Acknowledgments

PGxCorpus is supported by the PractiKPharma project (http://practikpharma.loria.fr/), funded by the French National Research Agency (ANR) under grant ANR-15-CE23-0028.

You can’t perform that action at this time.