Skip to content

Commit

Permalink
Corrected stats.
Browse files Browse the repository at this point in the history
  • Loading branch information
czi-sunil committed Nov 21, 2018
1 parent 9c18dfa commit 1f0d762
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions full/ReadMe.md
Expand Up @@ -24,11 +24,11 @@ Description | Stat | avg
Number of Concepts in UMLS 2017-AA Active | 3,271,124
Number of Semantic Types (incl. UnknownType) | 128
Number of Annotated Docs in MedMentions | 4,392
Total number of Mentioned Concepts | 34,728 | (1.06% of UMLS)
Total number of Mentions in MedMentions | 352,594 | (80.3 / doc)
Total number of Mentioned Concepts | 34,724 | (1.06% of UMLS)
Total number of Mentions in MedMentions | 352,496 | (80.3 / doc)
Total Number of of Tokens (PTB via StanfordNLP) | 1,176,058 | (267.8 / doc)
Number of Annotated Tokens | 493,908 | (112.5 / doc)
Proportion of tokens annotated | 42.0% | (1.4 / mention)
Number of Annotated Tokens | 579,839 | (132.0 / doc)
Proportion of tokens annotated | 49.3% | (1.6 / mention)

As a comparison, the [BioCreative V Chemical-Disease Relation Task Corpus (BC5-CDR)](http://www.biocreative.org/resources/biocreative-v/proceedings-biocreative5/)
is a smaller set of 1,500 papers annotated only with Chemical and Disease entity mentions
Expand Down

0 comments on commit 1f0d762

Please sign in to comment.