Find file History
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
..
Failed to load latest commit information.
1
2
3
4
5
GermanNER Adds GermanNER dataset. Jun 23, 2017
Wikiner
conll2002
README.md

README.md

##FOX - Datasets

fine the documentation of Fox here : documentation

  • 1 - news dataset (created manually)
  • 2 - Illinois dataset ([NERWebpagesColumns] (http://cogcomp.cs.illinois.edu/page/resource_view/28))
  • 3 - subset of news dataset
  • 4 - reuters dataset (created manually)
  • 5 - all datasets 1-4
  • conll2002 - the conll2002 dataset
  • GermanNER - dataset build on the GermEval 2014 dataset
  • Wikiner - silver standard datasets in many languages generated from Wikipedia

Number of entities separated according entity types and in total in the datasets. (e.g. 5117 entities of Location in dataset 1)

  • Location

    • 1 - 5117
    • 2 - 114
    • 3 - 341
    • 4 - 146
    • 5 - 5472
  • Organization

    • 1 - 6899
    • 2 - 257
    • 3 - 434
    • 4 - 208
    • 5 - 7467
  • Person

    • 1 - 3899
    • 2 - 396
    • 3 - 254
    • 4 - 91
    • 5 - 4549
  • Total

    • 1 - 15915
    • 2 - 767
    • 3 - 1029
    • 4 - 445
    • 5 - 17488