Training guidelines update
lfoppiano committed Aug 26, 2016
1 parent 6e3ae51 commit 5203a6d
Showing 1 changed file with 2 additions and 3 deletions.
5 changes: 2 additions & 3 deletions grobid-ner/doc/training-guidelines.md
@@ -1,10 +1,9 @@
# Annotation guidelines for Named Entity Recognition

Creating an annotated corpus for named entities is the process of finding the correct named-entity class for each word, based on its context.

Grobid-NER can automatically generate training data from text files ( [Link to Page] ), pre-annotating them with the named entities recognised by the model currently in use.
The annotator's goal is to correct the generated entities by (1) changing them, (2) extending them to neighbouring tokens, or (3) removing them.

### Format
The training data is stored in the [CONLL 2003 format](http://www.cnts.ua.ac.be/conll2003/ner/), which is a two-column, tab-separated file.
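
As an illustration of the two-column layout, here is a minimal Python sketch that reads such a file into sentences. It is not part of Grobid-NER: the function name `parse_two_column` is hypothetical, and the sketch assumes that the first column holds the token, the second column holds the entity class, and blank lines separate sentences. The labels in the sample (`PERSON`, `LOCATION`, `O`) are placeholders, not necessarily the classes used by the model.

```python
from typing import List, Tuple

# A sentence is a list of (token, label) pairs.
Sentence = List[Tuple[str, str]]


def parse_two_column(text: str) -> List[Sentence]:
    """Parse tab-separated 'token<TAB>label' lines into sentences.

    Assumption: blank lines mark sentence boundaries.
    """
    sentences: List[Sentence] = []
    current: Sentence = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            # A blank line closes the current sentence, if there is one.
            if current:
                sentences.append(current)
                current = []
            continue
        parts = line.split("\t")
        if len(parts) != 2:
            raise ValueError(
                f"line {line_number}: expected 2 tab-separated columns, "
                f"found {len(parts)}"
            )
        current.append((parts[0], parts[1]))
    # Keep the last sentence when the text does not end with a blank line.
    if current:
        sentences.append(current)
    return sentences


# Placeholder sample; the labels are illustrative only.
sample = "John\tPERSON\nlives\tO\nin\tO\nParis\tLOCATION\n\nHe\tO\nworks\tO\n"
parsed = parse_two_column(sample)
```

Raising an error on lines that do not have exactly two columns makes it easier to spot rows that were damaged during manual correction, for example a tab replaced by spaces.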