Training guidelines update
lfoppiano committed Aug 26, 2016
1 parent 88bd7bd commit 277a40c
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions grobid-ner/doc/training-guidelines.md
Expand Up @@ -22,9 +22,10 @@ During training it's mandatory not to modify the token for any reason. Only the
### Classes
The list of classes, with a set of examples for each, is defined in the [classes page](class-and-senses.md) of this manual.

-### Greedy approach
+### Largest entity mention

-Composed concept should be considered instead of simple concept. Usually extended Named Entities have different classes for example:
+Entities with more than one token can be recognized in different ways (for example, two given tokens could be interpreted as one entity of two tokens or as two entities of one token each).
+The approach chosen in GROBID-NER is to match the largest entity mention. Here are some examples:

1. the token _british_:

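The "largest entity mention" rule introduced in this hunk can be illustrated with a small longest-match sketch. This is not GROBID-NER code and does not reflect how its models work; it only mimics the annotation rule. The function name `largest_mentions`, the lexicon, the example sentence, and the class labels are all hypothetical and chosen for illustration.

```python
def largest_mentions(tokens, lexicon):
    """Return (start, end, label) spans, preferring the longest match.

    `lexicon` maps lowercased token tuples to a class label. It is a
    hypothetical stand-in for an annotator's judgement, not a GROBID-NER API.
    """
    max_len = max((len(entry) for entry in lexicon), default=0)
    spans = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate starting at position i first, then shrink.
        for size in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = tuple(t.lower() for t in tokens[i:i + size])
            if candidate in lexicon:
                spans.append((i, i + size, lexicon[candidate]))
                i += size  # tokens inside the mention are not annotated again
                break
        else:
            i += 1  # no known mention starts at this token
    return spans


# Hypothetical lexicon: shorter entries are nested inside a longer one.
LEXICON = {
    ("york",): "LOCATION",
    ("new", "york"): "LOCATION",
    ("new", "york", "times"): "ORGANISATION",
}

tokens = ["The", "New", "York", "Times", "reported", "from", "York"]
spans = largest_mentions(tokens, LEXICON)
```

Here the three tokens "New York Times" are kept as a single mention instead of being split into "New York" plus a leftover token, while the standalone "York" at the end is still annotated on its own.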