Skip to content

Commit

Permalink
Correcting training data :-/
Browse files Browse the repository at this point in the history
Adding more examples in classes and senses
  • Loading branch information
lfoppiano committed Aug 29, 2016
1 parent 6379d99 commit a01d6e0
Show file tree
Hide file tree
Showing 2 changed files with 384 additions and 384 deletions.
20 changes: 10 additions & 10 deletions grobid-ner/doc/class-and-senses.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,28 +11,28 @@ The following table describes the 26 named entity classes produced by the model.
| PERSON | first, middle, last names and aliases of people and fictional characters | _John Smith_ |
| PERSON_TYPE | person type or role classified according to group membership | _African-American_, _Asian_ |
| TITLE | personal title or honorific | _Mr._, _Dr._, _General_ |
| LOCATION | physical location | _Los Angeles_ |
| LOCATION | physical location | _Los Angeles_, _Northern Madagascar_, _Southern Thailand_, _Channel Islands_ |
| ORGANISATION | organized group of people | _Alcoholics Anonymous_ |
| EVENT | event | _World War 2_, _Battle of France_ |
| ARTIFACT | human-made object | _FIAT 634_ |
| ACRONYM | acronym | _SAME_ (Sequence As Mode-No Existing), _NORM_ (Naturally Occurring Radioactive Material)|
| BUSINESS | company / commercial organisation | _Air Canada_ |
| INSTITUTION | organization of people and a location or structure that share the same name | _Yale University_, the _European Patent Office_ |
| ACRONYM | acronym | _SAME_ (Sequence As Mode-No Existing), _NORM_ (Naturally Occurring Radioactive Material), _WW1_ (World War 1)|
| BUSINESS | company / commercial organisation | _Air Canada_, _Microsoft_ |
| INSTITUTION | organization of people and a location or structure that share the same name | _Yale University_, the _European Patent Office_, _The british government_ |
| MEASURE | numerical amount including an optional unit of measure | _1,500_ |
| PERIOD | date, historical era or other time period | _January, the 2nd 2010_, _1985-1989_ |
| NATIONAL | relating to a location | _North American_, _German_, _Britain_ |
| WEBSITE | website URL or name | _Wikipedia_, http://www.inria.fr |
| ANIMAL | individual name of an animal | _Hachikō_, _Jappeloup_ |
| CREATION | artistic creation, such as song, movie, etc. | |
| CREATION | artistic creation, such as song, movie, etc. | _Monna Lisa_, _Mullaholland drive_ |
| IDENTIFIER | systematized identifier such as phone number, email address, ISBN | |
| AWARD | award for art, science, sport, etc. | |
| AWARD | award for art, science, sport, etc. | _Balon d'or_, _Nobel prize_|
| MEDIA | media organization or publication | |
| SUBSTANCE | natural substance | |
| SUBSTANCE | natural substance | |
| PLANT | name of a plant | _Ficus religiosa_ |
| SPORT_TEAM | sport group or organisation | |
| INSTALLATION | structure built by humans | _Strasbourg Cathedral_ |
| SPORT_TEAM | sport group or organisation | _The Yankees_ |
| INSTALLATION | structure built by humans | _Strasbourg Cathedral_, _Sforza Castle_ |
| CONCEPT | abstract concept not included in another class | _English_ (as language) |
| CONCEPTUAL | entity relating to a concept | _Greek_ myths |
| CONCEPTUAL | entity relating to a concept | _Greek_ myths, _European Union membership_ |
| UNKNOWN | entity not belonging to any previous classes| |
## Conventions
Expand Down

0 comments on commit a01d6e0

Please sign in to comment.