NER

NER is a named entity tagging system that requires minimal linguistic knowledge and can be applied to several target languages without substantial changes. The system is based on the ideas behind Brill's tagger, which keeps it simple. Using supervised machine learning, we construct a series of automata (or transducers) that tag a given text. The final model is composed entirely of automata and tags text in linear time. It was tested on the Spanish data set provided for CoNLL-2002, attaining an overall Fβ=1 measure of 60%. We also present an algorithm for constructing the final transducer, which encodes all the learned contextual rules.
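
To make the idea of Brill-style contextual rules concrete, here is a minimal sketch. It is not this repository's implementation. The `Rule` shape (rewrite one tag when the previous tag matches), the class name `BrillRuleSketch`, and the sample sentence are all assumptions made for illustration, and the tags follow the B-/I-/O scheme of the CoNLL-2002 data. The sketch applies each rule in its own left-to-right pass, so its cost grows with the number of rules times the number of tokens. The project instead encodes the learned rules in a single transducer to reach linear-time tagging, which this sketch does not attempt.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class BrillRuleSketch {

    /** Hypothetical rule shape: rewrite `from` as `to` when the previous tag equals `prevTag`. */
    static final class Rule {
        final String from;
        final String to;
        final String prevTag;

        Rule(String from, String to, String prevTag) {
            this.from = from;
            this.to = to;
            this.prevTag = prevTag;
        }
    }

    /**
     * Applies the rules in order, one full left-to-right pass per rule.
     * A change made at position i is visible when position i + 1 is checked
     * (an assumption of this sketch; other application orders are possible).
     */
    static List<String> applyRules(List<String> initialTags, List<Rule> rules) {
        List<String> tags = new ArrayList<>(initialTags); // leave the input untouched
        for (Rule rule : rules) {
            for (int i = 1; i < tags.size(); i++) {
                boolean tagMatches = tags.get(i).equals(rule.from);
                boolean contextMatches = tags.get(i - 1).equals(rule.prevTag);
                if (tagMatches && contextMatches) {
                    tags.set(i, rule.to);
                }
            }
        }
        return tags;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("Ana", "Torres", "vive", "en", "Madrid");
        // Initial guess, e.g. from a per-word baseline: both name tokens start an entity.
        List<String> initial = Arrays.asList("B-PER", "B-PER", "O", "O", "B-LOC");
        // One contextual rule: a B-PER that directly follows a B-PER continues that entity.
        List<Rule> rules = Arrays.asList(new Rule("B-PER", "I-PER", "B-PER"));

        List<String> tagged = applyRules(initial, rules);
        for (int i = 0; i < tokens.size(); i++) {
            System.out.println(tokens.get(i) + " " + tagged.get(i));
        }
    }
}
```

Running `main` prints one token and tag per line, with `Torres` relabeled from `B-PER` to `I-PER`. In a Brill-style learner, rules like this one are chosen during training because they reduce the errors of the initial tagging the most.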

The following paper describes the theoretical background of this project:

https://arxiv.org/abs/2006.11548

Folders and files:

nbproject
src
.gitignore
README.md
build.xml
esp.testa.txt
esp.testb.txt
esp.train.txt
esp.train_small.txt
example.txt
manifest.mf
model
reglas
resultExample.txt

Repository: dahuerfanov/NER