Skip to content
forked from arop/ner-re-pt

Named entity extraction from Portuguese web text

Notifications You must be signed in to change notification settings

sekmet/ner-re-pt

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Named entity extraction from Portuguese web text

My master dissertation on Named entity extraction from Portuguese web text, at FEUP (Faculty of Engineering of University of Porto).

Entity extraction using well-established tools (OpenNLP, Stanford CoreNLP, spaCy and NLTK) for the Portuguese language, and more specifically for the news section in University of Porto Information System - SIGARRA and all its subdomains.

Author: André Ricardo Oliveira Pires

Supervisor: Sérgio Nunes

Co-supervisor: José Devezas

In colaboration with: FEUP InfoLab and INESC TEC

For more information, regarding the developing process, guidelines for each tool, results obtained, resources created (trained NER models and annotated dataset) and more, check wiki.

About

Named entity extraction from Portuguese web text

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 61.8%
  • Shell 26.9%
  • Perl 11.3%