Skip to content

The project is about Named-entity recognition which locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages.

Notifications You must be signed in to change notification settings

aftabanjum4451/Named-Entity-Recognition-GMB-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Named Entity Recognition

In Natural Language Processing (NLP), Entity Recognition is one of the common problems. The entity is referred to as the part of the text that one is interested in. In NLP, NER is a method of extracting the relevant information from a large corpus and classifying those entities into predefined categories such as: location, organization, name and so on.

Information about lables:

  • geo--> Geographical Entity

  • org--> Organization

  • per--> Person

  • gpe--> Geopolitical Entity

  • tim--> Time indicator

  • art--> Artifact

  • eve--> Event

  • nat--> Natural Phenomenon

Dataset

(https://www.kaggle.com/abhinavwalia95/entity-annotated-corpus)

Project Highlights

  • Preprocess text data
  • Build and train Flair Model: Bi-directional LSTM and CRF
    • Glove Embedding
    • Stacked Embeddings: Glove, forward and backward Flair embeddings
  • Evaluate our model on the test set
  • Run the model on your own sentences!

How to Run The Code

All the code and comments are listed in the jupyter notbook (NER model file.ipynb)

About

The project is about Named-entity recognition which locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published