Skip to content

An example use of the Hidden Markov Model in Natural Language Processing: for tagging a corpus.

Notifications You must be signed in to change notification settings

5l1v3r1/hiddenMarkovModel-in-NLP

 
 

Repository files navigation

HMM based tagger

This is a tagger for Natural Languages based on Hidden Markov Model.

Usage

All programs in the package are compiled by issuing the make command in the main directory. Note that the main script does calls make by itself.

The main script is tagger. It should be used as follows:

tagger <body1> [<body2>]

where parameters denote catalogs with language corpuses in the IPI PAN format. It has two modes:

It can be run in two modes:

  • one-parameter - then it performs a 10-fold cross-validation in order to test the HMM tagger and compare it with a simple unigram tagger

  • two-parameter - then it trains the model on the first given body and tags the second body using the created model.

After launching, the program prints all information to the standard output including a progress bar.

About

An example use of the Hidden Markov Model in Natural Language Processing: for tagging a corpus.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 85.3%
  • Shell 13.2%
  • Makefile 1.5%