Hidden Markov Model POS Tagger
Python
.gitignore better gitignore Feb 22, 2012
Guesser.py
HMM.py
Helper.py
PennTags.py
README.markdown added more links Feb 23, 2012
Tagger.py
Treebank.py
TreebankCleaner.py finished commenting? Nov 17, 2010
hmm-tagger.py added --clean option to hmm-tagger.py, added README Feb 23, 2012
treebank3_sect2.txt

README.markdown

hmm-tagger

This is a part-of-speech tagger written in Python. It models tagging as a Hidden Markov Model and uses the Viterbi algorithm to find the most likely tag sequence for each sentence. It uses the Natural Language Toolkit and trains on Penn Treebank-tagged text files. It uses ten-fold cross-validation to generate accuracy statistics, comparing its tagged sentences against the gold standard.
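To illustrate the decoding step, here is a minimal sketch of Viterbi decoding over a bigram HMM. It is not taken from `HMM.py` or `Tagger.py`. The function name `viterbi`, its parameters, the probability floor, and the toy probability tables are all assumptions made for illustration, and the real tagger may estimate and smooth its probabilities differently.

```python
import math


def viterbi(words, tags, start_p, trans_p, emit_p, floor=1e-12):
    """Return the most probable tag sequence for `words` under a bigram HMM.

    All arguments are hypothetical: `start_p[tag]`, `trans_p[prev][tag]` and
    `emit_p[tag][word]` are plain dicts of probabilities. Missing entries are
    replaced by `floor` so that unseen events do not produce log(0).
    """
    if not words:
        return []

    def lp(p):
        # Work in log space to avoid underflow on long sentences.
        return math.log(max(p, floor))

    # best[i][t] = (log prob of the best path ending in tag t at word i,
    #               the previous tag on that path)
    best = [{t: (lp(start_p.get(t, 0)) + lp(emit_p[t].get(words[0], 0)), None)
             for t in tags}]

    for word in words[1:]:
        column = {}
        for t in tags:
            # Pick the predecessor tag that maximises the path score.
            score, prev = max(
                (best[-1][p][0] + lp(trans_p[p].get(t, 0)), p) for p in tags
            )
            column[t] = (score + lp(emit_p[t].get(word, 0)), prev)
        best.append(column)

    # Follow the back-pointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for column in reversed(best[1:]):
        tag = column[tag][1]
        path.append(tag)
    path.reverse()
    return path


# Toy model with made-up probabilities, for illustration only.
TAGS = ["DT", "NN", "VB"]
START_P = {"DT": 0.8, "NN": 0.1, "VB": 0.1}
TRANS_P = {
    "DT": {"DT": 0.05, "NN": 0.9, "VB": 0.05},
    "NN": {"DT": 0.05, "NN": 0.15, "VB": 0.8},
    "VB": {"DT": 0.6, "NN": 0.3, "VB": 0.1},
}
EMIT_P = {
    "DT": {"the": 0.9},
    "NN": {"dog": 0.5, "barks": 0.1},
    "VB": {"dog": 0.05, "barks": 0.6},
}

print(viterbi(["the", "dog", "barks"], TAGS, START_P, TRANS_P, EMIT_P))
# → ['DT', 'NN', 'VB']
```

In a trained tagger the three tables would be estimated from tag and word counts in the Treebank training folds rather than written by hand.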

Usage

python hmm-tagger.py [--clean]

Pass the --clean option to clean a Treebank file before running the tagger. Cleaning can be time-consuming, so you can omit the option on later runs.