-
Notifications
You must be signed in to change notification settings - Fork 0
NYU NLP Homework 3: Implement a Viterbi HMM POS tagger
License
dizys/nyu-nlp-homework-3
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
NYU NLP Homework 3: Implement a Viterbi HMM POS tagger Improved the unknown-word tagging by classify the unknown by common suffixes by Ziyang Zeng (zz2960) Spring 2022 Pre-requisites: - Python 3.8+ How did I handle OOV: I did the last suggested approach, which is combination of treating single appearance words as unknown words and classify them into common suffixes. How to run: `python3 main_zz2960_HW3.py --help` will give you: usage: main_zz2960_HW3.py [-h] [-s STATEFILE] [-o OUTPUTFILE] mode inputfile A Viterbi HMM POS tagger implementation. positional arguments: mode "train" or "tag" inputfile input corpus file optional arguments: -h, --help show this help message and exit -s STATEFILE path for storing/reading trained state file, default is states.pkl -o OUTPUTFILE path for storing tagged output file, default is output.txt Examples: To train: `python3 main_zz2960_HW3.py train data/WSJ_02-21.pos` To tag: `python3 main_zz2960_HW3.py tag data/WSJ_24.words -o output.txt`
About
NYU NLP Homework 3: Implement a Viterbi HMM POS tagger
Topics
Resources
License
Stars
Watchers
Forks
Packages 0
No packages published