The Viterbi algorithm is tagging algorithm based on TRIGRAM HIDDEN MARKOV MODELS (TRIGRAM HMMS) To improve upon the Viterbi algorithm,i.e. to handle rare and unseen words I have created 4 new classes of words 'RARE','ALLCAPS','NUMERIC' and 'LASTCAP' in addition to the already present 'O' and 'I-GENE'.Words appearing less than 5 times are classified depending on 'ALLCAPS','NUMERIC' and 'LASTCAP'.If a word appears less than 5 times and does not belong to any of the previous mentioned 3 classes then it is classified as 'RARE'. This creation of different word classes hepls to improve the Viterbi algorithm.
-
Notifications
You must be signed in to change notification settings - Fork 4
deerishi/Viterbi-algorithm
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
The Viterbi algorithm is tagging algorithm based on TRIGRAM HIDDEN MARKOV MODELS (TRIGRAM HMMS)
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published