Skip to content

Part-of-Speech tagger for tweets using the Viterbi Algorithm.

Notifications You must be signed in to change notification settings

dangitstam/twitter-viterbi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Twitter Viterbi

Part-of-Speech tagger for tweets using the Viterbi Algorithm.

Includes a vector-optimized implementation that pre-computes all transition probabilities and iterates through a sequence, calculating the optimal path using NumPy matrix broadcasting in log space.

Getting Started

Create a new environment for the project

conda create --name twitter-viterbi python=3.7

and activate it

conda activate twitter-viterbi

Install requirements (AllenNLP, NumPy, and linting tools)

pip install -r requirements.txt

Test your installation by running the unit tests:

pytest -v -W ignore

Acknowledgements

Special thanks to Andrew Li for helping me understand the implementation of Viterbi with matrix broadcasting.

About

Part-of-Speech tagger for tweets using the Viterbi Algorithm.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages