POS_Tagger

Design, implement, and train a neural sequence model (RNN, LSTM, GRU, etc.) of your choice to (tokenize and) tag a given sentence with the correct part-of-speech tags. For example, given the input

Mary had a little lamb

your model should output

Mary NOUN

had VERB

a DET

little ADJ

lamb NOUN

Note that the part-of-speech tag is separated from each word by a tab \t character.
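
The output format can be sketched as follows. This is a minimal illustration that assumes the tagger returns a list of (word, tag) pairs; the helper name `format_tagged` is hypothetical and is not taken from the notebook.

```python
def format_tagged(pairs):
    """Render (word, tag) pairs as one tab-separated pair per line."""
    return "\n".join(f"{word}\t{tag}" for word, tag in pairs)


# Hypothetical tagger output for the example sentence above.
tagged = [
    ("Mary", "NOUN"),
    ("had", "VERB"),
    ("a", "DET"),
    ("little", "ADJ"),
    ("lamb", "NOUN"),
]

print(format_tagged(tagged))
```

Joining with an explicit `"\t"` keeps the separator unambiguous even when an editor or terminal renders tabs as spaces.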

Dataset

This project uses the Universal Dependencies (UD) dataset, downloadable here. We recommend the files located at ud-treebanks-v2.11/UD_English-Atis/en_atis-ud-{train,dev,test}.conllu. Use only the first, second, and fourth columns (word index, lowercase word, and POS tag). The recommended data does not include punctuation, so you may filter punctuation out of the input sentence before tagging it. Data for many other languages is available from the same resource. We expect a model trained on at least the English data, but you are free to train on other languages in addition.
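
Reading the three columns and filtering punctuation might look like the sketch below. It assumes the standard tab-separated CoNLL-U layout, with comment lines starting with `#` and a blank line between sentences. The function names `read_conllu` and `strip_punctuation` are hypothetical, and the two-token sample is made up for illustration rather than copied from the dataset.

```python
import string


def read_conllu(lines):
    """Yield each sentence as a list of (index, word, pos_tag) tuples."""
    sentence = []
    for line in lines:
        line = line.rstrip("\n")
        if not line:                     # a blank line ends the sentence
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):         # comment / metadata line
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():        # skip ranges such as 1-2 and nodes such as 1.1
            continue
        # Columns 1, 2 and 4 are at indices 0, 1 and 3.
        sentence.append((int(cols[0]), cols[1], cols[3]))
    if sentence:                         # file may not end with a blank line
        yield sentence


def strip_punctuation(text):
    """Drop ASCII punctuation from an input sentence, then split on whitespace."""
    return text.translate(str.maketrans("", "", string.punctuation)).split()


# Made-up sample in CoNLL-U layout (assumption: not real dataset content).
sample = [
    "# text = show flights",
    "1\tshow\tshow\tVERB\t_\t_\t0\troot\t_\t_",
    "2\tflights\tflight\tNOUN\t_\t_\t1\tobj\t_\t_",
    "",
]

for sent in read_conllu(sample):
    print(sent)
print(strip_punctuation("Mary had a little lamb."))
```

To read a real file, pass an open file handle instead of `sample`, for example one opened on `ud-treebanks-v2.11/UD_English-Atis/en_atis-ud-train.conllu` with `encoding="utf-8"`. Note that `strip_punctuation` also removes apostrophes and hyphens inside words, which may or may not be what you want.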
