Part of Speech Tagging using Social Networks
Code for the paper Stylistic Variation in Part of Speech Tagging.
The dataset used in the paper is from here.
Dependencies
- python 3.4 and above (2.7 should also work fine)
- dynet 1.0 (2.0 also works fine)
- scipy
- matplotlib
- emoji
- Glove embeddings
To run the baseline Bi-LSTM Tagger
python ensemble_train.py --no-ensemble
To run just a plain ensemble of Bi-LSTM Tagger
python ensemble_train.py --just-ensemble
To run the Social Attention Tagger using the Follow
network.
python ensemble_train.py --network=follow --num-basis=4
You can run the model using either, Follow
, Mention
or Retweet
networks, with any number of basis models.
To run the Social Attention Tagger using all the three social networks.
python ensemble_train.py --use-all-networks --num-basis=4
Please feel free to contact me if you have any questions on the code or the paper.
If you use this code, please cite our paper:
@inproceedings{balusu2018social,
title = {Stylistic Variation in Social Media Part-of-Speech Tagging},
author = {Murali Raghu Babu Balusu and Taha Merghani and Jacob Eisenstein},
booktitle = {Proceedings of {NAACL} workshop on stylistic variation},
year = {2018},
url = {https://arxiv.org/pdf/1804.07331.pdf},
code = {https://github.com/bmurali1994/socialnets_postagging}
}