A corpus of 7,800 vulgar tweets with annotation for vulgar function on a per-vulgar-token basis.
README.md
all_data.tsv
dev.tsv
test.tsv
train.tsv
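
As a rough illustration of how the split files listed above might be read, here is a minimal sketch using only the Python standard library. The column layout is not documented in this README, so the sketch returns raw rows and makes no assumption about column names or whether a header row is present. The helper names `load_tsv` and `load_splits`, the UTF-8 encoding, and the choice to treat quote characters literally are assumptions for illustration, not part of the released data.

```python
import csv
from pathlib import Path


def load_tsv(path):
    """Read a tab-separated file into a list of rows (each a list of strings)."""
    with open(path, newline="", encoding="utf-8") as f:
        # Assumption: fields are not quoted, so stray quote characters
        # inside tweet text are kept as literal characters.
        return list(csv.reader(f, delimiter="\t", quoting=csv.QUOTE_NONE))


def load_splits(data_dir="."):
    """Load whichever of train.tsv, dev.tsv, and test.tsv exist in data_dir."""
    splits = {}
    for name in ("train", "dev", "test"):
        path = Path(data_dir) / f"{name}.tsv"
        if path.exists():
            splits[name] = load_tsv(path)
    return splits


if __name__ == "__main__":
    for name, rows in load_splits().items():
        print(name, len(rows), "rows")
```

Inspecting the first row of each split is a reasonable first step to find out what the columns contain before building anything on top of them.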


Why Swear? Analyzing and Inferring the Functions of Vulgar Expressions

Citation:

@InProceedings{holgate2018vulgar,
  author    = {Holgate, Eric  and  Cachola, Isabel  and  Preo\c{t}iuc-Pietro, Daniel  and  Li, Junyi Jessy},
  title     = {Why Swear? Analyzing and Inferring the Functions of Vulgar Expressions},
  booktitle = {Conference on Empirical Methods in Natural Language Processing (EMNLP)},
  year      = {2018},
  pages     = {to appear},
}

Use of the data presented here must abide by the Twitter Terms of Service and Developer Policy.