Switch branches/tags
Nothing to show
Find file History
Permalink
..
Failed to load latest commit information.
labeled_data.csv Adding data files May 10, 2017
labeled_data.p Adding data files May 10, 2017
readme.md Adjusting formatting of readme May 23, 2017

readme.md

Data

The data are stored as a CSV and as a pickled pandas dataframe (Python 2.7). Each data file contains 5 columns:

count = number of CrowdFlower users who coded each tweet (min is 3, sometimes more users coded a tweet when judgments were determined to be unreliable by CF).

hate_speech = number of CF users who judged the tweet to be hate speech.

offensive_language = number of CF users who judged the tweet to be offensive.

neither = number of CF users who judged the tweet to be neither offensive nor non-offensive.

class = class label for majority of CF users. 0 - hate speech 1 - offensive language 2 - neither