- The dataset can be found in the
jsonl
format in the folder. - It contains three columns:
id
,label
, andsubreddit
id
indicates the unique comment id that can be used to get the text through the Reddit API.label
indicates the final label assigned to each of the textsubreddit
indicates which neighborhood/city the data is from- In case the full dataset with the text is required please email the corresponding author of the paper at mac9908@rit.edu
- The code and configuration parameters for training and experiments are in the notebook.
- If you use the dataset or code please cite the paper.