This dataset contains 3456 comment extracted from Instagram social media. These comments are collected from 5 profiles and are in Persian language. Each comment labels with two tags hate and no hate. Persian_HS dataset contains 2224 nohate comments and 1232 hate comments. There is also an equalized dataset contains 2464 comments. we also separates the train and test sets for easier use. The fraction is 80% for train and 20% for test.
Pegah Shams J, Ramin Toosi, Pooya Narimanii
@inproceedings{jey2022hate,
title={Hate Sentiment Recognition System For Persian Language},
author={Jey, Pegah Shams and Hemmati, Arash and Toosi, Ramin and Akhaee, Mohammad Ali},
booktitle={2022 12th International Conference on Computer and Knowledge Engineering (ICCKE)},
pages={517--522},
year={2022},
organization={IEEE}
}