This repository contains RuSentiTweet, a sentiment analysis dataset of 13,392 general domain tweets in Russian, which were created within the paper "RuSentiTweet: A Sentiment Analysis Dataset of General Domain Tweets in Russian". RuSentiTweet was manually annotated (moderate inter-rater agreement) using RuSentiment guidelines into 5 classes: Positive, Neutral, Negative, Speech Act, and Skip. As a source of data, we used Twitter Stream Grab, a historical collection of tweets obtained from the general Twitter API stream.
Citation:
@article{smetanin2022rusetitweet,
title = {RuSentiTweet: A Sentiment Analysis Dataset of General Domain Tweets in Russian},
author = {Sergey Smetanin},
journal = {PeerJ Computer Science},
volume = {8},
pages = {e1039},
year = {2022},
doi = {10.7717/peerj-cs.1039},
publisher = {PeerJ Inc.}
}