Skip to content

Latest commit

 

History

History
17 lines (15 loc) · 1.03 KB

README.md

File metadata and controls

17 lines (15 loc) · 1.03 KB

RuSentiTweet: A Sentiment Analysis Dataset of General Domain Tweets in Russian

This repository contains RuSentiTweet, a sentiment analysis dataset of 13,392 general domain tweets in Russian, which were created within the paper "RuSentiTweet: A Sentiment Analysis Dataset of General Domain Tweets in Russian". RuSentiTweet was manually annotated (moderate inter-rater agreement) using RuSentiment guidelines into 5 classes: Positive, Neutral, Negative, Speech Act, and Skip. As a source of data, we used Twitter Stream Grab, a historical collection of tweets obtained from the general Twitter API stream.

Citation:

@article{smetanin2022rusetitweet,
  title = {RuSentiTweet: A Sentiment Analysis Dataset of General Domain Tweets in Russian},
  author = {Sergey Smetanin},
  journal = {PeerJ Computer Science},
  volume = {8},
  pages = {e1039},
  year = {2022},
  doi = {10.7717/peerj-cs.1039},
  publisher = {PeerJ Inc.}
}