sismetanin/oswb-russian-tweets

Observable Subjective Well-Being Inferred from Tweets in Russian

This repo contains code for measuring Observable Subjective Well-Being based on tweets in Russian.

Twitter Data

We used the Twitter Stream Grab [1] as our source of tweets in Russian. Twitter Stream Grab is a publicly available historical collection of JSON data captured from the general Twitter “Spritzer” API stream.
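As a rough illustration of how Russian tweets could be pulled out of such a dump, here is a minimal sketch. It assumes the archive has already been decompressed into text with one JSON object per line, and that each tweet object carries the standard `lang` and `text` fields. The function name `iter_russian_tweets` is hypothetical and is not taken from this repo's code.

```python
import json
from typing import Iterable, Iterator


def iter_russian_tweets(lines: Iterable[str]) -> Iterator[dict]:
    """Yield tweet objects whose language tag is Russian.

    Assumption: each line holds one JSON object, as in a raw stream dump.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank keep-alive lines
        try:
            obj = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or malformed records
        if not isinstance(obj, dict):
            continue
        # Stream dumps also contain non-tweet records (e.g. delete notices),
        # which have no "text" field.
        if obj.get("lang") == "ru" and "text" in obj:
            yield obj


if __name__ == "__main__":
    sample = [
        '{"lang": "ru", "text": "привет"}',
        '{"lang": "en", "text": "hello"}',
        '{"delete": {"status": {"id": 1}}}',
    ]
    for tweet in iter_russian_tweets(sample):
        print(tweet["text"])
```

Taking an iterable of lines rather than a file path keeps the filter independent of how the archive is stored or compressed.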

Sentiment Classification Model

We fine-tuned RuRoBERTa-Large [2] on RuSentiTweet [3] and achieved new state-of-the-art results of macro $F_1 = 0.7229$ and weighted $F_1 = 0.7281$, surpassing the previous best results, which were obtained with the RuBERT model.
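The two reported metrics differ only in how per-class $F_1$ scores are averaged: macro $F_1$ weights every class equally, while weighted $F_1$ weights each class by its share of the true labels. The sketch below computes both in plain Python. The function name `macro_and_weighted_f1` and the labels in the example are illustrative assumptions, not the repo's evaluation code or the RuSentiTweet label set.

```python
from collections import Counter
from typing import Sequence, Tuple


def macro_and_weighted_f1(
    y_true: Sequence[str], y_pred: Sequence[str]
) -> Tuple[float, float]:
    """Return (macro F1, weighted F1) for a single-label classification task."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    labels = sorted(set(y_true) | set(y_pred))
    support = Counter(y_true)  # number of true examples per class
    per_class = {}
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs
        per_class[label] = 2 * tp / denom if denom else 0.0

    macro = sum(per_class.values()) / len(labels)
    weighted = sum(per_class[l] * support[l] for l in labels) / len(y_true)
    return macro, weighted


if __name__ == "__main__":
    # Hypothetical labels, for illustration only.
    gold = ["positive", "positive", "negative", "neutral"]
    pred = ["positive", "negative", "negative", "neutral"]
    macro, weighted = macro_and_weighted_f1(gold, pred)
    print(f"macro F1 = {macro:.4f}, weighted F1 = {weighted:.4f}")
```

On imbalanced data the two averages can diverge noticeably, which is why both are reported.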

Subjective Well-Being Data

We used the survey-based VCIOM Happiness Index [4], which is regularly reported by the Russian Public Opinion Research Center (VCIOM).

Footnotes

  1. https://archive.org/details/twitterstream

  2. https://huggingface.co/sberbank-ai/ruRoberta-large

  3. https://peerj.com/articles/cs-1039/

  4. https://wciom.ru/analytical-reviews/analiticheskii-obzor/indeks-schastja
