Skip to content

Computational Analysis of Belarusian State-Owned Media

Notifications You must be signed in to change notification settings

tsimafeip/SB-belarus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SB-Belarus-Yesterday

The main contribution of this project is a new dataset (data/sb_articles.db).
I have scraped and preprocessed ~20k political articles, published by Belarusian state-owned newspaper "SB. Belarus' Segodnya" ("SB. Belarus Today").

As part of this project, I also completed other tasks:

  • simple count-based analysis was performed to analyse this dataset (see code/helper.py)
  • unsupervised LDA modelling for topic interpretation was conducted (see code/lda)
  • detailed description of all steps and background overview of Belarusian political situation can be found [in the accompanying paper](meta/Framing and Topic Modelling in Belarusian State-Owned Media.pdf).

Data Source and Examples

Data for analysing was scraped from web version of "SB. Belarus' Segodnya". It is currently down, but you can find articles by hyperlink using Internet Archive.

I use SQLite for storing data and querying it. Data is stored in the single table, called 'documents', with the structure, described in the table above. I will illustrate each field with an example from actual data. Please note, that source data is in Russian, I translated it by myself for this table.

Field Name Description Example
document_id non-empty unique identifier of the document 1440895
title primary document title To whom and why does union integration hinder
title_h1 optional secondary document title Our alternative
tags optional list of tags selected by author union state@politics@economy@sanctions
similar_documents optional links to similar articles https://www.sb.by/articles/soyuz-belarus-rossiya
publication_date date and time of publication 2021-10-21T10:42:00+03:00
author name of the author polina konoga
body main text of the article Alexei Avdonin: The West has chosen the Union State as its target. (first sentence, full body is much longer)
hyperlink original url of the article https://www.sb.by/articles/nasha-alternativa-souz.html

About

Computational Analysis of Belarusian State-Owned Media

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published