## Create Party support file

The data must first be downloaded from GitHub: open [github.com/hampusborgos/SwedishPolls](https://github.com/hampusborgos/SwedishPolls/blob/master/Data/Polls.csv) and press "Download raw file". The data is then filtered and configured in this file.

You will need to enter the correct path below.

In [5]:
# Written by Magnus Olander

import pandas as pd

path_to_original_poll_data = "../../data/data_all/Polls.csv"
data = pd.read_csv(path_to_original_poll_data)

#### Removes unnecessary columns and rows, and changes date to year and month

In [6]:
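# Split "YYYY-mon" into separate Year and Month columns
data[['Year', 'Month']] = data['PublYearMonth'].str.split('-', expand=True)
# Average all polls per month; numeric_only=True skips text columns,
# which recent pandas versions no longer drop silently
data = data.groupby(["Year", "Month"]).mean(numeric_only=True).reset_index()
# Drop the columns that are not needed in the party support file
data = data.drop(["FI", "Uncertain", "n", "approxPeriod"], axis=1)
# Missing values become 0
data = data.fillna(0)
data.head()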

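|   | Year | Month | M | L | C | KD | S | V | MP | SD |
|---|------|-------|------|-----|-----|------|------|------|-----|-----|
| 0 | 1998 | apr | 26.8 | 6.9 | 6.5 | 5.2 | 37.9 | 9.5 | 5.7 | 0.0 |
| 1 | 1998 | aug | 26.1 | 6.7 | 6.3 | 7.0 | 39.1 | 9.7 | 4.5 | 0.0 |
| 2 | 1998 | dec | 25.4 | 4.7 | 4.8 | 10.6 | 35.2 | 11.5 | 5.4 | 0.0 |
| 3 | 1998 | feb | 28.4 | 6.5 | 5.9 | 4.7 | 37.6 | 9.4 | 5.9 | 0.0 |
| 4 | 1998 | jan | 29.8 | 6.5 | 5.7 | 4.2 | 36.9 | 9.5 | 6.0 | 0.0 |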
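To make the split-and-average step concrete, here is a minimal sketch on an invented three-row sample. Only the column name `PublYearMonth` and the party columns `M` and `S` come from the notebook; the `Company` column and all values are hypothetical, and `numeric_only=True` is an assumption that keeps text columns out of the average on recent pandas versions.

```python
import pandas as pd

# Invented sample rows; only the column layout mirrors the poll file
sample = pd.DataFrame({
    "PublYearMonth": ["2010-jan", "2010-jan", "2010-feb"],
    "Company": ["A", "B", "A"],  # hypothetical text column
    "M": [30.0, 32.0, 29.0],
    "S": [34.0, 36.0, 35.0],
})

# Split "2010-jan" into separate Year and Month columns
sample[["Year", "Month"]] = sample["PublYearMonth"].str.split("-", expand=True)

# Average all polls published in the same month, ignoring text columns
monthly = sample.groupby(["Year", "Month"]).mean(numeric_only=True).reset_index()
print(monthly)
```

Because `Month` is still a string at this point, the groups are sorted alphabetically rather than chronologically, which is why `apr` appears before `jan` in the output above.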


#### Changes month names from Swedish abbreviations to numbers

In [7]:
# Mapping for numbered month names
month_mapping = {'jan': 1, 'feb': 2, 'mar': 3, 'apr': 4, 'maj': 5, 'jun': 6,
                 'jul': 7, 'aug': 8, 'sep': 9, 'okt': 10, 'nov': 11, 'dec': 12}

# Map all months in the datafile
data['Month'] = data['Month'].str.lower().map(month_mapping)
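One thing worth checking after a mapping like this is whether any month label failed to match, since `Series.map` turns unknown keys into `NaN` without raising an error. The sketch below uses an invented series of labels (including a deliberately wrong English `may`) to show how such entries could be spotted; it is an illustration, not part of the original workflow.

```python
import pandas as pd

month_mapping = {'jan': 1, 'feb': 2, 'mar': 3, 'apr': 4, 'maj': 5, 'jun': 6,
                 'jul': 7, 'aug': 8, 'sep': 9, 'okt': 10, 'nov': 11, 'dec': 12}

# Invented labels; "may" is a deliberate non-Swedish entry
months = pd.Series(["jan", "Maj", "okt", "may"])
mapped = months.str.lower().map(month_mapping)

# Labels missing from the mapping become NaN, so list them explicitly
unmapped = months[mapped.isna()].tolist()
print(unmapped)
```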

#### Removes data before 2006

In [8]:
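# "Year" still holds strings after the split, so this comparison is
# lexicographic; that is safe here because every year has four digits
data = data[data["Year"] > "2005"]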
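Since `Year` is a string column at this stage, the filter compares text rather than numbers. The following sketch, using invented year values, shows that the string comparison and a numeric comparison select the same rows as long as all years have four digits.

```python
import pandas as pd

# Invented year labels, stored as strings like the "Year" column
years = pd.Series(["1998", "2005", "2006", "2024"])

# Lexicographic comparison: correct here because every year has four digits
by_string = years[years > "2005"]

# Numeric comparison: same rows, without relying on the string length
by_number = years[years.astype(int) > 2005]

print(by_string.tolist())
print(by_number.tolist())
```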

#### Save the altered party support (poll) file

In [10]:
# Name your output file
output_name = "../../data/data_all/polls_party.csv"

# index=False keeps the row index out of the saved file
data.to_csv(output_name, index=False)
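A quick way to confirm the saved file looks as expected is to read it back. The sketch below does this with an invented two-row frame and a temporary directory rather than the real output path; the frame contents and the `out_path` name are hypothetical.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Invented stand-in for the processed poll data
monthly = pd.DataFrame({"Year": ["2006", "2006"], "Month": [1, 2], "M": [28.1, 27.5]})

with tempfile.TemporaryDirectory() as tmp:
    out_path = Path(tmp) / "polls_party.csv"
    # Write without the row index, then read the file back
    monthly.to_csv(out_path, index=False)
    restored = pd.read_csv(out_path)

print(restored.dtypes)
```

Note that `Year` is written as text but parsed as an integer when the file is read back, so later code can compare it numerically.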