Folder containing code files used to get, clean, and analyze data for my dissertation



This repository contains (some of) the code for my dissertation. I will release the full data once the dissertation is complete.


python/: Contains code to scrape the legislative text from the Dail Eireann website, to get tweets by Irish legislators from the Twitter API, and to clean and parse text and metadata from these sources.

data/: Contains various data files

dail_json/: Contains the JSONL files of legislators' speeches and questions, created from the raw HTML files

dail_html/: Contains the raw HTML files scraped from the Dail Eireann website

tweets/: Contains the JSON files of legislators' tweets obtained from the Twitter API
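
For readers unfamiliar with the JSONL format used in dail_json/, the following is a minimal sketch of how such files could be read in Python. It is not taken from the code in python/. The helper names `read_jsonl` and `read_jsonl_dir` are hypothetical, and the sketch assumes only that each non-empty line holds one JSON object and that the files use a `.jsonl` extension; it makes no assumption about the fields inside each record.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Yield one parsed JSON value per non-empty line of a JSONL file.

    Hypothetical helper, not part of this repository's code.
    """
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)


def read_jsonl_dir(directory):
    """Yield records from every *.jsonl file in a directory.

    Files are visited in sorted filename order. The .jsonl extension
    is an assumption about how the files are named.
    """
    for path in sorted(Path(directory).glob("*.jsonl")):
        yield from read_jsonl(path)
```

If the subdirectories sit under data/ (an assumption about the layout), something like `for record in read_jsonl_dir("data/dail_json"): ...` would iterate over every speech and question record without loading all files into memory at once.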