Skip to content
#

text-as-data

Here are 30 public repositories matching this topic...

Code for collecting and cleaning speeches (text) of the US 2020 election campaign. Corresponding publication: "A text dataset of campaign speeches of the main tickets in the 2020 US presidential election", by Ioannis Chalkiadakis, Louise Anglès d’Auriac, Gareth W. Peters, and Divina Frau-Meigs

  • Updated Sep 20, 2024
  • Python

Collection of text corpora for publicly available speeches from Mexican president Andres Manuel Lopez Obrador (AMLO) sourced from YouTube. The dataset includes his daily morning conferences (conferencias mañaneras) 😴🪿

  • Updated Jul 7, 2024
  • Python

This repository uses text-as-data methods alongside traditional primary source reading to analyze early American state constitutions. The R scripts create a function to scrape and clean the constitutional text, run sentiment analysis, calculate tf-idf, and perform LDA. This is a work-in-progress.

  • Updated Dec 28, 2022
  • HTML

Improve this page

Add a description, image, and links to the text-as-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the text-as-data topic, visit your repo's landing page and select "manage topics."

Learn more