text-as-data

Here are 29 public repositories matching this topic...

JasonKessler / scattertext

Beautiful visualizations of how language differs among document types.

Updated Mar 6, 2024
Python

MilaNLProc / contextualized-topic-models

A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

nlp embeddings transformer topic-modeling nlp-library nlp-machine-learning bert neural-topic-models text-as-data topic-coherence multilingual-topic-models multilingual-models

Updated Jan 16, 2024
Python

JasonKessler / Scattertext-PyData

Notebooks for the Seattle PyData 2017 talk on Scattertext

visualization nlp natural-language-processing word2vec pydata political-science gender political-parties computational-social-science text-visualization text-as-data

Updated Jan 12, 2018
HTML

ryanjgallagher / shifterator

Interpretable data visualizations for understanding how texts differ at the word level

natural-language-processing sentiment-analysis information-theory text-analysis data-visualization digital-humanities computational-social-science text-as-data

Updated Nov 4, 2023
Python

jboynyc / textnets

Text analysis with networks.

visualization nlp sociology text-analysis network-analysis computational-social-science text-as-data

Updated May 8, 2024
Python

fedenanni / Computational-Text-Analysis-2018-19

2018 Computational Text Analysis Notebooks, University of Mannheim

natural-language-processing teaching-materials computational-social-science text-as-data

Updated Nov 22, 2018
Jupyter Notebook

davidycliao / redguards

A package for replicating the estimates and findings in the article "Factionalism and the Red Guards under Mao's China: Ideal Point Estimation Using Text Data."

nlp china part-of-speech-tagger r-programming udpipe quanteda text-as-data red-guards cultural-revolution

Updated Feb 16, 2023
R

umanlp / SemScale

A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)

computational-social-science text-scaling text-as-data

Updated Jan 16, 2024
Python

wesslen / summer2017-socialmedia

Summer 2017 Social Media Analytics Workshop Series

r twitter-api geospatial facebook-api text-as-data

Updated May 19, 2018
HTML

thieled / dictvectoR

'dictvectoR' measures the similarity between a concept dictionary and documents, using fastText word vectors. Implements the "Distributed-Dictionary-Representation" (Garten et al. 2018) method in R.

natural-language-processing r dictionary word-embeddings text-analysis scaling word-representations ideology word-vectors text-as-data

Updated Sep 14, 2022
R
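
The Distributed Dictionary Representation idea named in the dictvectoR entry can be illustrated with a minimal sketch: average the word vectors of the dictionary terms, average the word vectors of the document's tokens, and take the cosine similarity of the two. This is not dictvectoR's code or API (the package is written in R); the function names and the toy three-dimensional vectors below are hypothetical stand-ins for pretrained fastText vectors.

```python
import math

# Toy 3-dimensional word vectors with made-up values, for illustration only.
# In practice these would come from a pretrained model such as fastText.
TOY_VECTORS = {
    "tax": [0.9, 0.1, 0.0],
    "budget": [0.8, 0.2, 0.1],
    "spending": [0.7, 0.3, 0.0],
    "goal": [0.0, 0.9, 0.2],
    "match": [0.1, 0.8, 0.3],
}


def mean_vector(words, vectors):
    """Average the vectors of the words that have one; None if none do."""
    found = [vectors[w] for w in words if w in vectors]
    if not found:
        return None
    dim = len(found[0])
    return [sum(v[i] for v in found) / len(found) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def ddr_score(dictionary, document, vectors=TOY_VECTORS):
    """Similarity between a concept dictionary and a whitespace-tokenized document."""
    concept = mean_vector(dictionary, vectors)
    doc = mean_vector(document.lower().split(), vectors)
    if concept is None or doc is None:
        return 0.0  # no overlap with the vocabulary
    return cosine(concept, doc)


fiscal = ["tax", "budget"]
print(ddr_score(fiscal, "spending"))    # closer to the fiscal dictionary
print(ddr_score(fiscal, "goal match"))  # further from it
```

With these toy vectors, the document about spending scores higher against the fiscal dictionary than the one about a match, even though neither shares a word with the dictionary.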

cjerzak / LinkOrgs-software

LinkOrgs: An R package for linking records on organizations using half a billion open-collaborated records from LinkedIn

machine-learning record-linkage community-detection equinox text-as-data jax transformer-architecture organizational-units

Updated Mar 25, 2024
R

davidycliao / bisCrawler

An automated web crawler for extracting central bankers' speeches

python scraper scraping speeches text-as-data bank-for-international-settlements central-bankers-speeches central-banker

Updated Feb 9, 2023
Python

adamlauretig / gensim_in_R

Code for estimating word embeddings with gensim in R.

r gensim text-as-data

Updated Oct 30, 2018

aflueckiger / KED2022

The ABC of Computational Text Analysis. BA Seminar, Spring 2022, University of Lucerne

sociology teaching computational-social-science social-science text-as-data

Updated Feb 17, 2023
HTML

WZBSocialScienceCenter / tm_corona

A small showcase for topic modeling with the tmtoolkit Python package. I use a corpus of articles from the German online news website Spiegel Online (SPON) to create a topic model for before and during the COVID-19 pandemic.

python text-mining news scraping text-analysis corona topic-modeling webscraping text-as-data topicmodeling covid-19

Updated Dec 2, 2020
Jupyter Notebook

graceadcox / Refugee-Text-as-Data

Original corpus of articles relating to refugees, scraped from the Tennessee newspaper The Chattanoogan, along with simple code for a text-as-data word cloud.

r word-cloud text-as-data

Updated Nov 11, 2019
R

CT-P / portuguese_open_data

Empirical framework applied to parliament discourses and Twitter data, with a Discourse Polarization Index.

discourse computational-social-science text-as-data political-polarization gentzkow

Updated Oct 11, 2022
Jupyter Notebook

Sam-Gartenstein / Machine-Learning-for-the-Social-Sciences

Material from my Machine Learning for the Social Sciences course

neural-networks supervised-machine-learning unsupervised-machine-learning text-as-data

Updated Jan 2, 2024
Jupyter Notebook

smkerr / news-israel-gaza

🇮🇱🇵🇸 News coverage of Israel-Hamas War 🇵🇸🇮🇱

r text-as-data

Updated Feb 9, 2024
R

ivansabik / chairum-corpus

Collection of text corpora for publicly available speeches from Mexican president Andres Manuel Lopez Obrador (AMLO) sourced from YouTube. The dataset includes his daily morning conferences (conferencias mañaneras) 😴🪿

Updated Nov 1, 2023
Python

