Skip to content

jpowerj/tad-workshop

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Columbia University Text-As-Data Workshop 2018

Thursdays November 8th to December 6th (excluding Thanksgiving week)

2:30-4pm in IAB (International Affairs Building) room 707

Specifically, info for the four sessions is as follows:

  1. Thursday November 8, 2:30-4pm, IAB 707
  2. Thursday November 15, 2:30-4pm, IAB 707
  3. Thursday November 29, 2:30-4pm, IAB 707
  4. Thursday December 6, 2:30-4pm, IAB 707

I emailed this survey out to get a peek into the minds of people who want to attend the workshop, so that I know things like where to start, what knowledge to assume, and what types of examples or text collections yall would find interesting.

Last but definitely not least, you'll be in a really good spot if you complete this Interactive R Tutorial before the first workshop. But I'll still go over the topics briefly just in case.

First Workshop: Gentle But SPEEDY Introduction to R

Full tutorial now up and ready, in Introduction_to_R.md within the week-1 folder!

Previews of potential topics for the (not-yet-written, since hopefully I'll customize extensively) second, third, and fourth workshops:

Second Workshop: From Data Analysis to TEXT ANALYSIS

Soon to come, in Basic_Text_Analysis.md within the week-2 folder.

Data analysis redux: Non-linear models

glm() and the new (basically) required argument

Refresher on logit, probit, tobit

Simplest logit example

  • How to get coeffs as probabilities instead of log-odds ratios: use plogis(model$coefficients))

Text Analysis 2: Sentiment Analysis Boogaloo

Third Workshop (Now it's all text analysis all the time)

Text Analysis 3: topic modeling

Now intro to topic modeling. (Blei figure of NYTimes figures, then Blei figure with colored disks and science article)

Simplest possible topic model project

Simplest possible dynamic topic model (if I have time, otherwise next week)

Methods for measuring "Innovation" and "Influence" over time (if I have time)

Fourth Workshop: The paths ahead

Finish dynamic topic modeling (if necessary)

WEB SCRAPING. SOCIAL MEDIA DATA. LOUD NOISES.

word2vec social science example

Text analysis combined with network analysis

Fun datasets! Do things with them. Go. Now.

About

Columbia University Text-as-Data Workshop, 2018-2019

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published