Thursdays November 8th to December 6th (excluding Thanksgiving week)
2:30-4pm in IAB (International Affairs Building) room 707
Specifically, info for the four sessions is as follows:
- Thursday November 8, 2:30-4pm, IAB 707
- Thursday November 15, 2:30-4pm, IAB 707
- Thursday November 29, 2:30-4pm, IAB 707
- Thursday December 6, 2:30-4pm, IAB 707
I emailed this survey out to get a peek into the minds of people who want to attend the workshop, so that I know things like where to start, what knowledge to assume, and what types of examples or text collections y'all would find interesting.
Last but definitely not least, you'll be in a really good spot if you complete this Interactive R Tutorial before the first workshop. But I'll still go over the topics briefly just in case.
Full tutorial now up and ready, in Introduction_to_R.md within the week-1 folder!
Previews of potential topics for the (not-yet-written, since hopefully I'll customize extensively) second, third, and fourth workshops:
Soon to come, in Basic_Text_Analysis.md within the week-2 folder.
glm() and the new (basically) required argument, family
Refresher on logit, probit, tobit
Simplest logit example
- How to get coeffs as probabilities instead of log-odds ratios: use plogis(model$coefficients)
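
Here's a minimal sketch of what the simplest logit example could look like. It assumes R's built-in mtcars data and the variable names below, which are placeholders rather than the dataset the workshop will actually use. One caveat: plogis() is the inverse logit, so it gives a directly interpretable probability for the intercept (the case where every predictor is zero), while slope coefficients are usually easier to read as odds ratios via exp(), or through predict().

```r
# Assumption: mtcars stands in for the workshop data.
# Outcome: am (0 = automatic, 1 = manual); predictor: wt (weight in 1000 lbs).
# family = binomial is what makes this a logit model;
# binomial(link = "probit") would give a probit instead.
model <- glm(am ~ wt, data = mtcars, family = binomial(link = "logit"))

# Coefficients come back on the log-odds scale.
log_odds <- coef(model)

# Exponentiating gives odds ratios for the slope coefficients.
odds_ratios <- exp(log_odds)

# Inverse logit of the intercept: predicted probability when wt = 0.
baseline_prob <- plogis(log_odds[["(Intercept)"]])

# Predicted probabilities at more realistic (hypothetical) weights.
new_cars <- data.frame(wt = c(2.5, 3.5))
predicted_probs <- predict(model, newdata = new_cars, type = "response")
print(predicted_probs)
```

Using predict() with type = "response" saves you from applying plogis() by hand, since it computes the linear predictor and the inverse logit in one step.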
Now intro to topic modeling. (Blei figure of NYTimes figures, then Blei figure with colored disks and science article)
Simplest possible topic model project
Simplest possible dynamic topic model (if I have time, otherwise next week)
Methods for measuring "Innovation" and "Influence" over time (if I have time)
Finish dynamic topic modeling (if necessary)
WEB SCRAPING. SOCIAL MEDIA DATA. LOUD NOISES.
word2vec social science example
Text analysis combined with network analysis
Fun datasets! Do things with them. Go. Now.