This repository contains scripts for analyzing the rhetoric of TED talks. The scripts download the subtitle (.srt) files for ~1600 TED talks, extract the desired text (either an entire talk or something more granular, such as only certain parts of speech, or different parts of a talk treated as separate documents), and convert the result to the input format used by David Blei's LDA-C implementation of Latent Dirichlet Allocation.
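
The extract-and-convert steps can be sketched roughly as follows. This is a minimal illustration and not the code in this repository: the function names (`srt_to_text`, `tokenize`, `to_lda_c`) and the simple regex tokenizer are assumptions made for the example. The output follows the LDA-C input format, where each document is one line of the form `M term_id:count term_id:count ...` and `M` is the number of unique terms in that document.

```python
import re
from collections import Counter

# Matches SRT timing lines such as "00:00:01,000 --> 00:00:03,000".
TIMESTAMP = re.compile(r"^\d{2}:\d{2}:\d{2}[,.]\d{3}\s*-->")


def srt_to_text(srt):
    """Keep only the spoken text from an SRT string.

    Drops blank lines, timing lines, and numeric cue indices. Caveat: a
    subtitle line consisting solely of digits is also dropped by this sketch.
    """
    kept = []
    for line in srt.splitlines():
        line = line.strip()
        if not line or line.isdigit() or TIMESTAMP.match(line):
            continue
        kept.append(line)
    return " ".join(kept)


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens (assumed scheme)."""
    return re.findall(r"[a-z']+", text.lower())


def to_lda_c(docs):
    """Convert tokenized documents to LDA-C lines plus a vocabulary list.

    Term ids are assigned in order of first appearance, starting at 0.
    """
    vocab = {}
    lines = []
    for tokens in docs:
        # Map each token to its integer id, then count occurrences per id.
        counts = Counter(vocab.setdefault(tok, len(vocab)) for tok in tokens)
        pairs = [f"{term_id}:{n}" for term_id, n in sorted(counts.items())]
        lines.append(" ".join([str(len(counts))] + pairs))
    # Vocabulary ordered by id, one word per line when written to disk.
    words = sorted(vocab, key=vocab.get)
    return lines, words
```

To treat parts of a talk as separate documents, the same functions could be applied to slices of the subtitle text instead of the whole file; each slice then becomes its own line in the LDA-C data file.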
siddjagadish/ted-lda