📄 1mb Archive of Donald Trump Speeches (with CoreNLP 3.3.0 annotations)
Python
Switch branches/tags
Nothing to show
Clone or download
Pull request Compare This branch is 4 commits ahead, 1 commit behind ryanmcdermott:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
corenlp-output
examples
fake-splits
LICENSE
README.md
fake-splits.py
speeches.txt

README.md

trump-speeches

What is it?

1mb of text data taken from speeches made by Donald Trump at various points in his 2016 campaign for President of the United States.

What is this for?

For all of your data science, machine learning, and entertainment needs.

word_cloud

Examples

Run the example Word Cloud generator.

pip install wordcloud
cd examples
python trump_wordcloud.py

License

Copyright Disclaimer Under Section 107 of the Copyright Act 1976, allowance is made for "fair use" for purposes such as criticism, comment, news reporting, teaching, scholarship, and research. Fair use is a use permitted by copyright statute that might otherwise be infringing. Non-profit, educational or personal use tips the balance in favor of fair use

CoreNLP Annotations

corenlp-output contains annotations applied by CoreNLP 3.3.0. The annotations were made on fake documents: the original speeches.txt, broken up by line breaks.

Anything sentential should hold ok, but coreference will be broken across paragraphs.