What is it?
About 1 MB of text taken from speeches made by Donald Trump at various points in his 2016 campaign for President of the United States.
What is this for?
For all of your data science, machine learning, and entertainment needs.
Run the example word cloud generator:
    pip install wordcloud
    cd examples
    python trump_wordcloud.py
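The contents of trump_wordcloud.py are not reproduced here. As a rough, independent sketch of what a word cloud script for this data might look like, the code below counts words with the standard library and renders them with the wordcloud package. The function names, the `min_length` filter, the output file name, and the `../speeches.txt` path are all assumptions made for illustration, not taken from the repository.

```python
import re
from collections import Counter


def word_frequencies(text, min_length=4):
    """Count lowercase words that have at least min_length characters."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(w for w in words if len(w) >= min_length)


def render_word_cloud(frequencies, out_path="wordcloud.png"):
    """Render a frequency mapping to an image (needs `pip install wordcloud`)."""
    # Imported lazily so the counting step works without the package installed.
    from wordcloud import WordCloud

    cloud = WordCloud(width=800, height=400, background_color="white")
    cloud.generate_from_frequencies(frequencies)
    cloud.to_file(out_path)


if __name__ == "__main__":
    # Assumed location: speeches.txt one directory above examples/.
    with open("../speeches.txt", encoding="utf-8") as f:
        freqs = word_frequencies(f.read())
    print(freqs.most_common(10))
    render_word_cloud(freqs)
```

The length filter is a crude stand-in for stop-word removal; a real script would more likely pass a stop-word list.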
Copyright Disclaimer: Under Section 107 of the Copyright Act of 1976, allowance is made for "fair use" for purposes such as criticism, comment, news reporting, teaching, scholarship, and research. Fair use is a use permitted by copyright statute that might otherwise be infringing. Non-profit, educational, or personal use tips the balance in favor of fair use.
corenlp-output contains annotations produced by CoreNLP 3.3.0. The annotations were made on artificial documents: the original speeches.txt, split so that each line is its own document.
Sentence-level annotations should hold up fine, but because each line was annotated separately, coreference will be broken across paragraphs.
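The per-line split described above can be sketched as follows. This is a minimal illustration, not the script that was actually used: the function names, the `line_00000.txt` naming scheme, and the choice to skip blank lines are assumptions.

```python
from pathlib import Path


def split_into_line_documents(text):
    """Return one pseudo-document per non-empty line of text."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def write_line_documents(source_path, out_dir):
    """Write each non-empty line of source_path to its own numbered file."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    text = Path(source_path).read_text(encoding="utf-8")
    docs = split_into_line_documents(text)
    for i, doc in enumerate(docs):
        # Hypothetical naming scheme: line_00000.txt, line_00001.txt, ...
        (out / f"line_{i:05d}.txt").write_text(doc + "\n", encoding="utf-8")
    return len(docs)
```

Because an annotator sees each file in isolation, a pronoun in one file can never be linked to a name mentioned in another, which is why cross-paragraph coreference is lost.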