Skip to content
No description, website, or topics provided.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
SOTU-csv
SOTU-textfiles
__pycache__
.gitignore
README
aggregate_csvs.html
combined3.csv
count_words.py
csvs_to_json.py
sparkcloud2.html

README

Repository for our presentation on SparkClouds.
Aayam Poudel and Jamie Hand

Our process:

1. Got text of State of the Union addresses from 2001 to 2016
and put them into .txt files.

2. Wrote a python script to strip new lines and punctuation,
extract each word (excluding stop words like "the", "a", "I", and "will"),
count how many times it appears, and put them in a list in order of their
frequency.

3. Output these words, counts, and years to a CSV file, and
combined the CSV files into a single file.

4. Wrote an HTML page with a demo, showing a SparkCloud that
takes in the CSV as data.

TODO:

- Each sparkline should have its own domain and range.
You can’t perform that action at this time.