Module Four Final Project

For: Trace Political, LLC

By: Jonathan E. Ericksen, JE Consulting

Background

Trace Political is a consulting firm that specializes in social media strategy for congressional campaigns. Trace Political has contracted with JE Consulting to analyze roughly 86,000 tweets from sitting congressional representatives. The tweets were acquired in May 2018 and are expected to yield valuable insights into the social media behavior and trends of current elected officials.

Objective:

The objective for JE Consulting is twofold:

Objective One: Corpus Analysis

- Extract the 10 most frequently used words in tweets from Democrat and Republican representatives, respectively
- Extract the total vocabulary count for both the Democrat and Republican tweet corpora
- Using word vectorization, extract the words with the highest semantic relation to each of the following:
    - Trump
    - Bill
    - Tax
- Using TextBlob, extract the sentiment rating for Democrat and Republican tweets
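
The first two items above can be illustrated with a minimal sketch that uses only the Python standard library. This is not the notebook's actual implementation: the `tokenize` and `corpus_stats` helpers, the regular expression, and the sample tweets are all hypothetical, and no stop-word removal is applied, which a real analysis would likely need.

```python
import re
from collections import Counter

# Hypothetical sample tweets for illustration only; not taken from the dataset.
SAMPLE_TWEETS = {
    "Democrat": ["Health care is a right", "Protect health care for families"],
    "Republican": ["Tax cuts help families", "Tax reform is working"],
}

def tokenize(text):
    """Lowercase a tweet and return its word tokens (letters, optional apostrophe)."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())

def corpus_stats(tweets, top_n=10):
    """Return (top_n most common words with counts, vocabulary size) for a list of tweets."""
    counts = Counter(token for tweet in tweets for token in tokenize(tweet))
    return counts.most_common(top_n), len(counts)

if __name__ == "__main__":
    for party, tweets in SAMPLE_TWEETS.items():
        top_words, vocab_size = corpus_stats(tweets)
        print(party, "vocabulary size:", vocab_size)
        print(party, "most common words:", top_words)
```

Running the same function once per party keeps the two corpora separate, so the top-10 lists and vocabulary counts can be compared side by side.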

Objective Two: Tweet Classification using Neural Networks

In addition to corpus analysis, Trace Political has asked JE Consulting to assess the feasibility of building a neural network classifier that predicts the political party to which the author of an official tweet belongs. To be deemed successful, the model must predict political party with at least 90% accuracy on test data. Should this succeed, Trace Political will further pursue projects related to predictive modeling with natural language processing.
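
As a rough illustration of the success criterion, the sketch below computes test accuracy from true and predicted party labels and compares it with the 90% threshold. The function names `accuracy` and `meets_threshold` and the example labels are hypothetical; the model that produces the predictions is outside the scope of this sketch.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot compute accuracy on an empty test set")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)

def meets_threshold(y_true, y_pred, threshold=0.90):
    """True if test accuracy reaches the required threshold (90% by default)."""
    return accuracy(y_true, y_pred) >= threshold

if __name__ == "__main__":
    # Hypothetical labels for illustration only.
    true_labels = ["Democrat", "Republican", "Democrat", "Republican"]
    predicted = ["Democrat", "Republican", "Republican", "Republican"]
    print("accuracy:", accuracy(true_labels, predicted))
    print("meets 90% threshold:", meets_threshold(true_labels, predicted))
```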

Question: Can a neural network successfully classify political affiliation based on language contained in official tweets from the available dataset?

Methodology:

This project is broken down into two parts. Part One addresses the first objective: a basic analysis of the text corpus contained in the dataset's tweets. Part Two attempts to model the text data in order to classify tweets by party. Part Two finishes with a conclusion based on the attempted modeling, along with recommendations for future work to improve model performance. The sections and sub-sections are outlined in the table of contents below.

Finally, work should be done to develop a relevant use case for a model that accurately predicts political party from tweet content. Though we did not meet Trace Political's threshold for deeming this project a success, a powerful model may still be possible with more data and further experimentation with modeling and data cleaning. Once such a model is developed, ways to employ it should be fleshed out for use within Trace Political's product offering.

Table of Contents:

Obtain

Part One

Part Two
