Project 5: Natural Language Processing

In this project, I use natural language processing techniques to explore a dataset of tweets from members of the 116th United States Congress, which met from January 3, 2019, to January 3, 2021. The dataset has already been cleaned and includes information about each legislator. Concretely, I will do the following:

Preprocess the text of legislators’ tweets
Conduct exploratory data analysis of the text
Use sentiment analysis to explore differences between legislators’ tweets
Featurize text with manual feature engineering, frequency-based, and vector-based techniques
Predict legislators’ political parties and whether they are a Senator or Representative
Explore whether asymmetric polarization shows up in how politicians communicate to their constituents through tweets
Explore whether Senators' tweets support the theory that the Senate is more moderate
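
As a rough illustration of the first and fourth steps above (preprocessing and frequency-based featurization), here is a minimal sketch using only the Python standard library. It is not the code from the notebooks: the example tweets, the tiny stop-word list, and the helper names `preprocess` and `bag_of_words` are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical example tweets; these are NOT taken from the project dataset.
EXAMPLE_TWEETS = [
    "Proud to vote for the bill today! https://example.com @SomeColleague #Healthcare",
    "The bill is a disaster for small businesses. #Jobs",
]

URL_RE = re.compile(r"https?://\S+")
MENTION_RE = re.compile(r"@\w+")
TOKEN_RE = re.compile(r"[a-z]+")

# Tiny illustrative stop-word list (an assumption; a real analysis would use a fuller one).
STOP_WORDS = {"the", "a", "is", "to", "for", "of", "and"}


def preprocess(tweet):
    """Drop URLs and @mentions, lowercase, and return alphabetic non-stop-word tokens."""
    text = URL_RE.sub(" ", tweet)
    text = MENTION_RE.sub(" ", text)
    tokens = TOKEN_RE.findall(text.lower())
    return [tok for tok in tokens if tok not in STOP_WORDS]


def bag_of_words(docs):
    """Build a sorted vocabulary and a document-by-term count matrix from token lists."""
    vocab = sorted({tok for doc in docs for tok in doc})
    index = {tok: i for i, tok in enumerate(vocab)}
    matrix = []
    for doc in docs:
        row = [0] * len(vocab)
        for tok, n in Counter(doc).items():
            row[index[tok]] = n
        matrix.append(row)
    return vocab, matrix


tokenized = [preprocess(t) for t in EXAMPLE_TWEETS]
vocab, counts = bag_of_words(tokenized)
```

Each row of `counts` is a feature vector for one tweet, with columns ordered as in `vocab`. Vectors like these could then be passed to a classifier for the party and chamber prediction tasks, though the choice of model is left open here.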

anniehelms/annie_cssproject5
