![Final Project Head (1).png](attachment:681c261a-b8d4-4d1e-8e0f-82bf46351aa6.png)

### 12/07/2024 Ash Copeland
# Overview
This article examines the relationship between education and political views and voting behavior. By comparing state-level data on educational attainment with election results, we can gain insight into how level of education relates to political affiliation and election decisions by state. Our driving question is not only "How does education affect political views and voting behavior?" but also "Does a higher level of education lead to more Democratic votes rather than Republican?" This second question is the hypothesis I hope to test.

# Data Sources
First off, to make the comparison described above, we need the data!
The data for this project was gathered from the United States Census Bureau's 2023 report on educational attainment (table S1501). This report lists levels of educational attainment as percentages by state. The data was downloaded as a .csv file to make importing it easy. The report can be viewed on the US Census Bureau website here: https://data.census.gov/table/ACSST1Y2023.S1501?q=S1501&g=010XX00US$0400000

# Data Tidying
Our first step is to tidy the data! It is far easier to work with information that is organized. We start by importing the packages we need: pandas, for building a dataframe from our raw data, and json. Note that the cells shown below only make use of pandas.

The next step is to isolate the columns we actually need from the large array of US Census data. First, the two necessary columns, the state name and the education percentage, were isolated. It is important to note that the percentage chosen to represent level of education is the percentage of individuals who hold a bachelor's degree or higher, since earning a bachelor's degree is widely considered, both in business and in social circles, to be the point at which someone has attained a true level of higher education. Once those two columns are isolated, the next step is to rename them to better represent the data within them. Then we display the data in the final dataframe. You can see this process unfold below:


In [5]:
import json
import pandas as pd

In [6]:
#Import the Data
ex = pd.read_csv('ACSST1Y2023.S1501-Data.csv')
#Isolate the Columns needed
exA = ex[['NAME', 'S1501_C02_015E']]
#Drop the row that only has labels in it
exB = exA.drop(0)
#Rename the columns to better fit
exC = exB.rename(columns={'NAME': 'State', 'S1501_C02_015E': 'Edu Percentage'})
#Display the results
exC

| | State | Edu Percentage |
|---|---|---|
| 1 | Alabama | 28.9 |
| 2 | Alaska | 32.2 |
| 3 | Arizona | 33.5 |
| 4 | Arkansas | 26.2 |
| 5 | California | 37.5 |
| 6 | Colorado | 46.4 |
| 7 | Connecticut | 42.9 |
| 8 | Delaware | 36.5 |
| 9 | District of Columbia | 65.9 |
| 10 | Florida | 34.9 |
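
One detail worth checking before using these numbers: because the export carries a row of labels beneath the header, pandas may read the percentage column as text rather than as numbers, even after that row is dropped. The sketch below uses a tiny inline stand-in for the CSV (an assumption for illustration; the Puerto Rico value is a placeholder, not a real figure) to show one way to convert the column to numbers and to drop Puerto Rico, which is excluded from the map later on.

```python
import io

import pandas as pd

# Hypothetical miniature of the Census export: a header row of column codes,
# a row of human-readable labels, then one row per geography.
# The Puerto Rico value is a placeholder for illustration only.
raw_csv = """NAME,S1501_C02_015E
Geographic Area Name,Label row text
Alabama,28.9
Colorado,46.4
District of Columbia,65.9
Puerto Rico,0.0
"""

ex = pd.read_csv(io.StringIO(raw_csv))

# Same tidying steps as in the notebook: select, drop the label row, rename.
edu = (
    ex[["NAME", "S1501_C02_015E"]]
    .drop(0)
    .rename(columns={"NAME": "State", "S1501_C02_015E": "Edu Percentage"})
)

# The label row forces the column to be read as text, so convert it.
# errors="coerce" turns anything non-numeric into NaN instead of raising.
edu["Edu Percentage"] = pd.to_numeric(edu["Edu Percentage"], errors="coerce")

# Keep states (and DC) only; Puerto Rico is a territory.
edu = edu[edu["State"] != "Puerto Rico"].reset_index(drop=True)

print(edu.dtypes)
print(edu)
```

With the column stored as numbers, comparisons such as finding the minimum and maximum percentage work as expected.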


# MapChart.net
MapChart.net (https://www.mapchart.net/usa.html) is a fantastic tool that can help us turn that data into figures to compare! Although it is certainly possible to write the code needed to build a custom map from a shapefile using geopandas, an extension of pandas for geographic data, this is largely unnecessary here. There is no reason to reinvent the wheel! MapChart even has a pre-made United States map waiting for data. It is important to note that although we do have Puerto Rico's data, we are focusing specifically on the official US states rather than territories, so Puerto Rico can be excluded. Using MapChart, we first create a blank map that is ready for any set of data. Next, we simply specify how we would like our data colorized!

Our maximum education percentage for a state was 48, and our minimum was 24. Luckily, that puts 36 right in the middle! The midpoint between the minimum and the center is then 30, and the midpoint between the center and the maximum is 42. With this information, we can easily set up our map to colorize based on those parameters!

Any state whose education percentage is between **24-30** is colorized **light red/coral** to show a low level of education.<br>
Any state whose education percentage is between **30-36** is colorized **standard red** to show a lower level of education.<br>
Any state whose education percentage is between **36-42** is colorized **light blue** to show a good level of education.<br>
Any state whose education percentage is between **42-48** is colorized **standard blue** to show a high level of education.<br>
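
The band arithmetic above can be sketched in a few lines of Python. This is only an illustration of the rule, not part of the MapChart workflow; the function name `band_color` is hypothetical, and the choice to place a value that sits exactly on a boundary (such as 30) in the higher band is an assumption, since the ranges above share their endpoints.

```python
from bisect import bisect_right

# Minimum and maximum state percentages, as stated in the text.
low, high = 24, 48

# The center, and the halfway marks on either side of it.
center = (low + high) / 2                                   # 36.0
edges = [(low + center) / 2, center, (center + high) / 2]   # [30.0, 36.0, 42.0]

# One color per band, from lowest to highest education percentage.
BAND_COLORS = ["light red/coral", "standard red", "light blue", "standard blue"]


def band_color(edu_percentage: float) -> str:
    """Return the map color for a bachelor's-or-higher percentage.

    Assumption: a value exactly on an edge goes to the higher band.
    Values outside 24-48 fall into the nearest outer band.
    """
    return BAND_COLORS[bisect_right(edges, edu_percentage)]


# Values taken from the dataframe output shown earlier.
for state, pct in [("Alabama", 28.9), ("Arizona", 33.5),
                   ("California", 37.5), ("Colorado", 46.4)]:
    print(f"{state}: {band_color(pct)}")
```

Using the four values from the earlier dataframe, Alabama falls in the lowest band, Arizona in the second, California in the third, and Colorado in the highest.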

These color scales create an easy-to-read figure that still has the colorization we need to compare with our election results!

With the parameters set, it's just a matter of using the colorization tools provided on MapChart.net to generate a figure!

# Election Results
With our new figure generated, we need something to compare it to! Why not use the most recent data we have? The election results map was downloaded from GZERO Media (https://www.gzeromedia.com/us-election/us-election-2024-map-the-vote), as it provided the best figure for comparison. The data came from the election results posted by the United States Elections Department.

With a few outlines and titles added for easy reading, we can display those figures below and take a look!

## Figure 1.0 | 2024 Presidential Election Results
*Note: This figure is from November 6, 2024, and some state electoral decisions may have changed since.*

![USA Map Labelling Worksheet (3).png](attachment:498ccdd7-8087-4a32-a544-d390cb038441.png)

## Figure 1.1 | US Census Education Levels (Created with MapChart.net)

![USA Map Labelling Worksheet (2).png](attachment:10d2b7d8-19f3-4f95-9328-785c89149c42.png)

# Analysis
When looking at the two figures side by side, we get a pretty clear comparison! States with a good or high level of education often align with Democrats, while states with a lower or low level of education often align with Republicans. There are some outliers in our analysis: Virginia (which has turned blue since the figure's November 6, 2024 snapshot), North Carolina, and New Mexico. It is important to keep in mind that although education may be a factor, specific policy issues and each candidate's stances may be factors as well. Even with those outliers, the two maps suggest a fairly strong correlation. Of course, this is just the beginning, and there are some flaws in our process to discuss and make note of.
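
To move from a visual comparison toward a number, one option is to compute a correlation between the education percentage and a 0/1 indicator for a Democratic win. The sketch below is only an illustration under stated assumptions: it uses just the ten rows shown in the dataframe output earlier, and the winner column was filled in by hand from the publicly reported 2024 presidential results rather than taken from this notebook. Ten rows, including the District of Columbia with its unusually high percentage, are far too few to draw conclusions from.

```python
from math import sqrt
from statistics import mean

# (state, bachelor's-or-higher %, 2024 presidential winner)
# Percentages come from the dataframe output above; winners are an
# assumption added by hand for illustration.
sample = [
    ("Alabama", 28.9, "R"),
    ("Alaska", 32.2, "R"),
    ("Arizona", 33.5, "R"),
    ("Arkansas", 26.2, "R"),
    ("California", 37.5, "D"),
    ("Colorado", 46.4, "D"),
    ("Connecticut", 42.9, "D"),
    ("Delaware", 36.5, "D"),
    ("District of Columbia", 65.9, "D"),
    ("Florida", 34.9, "R"),
]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


edu = [pct for _, pct, _ in sample]
voted_dem = [1 if winner == "D" else 0 for _, _, winner in sample]

# Average education percentage within each group of states.
dem_mean = mean(pct for _, pct, w in sample if w == "D")
rep_mean = mean(pct for _, pct, w in sample if w == "R")

r = pearson(edu, voted_dem)
print(f"Mean edu % (D): {dem_mean:.2f}")
print(f"Mean edu % (R): {rep_mean:.2f}")
print(f"Correlation between edu % and Democratic win: {r:.2f}")
```

Running the same calculation over all fifty states, with election results pulled from an official source, would give a fairer measure than this small sample.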

# Flaws and Next Steps
This process is a heavily scaled-down investigation compared to what can and should be done in a final report. Due to time constraints and the nature of this project, only a single election was compared. If continued, it would be best to use the US Census Bureau's entire library of educational attainment data going back to 2010. This would allow us to see, and possibly even graph, a trend over time that we could then evaluate for correlation with election results.

Although outside tools can sometimes make the process easier, they also create a certain disconnect from the underlying data. If I did this again, I would recommend using geopandas alongside matplotlib.pyplot to create the figure locally for higher credibility. I would also pull the raw electoral data and create a figure from that as well, rather than relying on an outside source that is slightly outdated due to the ever-changing nature of a recent election.

# Conclusion

I believe this process is a solid first step toward recognizing a possible correlation between education and political stance. It also provides some support for the hypothesis that higher education is associated with a more Democratic/liberal rather than Republican/conservative vote in elections. Although we would certainly need more data to confirm this, it is an excellent start!

# References

MapChart - https://www.mapchart.net/usa.html <br>
US Census Data - https://data.census.gov/table/ACSST1Y2023.S1501?q=S1501&g=010XX00US$0400000 <br>
Election Results Figure - https://www.gzeromedia.com/us-election/us-election-2024-map-the-vote <br>
