# Final Project - Choosing the Best Pittsburgh Neighborhood
## Amanda Cotumaccio, Sreeha Reddy, Kendal Kalanish

### Introduction

We defined the best Pittsburgh neighborhood as the one that is easiest to traverse without a car. We feel this metric matters both for the environment and for happier cities. The EPA estimates that vehicles cause almost 75% of the carbon monoxide pollution in the US. This pollution mostly comes from suburban, completely car-dependent areas, but cities like Pittsburgh are still prone to car pollution. Making travel by bike and on foot easier and more accessible is important for combating climate change. We decided not to consider public transportation in this metric and instead used a narrower definition of car-free travel: public transportation in Pittsburgh is widely available in many neighborhoods but not at all in others, so it shows no clear pattern.

We measured car-free travel in Pittsburgh with three factors:
- Walkability describes the distance from where you live to amenities, and whether they are within reasonable walking distance.

- Bikeability, like walkability, describes how easily an area can be traversed by bike.

- Sidewalk to Street Ratio uses information about different locations and blocks to compare sidewalk ratings against street ratings. The quality of the sidewalks and the rating of the street condition for each block in the city can help us determine the best neighborhood in Pittsburgh.

With these three factors, we were able to measure and find the best neighborhoods for our metrics.

### Walkability - Sreeha

*Introduction*

First, I imported pandas and loaded the walkability dataset. Then I calculated the mean of all the walk scores in the dataset, which came out to 42.33.

*The Metric*

In [None]:
import pandas as pd
walk = pd.read_csv("walkscorect.xls-walk-score-by-ct.csv")
walk["Walk Score"].mean()

In [None]:
import pandas as pd
walk = pd.read_csv("walkscorect.xls-walk-score-by-ct.csv")
walk.plot(kind = "scatter", x = "Census Tract", y = "Walk Score", title = "Census Tract vs Walk Score")

In [None]:
import pandas as pd
walk = pd.read_csv("walkscorect.xls-walk-score-by-ct.csv")
walk.sample(20)
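The mean alone does not say which tracts stand out, so here is a minimal sketch of how the same DataFrame could be summarized further. The function name `summarize_walk_scores` and the tiny example frame are illustrative assumptions; only the `Census Tract` and `Walk Score` column names come from the cells above.

```python
import pandas as pd

def summarize_walk_scores(walk, n=5):
    """Return the mean walk score and the n highest-scoring census tracts."""
    mean_score = walk["Walk Score"].mean()
    top = walk.nlargest(n, "Walk Score")[["Census Tract", "Walk Score"]]
    return mean_score, top.reset_index(drop=True)

# Made-up values, used only to show the call; the real input is the CSV loaded above.
example = pd.DataFrame({"Census Tract": [101, 102, 103], "Walk Score": [40, 90, 65]})
mean_score, top = summarize_walk_scores(example, n=2)
print(round(mean_score, 2))
print(top)
```

With the real data, calling `summarize_walk_scores(walk)` would list the five most walkable tracts next to the overall mean.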

*Conclusion*

I read through the spreadsheet and graphed all the data points. The scatter plot looks strangely uniform, but it has three very clear columns of separation. I also created a map of walk score by neighborhood, using an edited and merged version of my census tract data and the SNAP neighborhood data, which share the same census tracts.
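The merge itself was done outside this notebook, so the following is only a sketch of how such a join could look in pandas, not the exact steps we took. It assumes a hypothetical lookup table `tracts_to_hoods` with `Census Tract` and `Neighborhood` columns; the real SNAP file may name these differently.

```python
import pandas as pd

def walk_score_by_neighborhood(walk, tracts_to_hoods):
    """Join tract-level walk scores to neighborhoods and average per neighborhood."""
    # Inner join keeps only tracts that appear in both tables.
    merged = walk.merge(tracts_to_hoods, on="Census Tract", how="inner")
    # A neighborhood can span several tracts, so average their scores.
    return (merged.groupby("Neighborhood")["Walk Score"]
                  .mean()
                  .sort_values(ascending=False))

# Made-up rows, used only to show the shape of the inputs.
walk_example = pd.DataFrame({"Census Tract": [1, 2, 3], "Walk Score": [50, 70, 90]})
lookup_example = pd.DataFrame({"Census Tract": [1, 2, 3],
                               "Neighborhood": ["A", "A", "B"]})
print(walk_score_by_neighborhood(walk_example, lookup_example))
```

Both `Census Tract` columns need the same dtype for the join to work, so one side may have to be converted first.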

In [None]:
import pandas as pd
import numpy as np
import geopandas as gpd
%matplotlib inline
import matplotlib.pyplot as plt

# read_csv already returns a DataFrame, so no extra wrapping is needed
df = pd.read_csv('Neighborhoods_with_SNAP_Data.csv')

neighborhoods = gpd.read_file("Neighborhood_SNAP.shp") # read in the shapefile

fig, ax=plt.subplots(figsize=(50,50),edgecolor="white")
neighborhoods.plot(ax=ax)
ax.scatter(df.Neighborhood_2010_INTPTLON10, df.Neighborhood_2010_INTPTLAT10, c=df.Walk_Score, s=900, cmap="Oranges")
plt.show()

### Bikeability - Kendal
I used matplotlib and geopandas to create two maps from census information and POGOH data. The first shows the neighborhoods in Pittsburgh with POGOH stations.

In [None]:
import pandas as pd
import numpy as np
import geopandas as gpd
%matplotlib inline
import matplotlib.pyplot as plt

# read_csv already returns a DataFrame, so no extra wrapping is needed
df = pd.read_csv('pogoh-station-locations-february-2023.csv')

neighborhoods = gpd.read_file("Neighborhood_SNAP.shp") # read in the shapefile

fig, ax=plt.subplots(figsize=(20,20))
neighborhoods.plot(ax=ax, color="plum")
ax.scatter(df.Longitude, df.Latitude, s=df.Total_Docks, c = 'black')
plt.show()

This map shows that the neighborhoods around Bloomfield and Oakland have the most POGOH bike stations.

I then created a map in which each point's color shows the percentage of commuters who bike to work and its size shows the total population of the area.

In [None]:

# read_csv already returns a DataFrame, so no extra wrapping is needed
df = pd.read_csv('Neighborhoods_with_SNAP_Data.csv')

neighborhoods = gpd.read_file("Neighborhood_SNAP.shp") # read in the shapefile

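fig, ax = plt.subplots(figsize=(50,50), edgecolor="white")
neighborhoods.plot(ax=ax)
# Color encodes the share of commuters who bike; marker size encodes the 2010 population
points = ax.scatter(df.Neighborhood_2010_INTPTLON10, df.Neighborhood_2010_INTPTLAT10, c=df.SNAP_All_csv_Commute_to_Work__B, s=df.Pop__2010)
# Add a colorbar so the color scale can be read off the map
fig.colorbar(points, ax=ax, label="Commute to work by bike")
plt.show()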


This map has two bright spots for commuting to work by bike: the neighborhoods of Lower Lawrenceville and Bloomfield. I chose to make maps because they represent data like this succinctly. One can easily recognize overall patterns by area, with the most bike stations and the most bike commuters lining up in the same part of the city.
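Bright spots on a map can be hard to compare by eye, so a ranked table is one way to double-check them. The sketch below is a suggestion rather than part of the original analysis. The bike-commute column name is taken from the map cell above, while `Neighborhood_2010_HOOD` is an assumed name for the neighborhood column and should be checked against `df.columns`.

```python
import pandas as pd

BIKE_COL = "SNAP_All_csv_Commute_to_Work__B"   # column used for color in the map above
NAME_COL = "Neighborhood_2010_HOOD"            # assumed name column; verify in df.columns

def top_bike_commute(df, n=5, name_col=NAME_COL, bike_col=BIKE_COL):
    """Return the n neighborhoods with the highest bike-commute value."""
    # Coerce to numbers so stray text becomes NaN, then drop those rows.
    share = pd.to_numeric(df[bike_col], errors="coerce")
    ranked = df.assign(**{bike_col: share}).dropna(subset=[bike_col])
    return ranked.nlargest(n, bike_col)[[name_col, bike_col]].reset_index(drop=True)

# Made-up rows, used only to show the call.
example = pd.DataFrame({NAME_COL: ["X", "Y", "Z"], BIKE_COL: [0.01, 0.05, None]})
print(top_bike_commute(example, n=2))
```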

### Sidewalk Score vs. Street Score
This plot compares the length and quality of the sidewalk against that of the street. Whereas the walk score considers the ability to reach amenities from any one spot, this data considers the safety and quality of the walk itself, another important metric.

In [None]:
import pandas as pd
import matplotlib.pyplot as plt

# Read the CSV file into a pandas DataFrame
df = pd.read_csv('https://data.wprdc.org/dataset/853a077d-0a31-4292-8a1d-5d60b530169b/resource/b90ccee1-c0aa-43b9-93e2-8a25e690c393/download/sidewalkstreetratioupload.csv')
# Create a scatterplot
df.plot(kind='scatter', x='SidewalkStepLFt', y='StreetLFt', alpha=0.5)

# Set the title and axis labels (the "LFt" columns appear to be lengths in linear feet)
plt.title('Sidewalk Length vs. Street Length')
plt.xlabel('Sidewalk length (SidewalkStepLFt)')
plt.ylabel('Street length (StreetLFt)')

# Show the sidewalk vs. street comparison
plt.show()
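The scatterplot shows the two columns side by side, but the sidewalk to street ratio named in the introduction can also be computed directly. This is a minimal sketch under the assumption that `SidewalkStepLFt` and `StreetLFt` are lengths in the same unit; the output column name `sidewalk_to_street` is made up for illustration.

```python
import numpy as np
import pandas as pd

def add_sidewalk_ratio(df):
    """Return a copy of df with a sidewalk-to-street ratio column added."""
    out = df.copy()
    # Treat zero street length as missing to avoid dividing by zero.
    street = out["StreetLFt"].replace(0, np.nan)
    out["sidewalk_to_street"] = out["SidewalkStepLFt"] / street
    return out

# Made-up rows, used only to show the call.
example = pd.DataFrame({"SidewalkStepLFt": [100, 50, 10], "StreetLFt": [50, 100, 0]})
print(add_sidewalk_ratio(example))
```

A higher ratio means more sidewalk per foot of street, which could then be sorted or mapped like the other two factors.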

### Conclusion
A few areas kept appearing across our maps, and we decided they fit our metrics best: Bloomfield, which had a high walk score as well as many POGOH stations, and Lower Lawrenceville, which had the highest number of bike commuters. Both neighborhoods scored high across the datasets. After looking at a separate topographic map of Pittsburgh, we chose Bloomfield as the best neighborhood for its flatness, feeling that this would add to the ease of carless travel.

See our full presentation here:
https://docs.google.com/presentation/d/1QO1ELsxd61iKQIlwHeezNmlGaYDwBF0Nb-ZA1_sHeEo/edit#slide=id.g20a0b52ccc5_1_1