# Project: Determining the Best Neighborhood in Pittsburgh
In this project, we explore data-driven metrics to determine the best neighborhood in Pittsburgh.
We focus on metrics related to education levels, safety, and quality of life.

## Metrics
### Education Levels

In [ ]:
# Import libraries
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Load education dataset
education_data = pd.read_csv('PATH')

# Analyze education levels by neighborhood
education_summary = education_data.groupby('Neighborhood')['Education Level'].mean().sort_values(ascending=False).head(10)

# Plotting education levels
plt.figure(figsize=(10, 6))
sns.barplot(x=education_summary.values, y=education_summary.index, palette='viridis')
plt.title('Top 10 Neighborhoods by Education Level')
plt.xlabel('Education Level (Mean)')
plt.ylabel('Neighborhood')
plt.show()
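
The mean above is only meaningful if `Education Level` is numeric. If the dataset instead stores attainment as text labels, one option is to map the labels to an ordinal score first. The sketch below is a minimal illustration under that assumption: the label names, the scores in `EDUCATION_SCORES`, the helper `mean_education_by_neighborhood`, and the toy rows are all hypothetical and are not taken from the project's dataset.

```python
import pandas as pd

# Hypothetical label-to-score mapping; the real dataset's categories may differ.
EDUCATION_SCORES = {
    "No diploma": 0,
    "High school": 1,
    "Bachelor's": 2,
    "Graduate": 3,
}


def mean_education_by_neighborhood(df: pd.DataFrame) -> pd.Series:
    """Map education labels to ordinal scores and average them per neighborhood."""
    scores = df["Education Level"].map(EDUCATION_SCORES)
    # Labels missing from the mapping become NaN; mean() skips NaN by default.
    return (
        df.assign(**{"Education Score": scores})
        .groupby("Neighborhood")["Education Score"]
        .mean()
        .sort_values(ascending=False)
    )


# Toy rows for illustration only, not real Pittsburgh figures.
toy_education_rows = pd.DataFrame({
    "Neighborhood": ["A", "A", "B", "B"],
    "Education Level": ["Graduate", "Bachelor's", "High school", "Bachelor's"],
})
toy_education = mean_education_by_neighborhood(toy_education_rows)
print(toy_education)
```

An ordinal mapping treats the gap between adjacent levels as equal, which is a modeling choice; reporting the share of residents at or above a given level is a common alternative.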

### Safety (Crime Rates)

In [ ]:
# Load crime dataset
crime_data = pd.read_csv('PATH')

# Sum crime counts by neighborhood and keep the 10 lowest totals
crime_summary = crime_data.groupby('Neighborhood')['Crime Count'].sum().sort_values().head(10)

# Plotting crime counts
plt.figure(figsize=(10, 6))
sns.barplot(x=crime_summary.values, y=crime_summary.index, palette='magma')
plt.title('10 Neighborhoods with the Lowest Crime Counts')
plt.xlabel('Crime Count')
plt.ylabel('Neighborhood')
plt.show()
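
The cell above ranks neighborhoods by raw crime counts, which tends to favor sparsely populated areas. A per-capita rate is usually a fairer basis for comparison. The sketch below shows one way to compute it, assuming a separate population figure per neighborhood is available; the function `crime_rate_per_1000` and the toy numbers are hypothetical and not part of the original analysis.

```python
import pandas as pd


def crime_rate_per_1000(crime_counts: pd.Series, population: pd.Series) -> pd.Series:
    """Return crimes per 1,000 residents, indexed by neighborhood, lowest first."""
    # Align both inputs on the neighborhood index; drop neighborhoods missing from either.
    aligned = pd.concat({"crimes": crime_counts, "population": population}, axis=1).dropna()
    # Skip unpopulated areas to avoid dividing by zero.
    aligned = aligned[aligned["population"] > 0]
    return (aligned["crimes"] * 1000 / aligned["population"]).sort_values()


# Toy numbers for illustration only, not real Pittsburgh figures.
toy_crimes = pd.Series({"A": 50, "B": 20, "C": 5})
toy_population = pd.Series({"A": 10000, "B": 1000, "D": 500})
toy_rates = crime_rate_per_1000(toy_crimes, toy_population)
print(toy_rates)
```

In this toy data, neighborhood A has the most incidents but the lowest rate, which is the kind of reversal raw counts can hide.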

### Quality of Life (Parks)

In [ ]:
# Load parks dataset
parks_data = pd.read_csv('PATH')

# Analyze park count by neighborhood
parks_summary = parks_data.groupby('Neighborhood')['Park Count'].sum().sort_values(ascending=False).head(10)

# Plotting park counts
plt.figure(figsize=(10, 6))
sns.barplot(x=parks_summary.values, y=parks_summary.index, palette='plasma')
plt.title('Top 10 Neighborhoods by Park Count')
plt.xlabel('Number of Parks')
plt.ylabel('Neighborhood')
plt.show()

## Best Neighborhood
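Combine the three metrics into a single score and use it to determine the best neighborhood. Because the metrics are on different scales, and a lower crime count is better while higher education and park values are better, bring them to a common range and orient them the same way before averaging.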

In [ ]:
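# Example: equal-weight average of min-max normalized metrics over all neighborhoods (the top-10 summaries have mismatched indexes)
metrics = pd.concat({
    'education': education_data.groupby('Neighborhood')['Education Level'].mean(),
    'safety': -crime_data.groupby('Neighborhood')['Crime Count'].sum(),  # negated: fewer crimes is better
    'parks': parks_data.groupby('Neighborhood')['Park Count'].sum(),
}, axis=1).dropna()
combined_metric = ((metrics - metrics.min()) / (metrics.max() - metrics.min())).mean(axis=1)
best_neighborhood = combined_metric.idxmax()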

print(f'The best neighborhood in Pittsburgh based on our analysis is: {best_neighborhood}')
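
If some metrics should count more than others, the combination can be written as an explicit weighted average. The sketch below is one possible formulation, not the project's definitive method: the helper `composite_score`, the weights, and the toy values are hypothetical. It min-max normalizes each metric, flips metrics where lower is better, and then applies the weights.

```python
import pandas as pd


def composite_score(metrics: pd.DataFrame, weights: dict, lower_is_better=()) -> pd.Series:
    """Weighted average of min-max normalized metrics, one score per neighborhood."""
    data = metrics[list(weights)].dropna().astype(float)
    span = data.max() - data.min()
    # A constant column carries no ranking information; use a span of 1 to avoid 0/0.
    normalized = (data - data.min()) / span.replace(0, 1.0)
    for column in lower_is_better:
        # Flip so that 1.0 is always the most desirable value.
        normalized[column] = 1.0 - normalized[column]
    w = pd.Series(weights, dtype=float)
    return normalized.mul(w, axis=1).sum(axis=1) / w.sum()


# Toy values and weights for illustration only, not real Pittsburgh figures.
toy_metrics = pd.DataFrame(
    {"education": [3.0, 2.0, 1.0], "crime": [40, 10, 25], "parks": [2, 5, 1]},
    index=["A", "B", "C"],
)
toy_weights = {"education": 0.4, "crime": 0.4, "parks": 0.2}
toy_scores = composite_score(toy_metrics, toy_weights, lower_is_better=["crime"])
toy_best = toy_scores.idxmax()
print(toy_scores)
print(toy_best)
```

The choice of weights is subjective, so it is worth rerunning the ranking with a few different weightings to see whether the top neighborhood changes.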

## Conclusion
Reflect on the data-driven determination of the best neighborhood compared to personal preferences.

In [ ]:
print('### Conclusion ###')
print(f'Our data-driven analysis ranks {best_neighborhood} as the best neighborhood in Pittsburgh, based on education levels, safety, and park access.')
print('These metrics give a more objective view than personal preferences, which vary from person to person.')