# Geographic Analysis of the Distribution of Catholic Churches in Seattle

This project aims to consolidate working knowledge of the Python libraries GeoPandas and Folium by analyzing the distribution of Catholic churches in the Seattle area (USA). It also includes a brief data cleaning and manipulation step.

The file *ADS-Churches.csv* was obtained from https://www.kaggle.com/datasets/chesspiece99/archdiocese-of-seattle-churches/code.

In [40]:
import pandas as pd

df = pd.read_csv('ADS-Churches.csv')
df

Unnamed: 0,X,Y,NAME,ADDRESS,CITY,NEIGHBORHOOD,LABEL
0,47.676973,-122.294037,Assumption Church,6201 33rd Ave NE,Seattle,Ravenna,Assumption Church
1,47.666715,-122.319072,Blessed Sacrament,5050 8th Ave NE,Seattle,U-District,Blessed Sacrament
2,47.616895,-122.340406,Christ Our Hope,1902 2nd Ave NE,Seattle,Pike Place,Christ Our Hope
3,47.7171,-122.354404,Christ the King,405 N 117th St,Seattle,Broadview,Christ the King
4,47.540873,-122.376435,Holy Family,9622 20th Ave SW,Seattle,White Center,Holy Family
5,47.568445,-122.385018,Holy Rosary,4139 42nd Ave SW,Seattle,Alki,Holy Rosary
6,47.611976,-122.308628,Immaculate Conception,820 18th Ave NE,Seattle,Central District,Immaculate Conception
7,47.649343,-122.400488,Our Lady of Fatima,3307 W Dravus St,Seattle,Interbay,Our Lady of Fatima
8,47.513981,-122.319443,Our Lady of Lourdes,10243 12th Ave S,Seattle,South Park,Our Lady of ourdes
9,47.697081,-122.290604,Our Lady of the Lake,8900 35th Ave NE,Seattle,Maple Leaf,Our Lady of the Lake


In [41]:
# Drop the CITY column: every row has the same value, so it carries no information.
df.drop('CITY', axis=1, inplace=True)

In [42]:
df

Unnamed: 0,X,Y,NAME,ADDRESS,NEIGHBORHOOD,LABEL
0,47.676973,-122.294037,Assumption Church,6201 33rd Ave NE,Ravenna,Assumption Church
1,47.666715,-122.319072,Blessed Sacrament,5050 8th Ave NE,U-District,Blessed Sacrament
2,47.616895,-122.340406,Christ Our Hope,1902 2nd Ave NE,Pike Place,Christ Our Hope
3,47.7171,-122.354404,Christ the King,405 N 117th St,Broadview,Christ the King
4,47.540873,-122.376435,Holy Family,9622 20th Ave SW,White Center,Holy Family
5,47.568445,-122.385018,Holy Rosary,4139 42nd Ave SW,Alki,Holy Rosary
6,47.611976,-122.308628,Immaculate Conception,820 18th Ave NE,Central District,Immaculate Conception
7,47.649343,-122.400488,Our Lady of Fatima,3307 W Dravus St,Interbay,Our Lady of Fatima
8,47.513981,-122.319443,Our Lady of Lourdes,10243 12th Ave S,South Park,Our Lady of ourdes
9,47.697081,-122.290604,Our Lady of the Lake,8900 35th Ave NE,Maple Leaf,Our Lady of the Lake


In [43]:
df['NAME'] == df['LABEL']

0      True
1      True
2      True
3      True
4      True
5      True
6      True
7      True
8     False
9      True
10     True
11     True
12     True
13     True
14     True
15     True
16     True
17     True
18     True
19     True
20     True
21     True
22     True
23     True
24    False
dtype: bool
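
The `False` entries above can be inspected directly before deciding whether `LABEL` is redundant. A minimal sketch, assuming a small hand-built frame (`sample`, a hypothetical name) with three rows copied from the table above rather than the full CSV:

```python
import pandas as pd

# Three rows copied from the table above; `sample` stands in for the full df.
sample = pd.DataFrame({
    "NAME": ["Our Lady of Fatima", "Our Lady of Lourdes", "Our Lady of the Lake"],
    "LABEL": ["Our Lady of Fatima", "Our Lady of ourdes", "Our Lady of the Lake"],
})

# Keep only the rows where the two columns disagree.
mismatches = sample.loc[sample["NAME"] != sample["LABEL"], ["NAME", "LABEL"]]
print(mismatches)
```

On the full DataFrame the same filter would be `df.loc[df['NAME'] != df['LABEL'], ['NAME', 'LABEL']]`, which lists the rows flagged `False` above side by side.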

In [44]:
# The two columns do not differ in any practical way (row 8, for example, differs only by a typo in LABEL), so the LABEL column can also be dropped.
df.drop('LABEL', axis=1, inplace=True)

In [45]:
df

Unnamed: 0,X,Y,NAME,ADDRESS,NEIGHBORHOOD
0,47.676973,-122.294037,Assumption Church,6201 33rd Ave NE,Ravenna
1,47.666715,-122.319072,Blessed Sacrament,5050 8th Ave NE,U-District
2,47.616895,-122.340406,Christ Our Hope,1902 2nd Ave NE,Pike Place
3,47.7171,-122.354404,Christ the King,405 N 117th St,Broadview
4,47.540873,-122.376435,Holy Family,9622 20th Ave SW,White Center
5,47.568445,-122.385018,Holy Rosary,4139 42nd Ave SW,Alki
6,47.611976,-122.308628,Immaculate Conception,820 18th Ave NE,Central District
7,47.649343,-122.400488,Our Lady of Fatima,3307 W Dravus St,Interbay
8,47.513981,-122.319443,Our Lady of Lourdes,10243 12th Ave S,South Park
9,47.697081,-122.290604,Our Lady of the Lake,8900 35th Ave NE,Maple Leaf


In [46]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 25 entries, 0 to 24
Data columns (total 5 columns):
 #   Column        Non-Null Count  Dtype  
---  ------        --------------  -----  
 0   X             25 non-null     float64
 1   Y             25 non-null     float64
 2   NAME          25 non-null     object 
 3   ADDRESS       25 non-null     object 
 4   NEIGHBORHOOD  25 non-null     object 
dtypes: float64(2), object(3)
memory usage: 1.1+ KB


In [47]:
df.duplicated().sum()
# So there are no duplicate rows either, and we can move on to the geographic analysis.

0
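
`df.duplicated()` only flags rows that are identical in every column. As a hedged complement, the sketch below uses the `subset` parameter to look for repeated coordinates. The `sample` frame is an illustrative assumption: its first two rows come from the table above, while the third row is made up to show the effect and is not in the CSV.

```python
import pandas as pd

# Two rows from the table above plus one made-up row (hypothetical) that
# repeats the first church's coordinates under a slightly different name.
sample = pd.DataFrame({
    "X": [47.676973, 47.666715, 47.676973],
    "Y": [-122.294037, -122.319072, -122.294037],
    "NAME": ["Assumption Church", "Blessed Sacrament", "Assumption"],
})

# Whole-row comparison finds nothing, because the names differ.
full_row_dupes = sample.duplicated().sum()

# Comparing only the coordinate columns flags the repeated location.
coord_dupes = sample.duplicated(subset=["X", "Y"]).sum()

print(full_row_dupes, coord_dupes)
```

The same call on the real data would be `df.duplicated(subset=['X', 'Y']).sum()`.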

## Geographic Analysis


In [48]:
!pip install geopandas



In [49]:
!pip install folium



In [50]:
import geopandas as gpd
import folium

seattle = folium.Map(
    location=[47.607831919325065, -122.32422097977813],  # [latitude, longitude]
    zoom_start=12,  # must lie within [min_zoom, max_zoom]
    min_zoom=12,
    max_zoom=21
)
seattle
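
The map center above is hard-coded. As an alternative sketch, the center and bounding box could be derived from the data. This assumes, as in this dataset, that `X` holds latitude and `Y` longitude, and it uses a hypothetical three-row `sample` frame copied from the table above instead of the full CSV.

```python
import pandas as pd

# Three rows copied from the table above (X = latitude, Y = longitude).
sample = pd.DataFrame({
    "X": [47.676973, 47.540873, 47.649343],
    "Y": [-122.294037, -122.376435, -122.400488],
})

# Mean latitude/longitude as a simple map center: [lat, lon].
center = [sample["X"].mean(), sample["Y"].mean()]

# South-west and north-east corners of the bounding box.
bounds = [
    [sample["X"].min(), sample["Y"].min()],
    [sample["X"].max(), sample["Y"].max()],
]

print(center, bounds)
```

`center` could be passed as `location` to `folium.Map`, and `bounds` to the map's `fit_bounds` method, so that the view adapts to the points instead of relying on a fixed zoom.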

In [53]:
# Add the churches to the map (in this dataset X holds the latitude and Y the longitude)
for _, igreja in df.iterrows():
    folium.Marker(
        location=[igreja['X'], igreja['Y']],
        popup=igreja['NAME']
    ).add_to(seattle)

In [52]:
seattle

In [55]:
# Save the map to an HTML file
seattle.save('seattle_catholic_churches.html')
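
GeoPandas is imported above but not used in the cells shown. Below is a minimal sketch of how the same table could be turned into a `GeoDataFrame`. It assumes that `X` is latitude and `Y` is longitude in WGS 84 (EPSG:4326), which the dataset does not state explicitly; `sample` and `churches` are hypothetical names, and the two rows are copied from the table above.

```python
import pandas as pd
import geopandas as gpd

# Two rows copied from the table above (X = latitude, Y = longitude).
sample = pd.DataFrame({
    "X": [47.676973, 47.666715],
    "Y": [-122.294037, -122.319072],
    "NAME": ["Assumption Church", "Blessed Sacrament"],
})

# points_from_xy expects x = longitude and y = latitude, hence Y first.
churches = gpd.GeoDataFrame(
    sample,
    geometry=gpd.points_from_xy(sample["Y"], sample["X"]),
    crs="EPSG:4326",  # assumed coordinate reference system
)

print(churches[["NAME", "geometry"]])
```

With a geometry column in place, spatial operations such as joins against neighborhood polygons become possible, should such a layer be available.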