<h1> 1. Introduction/Business Problem </h1>


<p>
    
<h3>1.1 Entrepreneurial Ecosystem:</h3>    
An entrepreneurial ecosystem consists of stakeholders such as nonprofits, government agencies, academia, private practitioners, financial institutions, and other support organizations. These stakeholders provide support in areas such as education, access to capital, prototyping, and business planning. To make the most of their resources, support organizations should know the business stage, size, and industry of each client they help. In particular, they should consider where the business is located, since location adds a further layer of context.


<h3>1.2 Location:</h3>
In Puerto Rico, a territory of 3.1 million people, there are more than 250 entrepreneurial support organizations. These organizations, however, do not always have data in an accessible, digestible form that lets them make data-driven decisions to improve their services.

<h3>1.3 Use Cases:</h3>
In this project we explore geographic data to give these organizations input that could help them:
<ul>
    <li>provide better, more personalized service</li>
    <li>design projects and develop grant proposals based on the data</li>
    <li>hire specialized personnel to serve the geographic area in which they are located</li>
    <li>develop public policy projects that help businesses in their area.</li>
</ul>
    
</p>



<h1> 2. Data </h1>

<h3>2.1 Data Sources: </h3>
<p> This project uses Foursquare's Places API under the Sandbox membership tier, which includes:
<ul>
    <li>950 Regular Calls/Day</li>
    <li>50 Premium Calls/Day</li>
    <li>1 Photo per Venue</li>
    <li>1 Tip per Venue.</li>
</ul>

With this API it is possible to search and explore venues and to get trends and recommendations for them. It also exposes data about users, such as profile details, check-ins, and venue history.
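
To make the venue exploration concrete, here is a minimal sketch of how a request URL could be built. It assumes the legacy v2 `venues/explore` endpoint; the helper name `build_explore_url`, the version date, the radius, and the `CLIENT_ID` / `CLIENT_SECRET` placeholders are illustrative and are not taken from this project.

```python
from urllib.parse import urlencode

# Assumption: the legacy Foursquare v2 explore endpoint.
EXPLORE_URL = "https://api.foursquare.com/v2/venues/explore"


def build_explore_url(lat, lng, client_id, client_secret,
                      version="20180605", radius=500, limit=50):
    """Build a venues/explore request URL around one coordinate pair."""
    params = {
        "client_id": client_id,          # placeholder credential
        "client_secret": client_secret,  # placeholder credential
        "v": version,                    # API version date (assumed value)
        "ll": f"{lat},{lng}",            # latitude,longitude of the search center
        "radius": radius,                # search radius in meters (assumed value)
        "limit": limit,                  # maximum venues returned per call
    }
    return f"{EXPLORE_URL}?{urlencode(params)}"


# Example: a search centered on the coordinates of one ZIP code.
url = build_explore_url(18.180555, -66.749961, "CLIENT_ID", "CLIENT_SECRET")
print(url)
```

Every request built this way counts against the daily call quota listed above, so it is worth issuing one call per location and caching the response.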


<h3>2.2 Feature Selection:</h3>

In this project the data is limited to venues in Puerto Rico. The features explored are venue stats over a given time period, categories, and likes. With this information, entrepreneurial support organizations can identify business clusters across Puerto Rico and spot areas of opportunity for new business owners or for those looking to expand established businesses.
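
One simple way to look for business clusters is to count venues per category within each municipality. The sketch below uses a small hypothetical table; the column names `Municipality`, `Category`, and `Venues` and all of the rows are made up for illustration and are not project data.

```python
import pandas as pd

# Hypothetical venue records; real rows would come from the API responses.
venues = pd.DataFrame({
    "Municipality": ["Aguadilla", "Aguadilla", "Aguadilla", "Adjuntas", "Adjuntas"],
    "Category": ["Bakery", "Bakery", "Coffee Shop", "Coffee Shop", "Pharmacy"],
})

# Count venues per (municipality, category) pair; larger counts hint at clusters.
counts = (
    venues.groupby(["Municipality", "Category"])
    .size()
    .reset_index(name="Venues")
    .sort_values(["Municipality", "Venues"], ascending=[True, False])
)
print(counts)
```

Categories with a high count in one municipality suggest a cluster, while categories that are absent may point to an area of opportunity.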

<h3>2.3 Data Cleaning:</h3>

</p>


<h1> 3. Methodology </h1>

In [1]:
!pip install beautifulsoup4



In [2]:
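import pandas as pd

# Read the scraped ZIP code table; keep the ZIP column as text
zips_df = pd.read_csv("PRZipCodes.csv", dtype={'tablescraper-selected-row': object})

# Drop the link and helper columns produced by the table scraper
zips_df = zips_df.drop(columns=['tablescraper-selected-row href', 'tablescraper-selected-row 2', 'tablescraper-selected-row href 2', 'tablescraper-selected-row href 3'])

# Give the remaining columns descriptive names
zips_df = zips_df.rename(columns={'tablescraper-selected-row': 'ZIP', 'tablescraper-selected-row 3': 'Borough', 'tablescraper-selected-row 4': 'County'})

# Pad ZIP codes to five characters so they match the keys in the coordinates file
# (a no-op when the values are already padded)
zips_df['ZIP'] = zips_df['ZIP'].astype(str).str.zfill(5)

zips_df.head()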


Unnamed: 0,ZIP,Borough,County
0,601,Adjuntas,Adjuntas
1,602,Aguada,Aguada
2,603,Aguadilla,Aguadilla
3,604,Aguadilla,Aguadilla
4,605,Aguadilla,Aguadilla


In [3]:
# Read the ZIP column as text so leading zeros survive
LatLong_df = pd.read_csv("codes_latlong.csv", dtype={'ZIP': str})

# Restore any leading zeros that were lost (Puerto Rico ZIP codes start with 00)
LatLong_df['ZIP'] = LatLong_df['ZIP'].str.zfill(5)

LatLong_df.head()

Unnamed: 0,ZIP,LAT,LNG
0,601,18.180555,-66.749961
1,602,18.361945,-67.175597
2,603,18.455183,-67.119887
3,606,18.158345,-66.932911
4,610,18.295366,-67.125135
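
Because ZIP codes are the join key between the two files, both sides need the same five-character text form. The following is a small sketch of a reusable normalizer; the helper name `normalize_zip` and the sample values are illustrative only.

```python
import pandas as pd


def normalize_zip(series):
    """Return ZIP codes as 5-character strings with leading zeros restored."""
    return series.astype(str).str.strip().str.zfill(5)


# Mixed inputs: an integer, a short string, and a padded string with whitespace.
sample = pd.Series([601, "602", " 00603"])
print(normalize_zip(sample).tolist())
```

Applying the same helper to both DataFrames before merging avoids silent mismatches between keys such as `601` and `00601`.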


In [4]:
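# drop_duplicates returns a new DataFrame, so assign the result back
zips_df = zips_df.drop_duplicates()
LatLong_df = LatLong_df.drop_duplicates()

LatLong_df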

Unnamed: 0,ZIP,LAT,LNG
0,00601,18.180555,-66.749961
1,00602,18.361945,-67.175597
2,00603,18.455183,-67.119887
3,00606,18.158345,-66.932911
4,00610,18.295366,-67.125135
...,...,...,...
33139,99923,56.002315,-130.041026
33140,99925,55.550204,-132.945933
33141,99926,55.138352,-131.470424
33142,99927,56.239062,-133.457924


In [5]:
join_df = pd.merge(zips_df, LatLong_df, on='ZIP', how='inner')

join_df.head()

Unnamed: 0,ZIP,Borough,County,LAT,LNG
0,601,Adjuntas,Adjuntas,18.180555,-66.749961
1,602,Aguada,Aguada,18.361945,-67.175597
2,603,Aguadilla,Aguadilla,18.455183,-67.119887
3,606,Maricao,Maricao,18.158345,-66.932911
4,610,Anasco,Anasco,18.295366,-67.125135
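
An inner join silently drops ZIP codes that have no coordinates; for example, 604 and 605 appear in the first rows of `zips_df` but not in the first rows of `join_df`. One way to list such codes is a left join with pandas' `indicator=True` option. The sketch below uses toy frames whose values are illustrative.

```python
import pandas as pd

# Toy frames mirroring the columns used above; the values are illustrative.
zips = pd.DataFrame({"ZIP": ["00601", "00604", "00606"],
                     "Borough": ["Adjuntas", "Aguadilla", "Maricao"]})
coords = pd.DataFrame({"ZIP": ["00601", "00606"],
                       "LAT": [18.180555, 18.158345],
                       "LNG": [-66.749961, -66.932911]})

# A left join with indicator=True adds a _merge column that flags
# rows found only in the left frame.
check = zips.merge(coords, on="ZIP", how="left", indicator=True)
missing = check.loc[check["_merge"] == "left_only", "ZIP"].tolist()
print(missing)
```

Reviewing the unmatched codes before mapping shows whether any municipality is left without a marker.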


In [6]:
!pip install folium



In [11]:
import folium
# create map of Puerto Rico using latitude and longitude values
map_pr = folium.Map(location=[18.158345, -66.932911], zoom_start=10)

# add one circle marker per ZIP code to the map
for lat, lng, borough, county in zip(join_df['LAT'], join_df['LNG'], join_df['Borough'], join_df['County']):
    label = '{}, {}'.format(county, borough)
    label = folium.Popup(label, parse_html=True)
    folium.CircleMarker(
        [lat, lng],
        radius=5,
        popup=label,
        color='blue',
        fill=True,
        fill_color='#3186cc',
        fill_opacity=0.7).add_to(map_pr)
    
map_pr
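
The map above is centered on a hard-coded coordinate pair. As an alternative, the center can be derived from the data itself. This is a minimal sketch assuming the `LAT` and `LNG` column names used above; the helper name `map_center` and the sample points are hypothetical.

```python
import pandas as pd


def map_center(df, lat_col="LAT", lng_col="LNG"):
    """Return [lat, lng] at the mean of the given coordinate columns."""
    return [df[lat_col].mean(), df[lng_col].mean()]


# Two illustrative points; with real data, pass join_df instead.
points = pd.DataFrame({"LAT": [18.0, 18.4], "LNG": [-67.0, -66.0]})
center = map_center(points)
print(center)
```

The result can be passed straight to the map constructor, for example `folium.Map(location=map_center(join_df), zoom_start=9)`, so the view stays centered if the set of ZIP codes changes.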

<h1> 4. Results </h1>

<h1> 5. Discussion </h1>

<h1> 6. Conclusion</h1>