## 1. Introduction

Copenhagen is the capital and most populous city of Denmark. As of January 1, 2020, the city had a population of 794,128 across 179.8 km<sup>2</sup>, giving a population density of 4,417/km<sup>2</sup>. Copenhagen is a sprawling metropolis with a large number of venues for entertainment and leisure.

Copenhagen can be divided into a number of boroughs. An investor who wants to open a shop or venue is interested in the price of real estate as well as the saturation of venues in the area, and might look for a cheap location that is not overly saturated by similar venues. A prospective resident is also looking for a cheap location, but for them a large number of nearby venues is a positive. The target audience therefore includes both potential real estate investors and private individuals looking for a residential property.

Based on this, it becomes interesting to examine where you get the most bang for your buck - i.e. where an investor can get the cheapest real estate in an area not already saturated by other venues, and where a potential resident can minimize the cost of real estate while maximizing the number of nearby venues. 

To address these questions, it is possible to create a map with information on venue density and real estate price. From this map it will be possible to identify where the lowest real estate prices coincide with the lowest or highest density of venues.

## 2. Data

To solve the above problem, two data sets are needed: first, real estate prices for each borough / district, and second, information on venues.

The data was obtained as follows:

<ul>
    <li><b>Real Estate Prices</b> - Finans Danmark releases quarterly information on the average price per square meter for each borough in Copenhagen. This dataset is publicly available on their webpage, ref.: <a href="https://rkr.statistikbank.dk/statbank5a/SelectVarVal/Define.asp?MainTable=BM011">rkr.statistikbank.dk</a>. I have limited my selection to "Ejerlejlighed" (apartments) and Q3 2020 data. To later use this data in combination with a map, latitude and longitude are needed for each borough; these were obtained from Google and merged with the dataset from Finans Danmark.</li>
    <li><b>Venue data set</b> - The Foursquare API is used to obtain the most popular / most common venues for a given borough in Copenhagen.</li>
  
</ul>
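
The merge of prices and coordinates described above could look roughly like the sketch below. The column names (`Borough`, `DKK_per_m2`, `Latitude`, `Longitude`) are assumptions, and apart from the København N price quoted later in this report, all values are illustrative placeholders rather than the actual data.

```python
import pandas as pd

# Placeholder rows: only the København N price appears in this report;
# the Brønshøj price and all coordinates are rough, illustrative values.
prices = pd.DataFrame({
    "Borough": ["2200 København N", "2700 Brønshøj"],
    "DKK_per_m2": [45214, 30000],
})
coords = pd.DataFrame({
    "Borough": ["2200 København N", "2700 Brønshøj"],
    "Latitude": [55.69, 55.70],
    "Longitude": [12.55, 12.50],
})

# validate="one_to_one" raises if a borough appears twice on either side
boroughs = prices.merge(coords, on="Borough", how="left", validate="one_to_one")

# After a left join, boroughs without coordinates show up as NaN
missing = boroughs[boroughs["Latitude"].isna()]
```

A left join keeps every borough from the price table, so an empty `missing` frame is a quick check that no borough name was misspelled in the manually collected coordinates.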

By combining the data set of real estate prices (containing the average price per square meter as well as latitude and longitude for each borough) with the Foursquare API, I will analyze the venues across the boroughs of Copenhagen and finally visualize the combination of pricing and venues, in order to guide both investors looking for retail property and private individuals looking for a residential property.


## 3. Methodology

As described above, a dataset containing real estate prices as well as latitude and longitude was read into a pandas DataFrame, giving the following table:

![table 1.PNG](attachment:3e74a7b2-ee4e-4351-831b-d8a9136150bc.PNG)

Based on the above dataframe containing the latitude and longitude for the boroughs of Copenhagen, it is possible to visualize these using the Folium library in Python. Superimposing the boroughs on a map of Copenhagen gives the following result:

![Figure 1.PNG](attachment:fefbfc59-f058-4cb7-bbe4-3dd100f57055.PNG)

However, the purpose of this project was to perform further analysis on the boroughs in order to inform a potential investor or future resident about the real estate prices, as well as the saturation of nearby venues. In order to perform further analysis, we can use the Foursquare API as required in the Capstone project. Using the Foursquare API, a dataframe was created containing information on venues and which borough they were located in, resulting in the following table:


![Table 2.PNG](attachment:9003b230-731b-4eec-8f7b-762a6f4c7b7a.PNG)
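
The report does not state which Foursquare endpoint was queried. The sketch below assumes the legacy v2 `venues/explore` endpoint and its usual response layout (`response.groups[].items[].venue`); the helper names, the `radius` value and the version date are likewise assumptions. Only the limit of 100 venues comes from this project.

```python
from urllib.parse import urlencode

# Assumed endpoint: the legacy Foursquare v2 "explore" call
EXPLORE_URL = "https://api.foursquare.com/v2/venues/explore"

def explore_url(lat, lon, client_id, client_secret,
                version="20201231", radius=1000, limit=100):
    """Build the request URL for venues around one borough centre."""
    params = {
        "client_id": client_id,
        "client_secret": client_secret,
        "v": version,          # API version date, YYYYMMDD
        "ll": f"{lat},{lon}",
        "radius": radius,      # metres; assumed, not stated in the report
        "limit": limit,
    }
    return f"{EXPLORE_URL}?{urlencode(params)}"

def parse_venues(borough, payload):
    """Flatten one JSON response into rows, one per venue."""
    rows = []
    for group in payload.get("response", {}).get("groups", []):
        for item in group.get("items", []):
            venue = item["venue"]
            # Some venues may carry no category; fall back to None
            categories = venue.get("categories") or [{}]
            rows.append({
                "Borough": borough,
                "Venue": venue["name"],
                "Venue Latitude": venue["location"]["lat"],
                "Venue Longitude": venue["location"]["lng"],
                "Venue Category": categories[0].get("name"),
            })
    return rows
```

Calling `parse_venues` once per borough and passing the concatenated rows to `pd.DataFrame` would give a table with the same shape as the one shown above.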

The above table was created with a limit of 100 venues for every borough. To perform an initial analysis, the above table was grouped by borough to identify the number of venues retrieved using the Foursquare API for each borough. This resulted in the following table:

![Table 3.PNG](attachment:98fc51f1-d10f-458d-9773-973a3bcada33.PNG)
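
The grouping step can be sketched as below. The function name `count_venues`, the column names and the toy rows are assumptions for illustration; a small `limit` is used so that the toy data actually reaches it.

```python
import pandas as pd

# Toy venue table standing in for the Foursquare results
venues = pd.DataFrame({
    "Borough": ["N", "N", "N", "V"],
    "Venue": ["a", "b", "c", "d"],
})

def count_venues(venues, limit=100):
    """Count venues per borough and flag boroughs that hit the API limit."""
    counts = venues.groupby("Borough").size().rename("Venues").reset_index()
    counts["At limit"] = counts["Venues"] >= limit
    return counts.sort_values("Venues", ascending=False, ignore_index=True)

counts = count_venues(venues, limit=3)
```

Flagging the boroughs that reach the limit is useful because their counts are capped: the true number of venues there may be higher than 100, so those values are best read as lower bounds.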

It is clear that Kbh.K., Kbh.V., Frederiksberg C, København Ø and København N have all reached our limit of 100 venues. A simple analysis could therefore conclude that these are the most 'popular' neighbourhoods. To further examine this, it could be interesting to perform an analysis of the most common venues in each borough. This can be an important metric for both an investor and a potential future resident, as an investor would like to know the saturation of the venue they plan to open, while a potential future resident could have personal preferences towards certain categories of venues. A table showing the 5 most common venues in each borough was thus created:

![Table 4.PNG](attachment:e91d2978-a422-42f2-aad0-1e2ec45766a7.PNG)
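
One way to derive such a table with pandas is sketched below. The function name `top_categories` and the column names are assumptions, and the toy rows are placeholders; ties are broken alphabetically by category here, which may differ from the ordering in the table above.

```python
import pandas as pd

# Toy data: one row per venue returned for a borough
venues = pd.DataFrame({
    "Borough": ["N"] * 6 + ["V"],
    "Venue Category": ["Café", "Café", "Café", "Bar", "Bar", "Pizza Place", "Bar"],
})

def top_categories(venues, n=5):
    """Return the n most common venue categories per borough with counts."""
    counts = (
        venues.groupby(["Borough", "Venue Category"])
        .size()
        .rename("Count")
        .reset_index()
    )
    # Sort by count within each borough; category name breaks ties
    counts = counts.sort_values(
        ["Borough", "Count", "Venue Category"],
        ascending=[True, False, True],
    )
    return counts.groupby("Borough").head(n).reset_index(drop=True)

top = top_categories(venues, n=2)
```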

As the purpose of this project was to provide clear and concise information to potential investors or future residents, the above result needed to be visualized. To create a map showing the price in DKK per m<sup>2</sup> as well as the most popular venues, we need a dataframe with a column that combines the most common venue categories into a single string, giving the following table:

![Table 5.PNG](attachment:26d859da-f94f-4745-8008-3cc860ce6fd9.PNG)
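
The combined string column could be built along the following lines. The function name `summarise` and the column names are assumptions; the København N counts are the ones reported in the Results section.

```python
import pandas as pd

# Top categories for one borough, using the counts quoted in this report
top = pd.DataFrame({
    "Borough": ["2200 København N"] * 5,
    "Venue Category": ["Café", "Coffee Shop", "Beer Bar", "Wine Bar", "Pizza Place"],
    "Count": [9, 9, 7, 6, 5],
})

def summarise(top):
    """Collapse the per-category rows into one label string per borough."""
    labels = top["Venue Category"] + " (" + top["Count"].astype(str) + ")"
    return (
        labels.groupby(top["Borough"], sort=False)
        .agg(", ".join)
        .rename("Most Common Venues")
        .reset_index()
    )

summary = summarise(top)
```

Keeping one row per borough makes the result easy to merge with the price table on the borough name.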

## 4. Results

With knowledge of the most popular venues in each borough, we can now combine the above table into a final results table. Our final results table thus contains information about the price per square meter for each borough as well as the information obtained about the most common category of venue in each of the boroughs:  

![Table 6.PNG](attachment:823a725b-8e2e-4b06-ba00-31c1a7c944d0.PNG)

Based on the above final results table, we can now visualize this on a map of Copenhagen showing the price per square meter for each borough, as well as the 5 most common venue categories in each borough:

![Figure 2.PNG](attachment:68a9a222-36bd-43d3-b4ea-91ae1a721946.PNG)
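
The text shown for each borough on such a map can be assembled as in the sketch below. The helper `popup_html` and its layout are assumptions, not the exact code behind the figure.

```python
from html import escape

def popup_html(borough, price_per_m2, venues):
    """Build a small HTML label with the price and most common venues."""
    price = f"{price_per_m2:,.0f}"  # thousands separator, no decimals
    return (
        f"<b>{escape(borough)}</b><br>"
        f"DKK {price} per m²<br>"
        f"{escape(venues)}"
    )

label = popup_html(
    "2200 København N",
    45214,
    "Café (9), Coffee Shop (9), Beer Bar (7), Wine Bar (6), Pizza Place (5)",
)
```

The resulting string could then be attached to a marker, for example through `folium.Popup(label, max_width=300)`. Escaping the borough and venue text guards against category names that contain characters such as `&`.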

From the above map we can, as an example, see that the average price per square meter for an apartment in 2200 København N is DKK 45,214, with the most common venues in the borough being:

<ol>
  <li>Café (with 9 venues)</li>
  <li>Coffee Shop (also 9 venues)</li>
  <li>Beer Bar (with 7 venues)</li>
  <li>Wine Bar (with 6 venues)</li>
  <li>Pizza Place (with 5 venues)</li>
</ol>

## 5. Discussion

Kbh.K., Kbh.V., Frederiksberg C, København Ø and København N are the most 'popular' boroughs, with all of them reaching the limit of 100 venues. This is in line with what one would expect, as these are the most central areas of Copenhagen. If all a private resident values is the number of venues, it follows that you get the most bang for your buck in 2200 København N, as it has the lowest price per square meter of the boroughs listed above.

On the other hand, if price is most important, it is clear that 2700 Brønshøj is the cheapest area to live in. However, the number of venues there is also drastically lower, proportionally even more so than the price, which shows that there is no linear relationship between price and number of venues.
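
One simple way to put a number on this trade-off is the count of venues per 1,000 DKK of square-meter price. The sketch below uses made-up boroughs and values purely for illustration; the metric itself and the column names are assumptions and were not part of the analysis.

```python
import pandas as pd

# Hypothetical boroughs and values, for illustration only
summary = pd.DataFrame({
    "Borough": ["Borough A", "Borough B"],
    "DKK_per_m2": [45000, 30000],
    "Venues": [100, 20],
})

# Venues obtained per 1,000 DKK of price per square meter
summary["Venues_per_1000_DKK"] = summary["Venues"] / (summary["DKK_per_m2"] / 1000)

ranked = summary.sort_values("Venues_per_1000_DKK", ascending=False, ignore_index=True)
```

Because the venue count is capped at 100 per borough, such a ratio would understate the most central boroughs and should be read with that in mind.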

While no specific recommendation can be made based on the tables and maps for a future resident, it enables a potential resident to make an informed decision about where to live. 

From an investment perspective, it can, for example, be observed that the downtown areas of Copenhagen have significantly more bars than the outskirts of Copenhagen, where grocery stores are more common. As a specific example, the most common venue in 1800-1999 Frederiksberg is the cocktail bar (a total of 6 in the borough), whereas for 2720 Vanløse the most common venue is the pizza place (6 venues in the borough).

It can thus be inferred that if an investor wants to differentiate themselves from the competition, they should perhaps avoid opening a cocktail bar, French restaurant, Italian restaurant or the like in 1800-1999 Frederiksberg. However, it can also be assumed that the types of venues in a given borough correlate with the demographic of the borough, so it is a balancing act to differentiate your venue sufficiently from others while also being surrounded by the appropriate demographic. Similar observations can be made for every borough.

As with potential future residents, no specific recommendation for investors can be made; however, the tables and maps enable an investor to make a decision on an informed basis.


## 6. Conclusion

It was found that venues are concentrated in central Copenhagen, in the boroughs Kbh.K., Kbh.V., Frederiksberg C, København Ø and København N.

The resulting tables and maps enable both investors and future private residents to make a more informed decision about their purchase of property than they otherwise could.

