# AirBnB NY Locations Data Case Study

In this final project, your task is to use the data provided to find evidence that answers the following questions.

1. How many neighborhood groups are there and which shows up the most?
2. Are private rooms the most popular in Manhattan?
3. Who are the top 10 busiest hosts based on number of reviews per month?
4. Which neighborhood group has the highest average price? Plot the average prices in a professional-looking bar chart.
5. What is the most expensive neighbourhood in each neighbourhood group?
6. Which host has the highest minimum rental? (Minimum rental is based on price and minimum nights.)
7. Who currently has no (zero) availability with a review count of 100 or more?
8. If each host rents all of their properties for the night, which 5 make the most that night?
9. When did Danielle from Queens last receive a review?
10. Display a bar chart showing the number of each type of room in the Bronx.

You will be given **4 hours** to complete this assignment.
**Be advised:** I will go dark for the entire assignment period. Any questions about the data or the project **MUST** be asked before the time starts. Once the time has started, I can no longer give information.

This is to simulate what you will face when you are out in the wild. 

Happy Coding!

In [1]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

In [2]:
air_bnb = pd.read_csv('./AB_NYC_2019.csv')
air_bnb.head()


Unnamed: 0,id,name,host_id,host_name,neighbourhood_group,neighbourhood,latitude,longitude,room_type,price,minimum_nights,number_of_reviews,last_review,reviews_per_month,calculated_host_listings_count,availability_365
0,2539,Clean & quiet apt home by the park,2787,John,Brooklyn,Kensington,40.64749,-73.97237,Private room,149,1,9,2018-10-19,0.21,6,365
1,2595,Skylit Midtown Castle,2845,Jennifer,Manhattan,Midtown,40.75362,-73.98377,Entire home/apt,225,1,45,2019-05-21,0.38,2,355
2,3647,THE VILLAGE OF HARLEM....NEW YORK !,4632,Elisabeth,Manhattan,Harlem,40.80902,-73.9419,Private room,150,3,0,,,1,365
3,3831,Cozy Entire Floor of Brownstone,4869,LisaRoxanne,Brooklyn,Clinton Hill,40.68514,-73.95976,Entire home/apt,89,1,270,2019-07-05,4.64,1,194
4,5022,Entire Apt: Spacious Studio/Loft by central park,7192,Laura,Manhattan,East Harlem,40.79851,-73.94399,Entire home/apt,80,10,9,2018-11-19,0.1,1,0


In [3]:
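# How many neighbourhood groups are there, and which shows up the most?
# describe() on a string column reports the unique count, the top value, and its frequency
air_bnb['neighbourhood_group'].describe()
# Answer: 5 groups; Manhattan shows up the most (21,661 listings)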

count         48895
unique            5
top       Manhattan
freq          21661
Name: neighbourhood_group, dtype: object
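
The `describe()` output only shows the top group. To see the count for every group, `value_counts()` is a common alternative. The sketch below uses a small made-up frame (`toy`) rather than the real CSV, so the numbers are illustrative only.

```python
import pandas as pd

# Toy stand-in for air_bnb; the rows are made up for illustration.
toy = pd.DataFrame({
    "neighbourhood_group": [
        "Manhattan", "Brooklyn", "Manhattan", "Queens", "Manhattan", "Brooklyn",
    ],
})

# Listings per group, sorted from most to least frequent.
group_counts = toy["neighbourhood_group"].value_counts()

# Number of distinct groups and the most frequent one.
n_groups = toy["neighbourhood_group"].nunique()
most_common = group_counts.idxmax()

print(n_groups, most_common)
print(group_counts)
```

Swapping `toy` for `air_bnb` should give the same two facts as `describe()`, plus the full breakdown.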

In [4]:
# Are private rooms the most popular in Manhattan?
air_bnb[air_bnb['neighbourhood_group'] == 'Manhattan']['room_type'].describe()

# No: Entire home/apt is the most common room type in Manhattan (13,199 of 21,661 listings)

count               21661
unique                  3
top       Entire home/apt
freq                13199
Name: room_type, dtype: object

In [6]:
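# Who are the top 10 busiest hosts based on number of reviews per month?
# numeric_only=True keeps text columns (name, room_type, ...) out of the sum
best_host = air_bnb.groupby(['host_id', 'host_name']).sum(numeric_only=True)
# Ascending sort, so the busiest host is the last row
best_host.sort_values('reviews_per_month').tail(10)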

host_id,host_name,id,latitude,longitude,price,minimum_nights,number_of_reviews,reviews_per_month,calculated_host_listings_count,availability_365
37312959,Maya,51232186,203.85153,-369.38202,213,5,2273,53.53,25,824
156684502,Nalicia,66477294,121.98566,-221.3088,180,3,1046,54.38,9,77
344035,Brooklyn& Breakfast -Len-,155403744,528.8214,-961.61568,970,13,2205,56.0,169,3723
156948703,Asad,135792224,244.61926,-443.23651,588,6,1052,56.44,36,2038
224414117,Gabriel,921229890,1222.67553,-2219.92301,6394,30,403,59.1,900,8679
137274917,David,345517389,489.12293,-887.88925,1320,12,607,62.89,144,2827
26432133,Danielle,74542317,203.81845,-369.35591,236,5,2017,68.02,25,1443
232251881,Lakshmee,254974885,325.33942,-590.2725,403,8,482,80.63,64,1369
244361589,Row NYC,294108503,366.82627,-665.89623,1845,9,305,111.72,81,2633
219517861,Sonder (NYC),10885561678,13316.25823,-24198.18856,82795,4353,1281,397.56,106929,98588
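
Summing every numeric column also produces meaningless totals such as summed latitudes. A narrower sketch selects only `reviews_per_month` before aggregating. The frame `toy` and its values are made up for illustration.

```python
import pandas as pd

# Toy stand-in for air_bnb; hosts and values are made up.
toy = pd.DataFrame({
    "host_id": [1, 1, 2, 3, 3],
    "host_name": ["Ann", "Ann", "Bob", "Cy", "Cy"],
    "reviews_per_month": [1.5, 2.0, 3.0, 0.5, None],
})

# Sum reviews_per_month across each host's listings (NaN is skipped),
# then keep the hosts with the largest totals, busiest first.
top_hosts = (
    toy.groupby(["host_id", "host_name"])["reviews_per_month"]
    .sum()
    .nlargest(2)
)
print(top_hosts)
```

On the real data, `nlargest(10)` would return the same ten hosts as the table above, in descending order.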


In [9]:
# Which neighbourhood group has the highest average price? Plot the average prices in a bar chart
# Select 'price' before mean() so text columns are never averaged
average = air_bnb.groupby('neighbourhood_group')['price'].mean().sort_values(ascending=False)
average

neighbourhood_group
Manhattan        196.875814
Brooklyn         124.383207
Staten Island    114.812332
Queens            99.517649
Bronx             87.496792
Name: price, dtype: float64
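
The question also asks for a bar chart, which the cell above does not draw. One possible sketch with matplotlib follows. The series `avg_price` holds the averages from the output above, rounded to one decimal; the title, labels, and styling are assumptions, not requirements from the assignment.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Averages copied from the output above, rounded to one decimal place.
avg_price = pd.Series(
    {
        "Manhattan": 196.9,
        "Brooklyn": 124.4,
        "Staten Island": 114.8,
        "Queens": 99.5,
        "Bronx": 87.5,
    },
    name="price",
)

fig, ax = plt.subplots(figsize=(8, 5))
bars = ax.bar(avg_price.index, avg_price.values, color="steelblue")

# Title and axis labels make the chart readable on its own.
ax.set_title("Average price by neighbourhood group")
ax.set_xlabel("Neighbourhood group")
ax.set_ylabel("Average price")
ax.grid(axis="y", alpha=0.3)
fig.tight_layout()
# In a notebook the figure renders at the end of the cell;
# in a script, call plt.show() or fig.savefig(...) here.
```

In the notebook itself, passing `average.index` and `average.values` in place of `avg_price` would plot the computed values directly.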

In [17]:
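# What is the most expensive neighbourhood in each neighbourhood_group?
# Note: this lists the 10 most expensive neighbourhoods overall, not one per group
air_bnb.groupby(['neighbourhood_group', 'neighbourhood']).mean(numeric_only=True).nlargest(10, 'price')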

neighbourhood_group,neighbourhood,id,host_id,latitude,longitude,price,minimum_nights,number_of_reviews,reviews_per_month,calculated_host_listings_count,availability_365
Staten Island,Fort Wadsworth,20305160.0,1715301.0,40.59546,-74.06092,800.0,7.0,0.0,,3.0,365.0
Staten Island,Woodrow,1798271.0,9430973.0,40.53884,-74.19826,700.0,7.0,0.0,,1.0,0.0
Manhattan,Tribeca,19694560.0,52744500.0,40.717744,-74.0074,490.638418,11.378531,11.491525,0.834909,42.971751,126.508475
Brooklyn,Sea Gate,20179550.0,116866500.0,40.577119,-74.00872,487.857143,4.142857,1.428571,0.33,1.857143,65.571429
Bronx,Riverdale,18602900.0,68906780.0,40.887352,-73.912064,442.090909,5.363636,26.636364,1.32,1.545455,160.818182
Staten Island,Prince's Bay,27558070.0,160410200.0,40.5257,-74.201255,409.5,4.75,5.75,1.885,1.0,110.5
Manhattan,Battery Park City,18310940.0,67655550.0,40.709964,-74.016585,367.557143,30.328571,8.285714,0.675556,9.442857,100.328571
Manhattan,Flatiron District,17304950.0,59993910.0,40.741052,-73.988413,341.925,6.225,17.7375,1.251207,7.3875,123.05
Staten Island,Randall Manor,23293230.0,121565100.0,40.632749,-74.121504,336.0,2.578947,18.0,1.596471,2.473684,275.210526
Manhattan,NoHo,19080550.0,66588090.0,40.72701,-73.993052,295.717949,5.987179,12.74359,0.801935,8.628205,101.820513
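
The top-10 table does not include Queens at all, so it cannot answer the question for every group. A sketch of one way to get exactly one neighbourhood per group is shown here, on a made-up frame (`toy`) with invented prices.

```python
import pandas as pd

# Toy stand-in for air_bnb; prices are made up for illustration.
toy = pd.DataFrame({
    "neighbourhood_group": ["Manhattan", "Manhattan", "Manhattan",
                            "Brooklyn", "Brooklyn", "Brooklyn"],
    "neighbourhood": ["Tribeca", "Tribeca", "Harlem",
                      "Sea Gate", "Bushwick", "Bushwick"],
    "price": [500, 480, 120, 490, 60, 80],
})

# Average price per (group, neighbourhood) pair, as a flat DataFrame.
avg_price = toy.groupby(
    ["neighbourhood_group", "neighbourhood"], as_index=False
)["price"].mean()

# Sort by price, then keep the first (most expensive) row of each group.
most_expensive = (
    avg_price.sort_values("price", ascending=False)
    .drop_duplicates("neighbourhood_group")
    .set_index("neighbourhood_group")
)
print(most_expensive)
```

Ranking by mean price is an assumption about what "most expensive" means; a median would be less sensitive to a single very high listing, as with the one-listing neighbourhoods at the top of the table above.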


In [29]:
# Which host has the highest minimum rental? (minimum rental = price * minimum nights)
air_bnb['minimum_rental'] = air_bnb['minimum_nights'] * air_bnb['price']

# describe() summarises the 10 largest minimum rentals; it does not name the host
air_bnb.nlargest(10, 'minimum_rental').describe()

Unnamed: 0,id,host_id,latitude,longitude,price,minimum_nights,number_of_reviews,reviews_per_month,calculated_host_listings_count,availability_365,min_price,minimum_rental
count,10.0,10.0,10.0,10.0,10.0,10.0,10.0,3.0,10.0,10.0,10.0,10.0
mean,19915520.0,60186630.0,40.728152,-73.954646,4833.1,279.8,1.8,0.143333,1.1,232.7,663058.1,663058.1
std,12929670.0,82173760.0,0.027821,0.053964,3951.122891,281.916378,3.457681,0.10504,0.316228,166.949127,324636.7,324636.7
min,1615764.0,3906464.0,40.68807,-74.0162,400.0,30.0,0.0,0.04,1.0,0.0,300000.0,300000.0
25%,9743424.0,8885541.0,40.712142,-73.986255,1910.0,103.5,0.0,0.09,1.0,83.0,374297.5,374297.5
50%,18199950.0,31226480.0,40.717095,-73.965135,3275.0,180.0,0.0,0.14,1.0,352.5,604400.0,604400.0
75%,32883630.0,67660650.0,40.748757,-73.92527,9124.25,365.0,1.5,0.195,1.0,363.5,956863.2,956863.2
max,36056810.0,271248700.0,40.77213,-73.83746,10000.0,1000.0,10.0,0.25,2.0,365.0,1170000.0,1170000.0
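
The summary statistics show that the largest minimum rental is 1,170,000, but not who owns that listing. A sketch of how the host could be looked up is given here, using a made-up frame (`toy`) with invented hosts and prices.

```python
import pandas as pd

# Toy stand-in for air_bnb; hosts and values are made up.
toy = pd.DataFrame({
    "host_id": [10, 11, 12],
    "host_name": ["Ann", "Bob", "Cy"],
    "price": [100, 400, 50],
    "minimum_nights": [30, 2, 365],
})

# Minimum rental = nightly price times the minimum number of nights.
toy["minimum_rental"] = toy["price"] * toy["minimum_nights"]

# idxmax() returns the row label of the largest value; .loc pulls that row.
top_listing = toy.loc[
    toy["minimum_rental"].idxmax(),
    ["host_id", "host_name", "minimum_rental"],
]
print(top_listing)
```

If several listings tie for the maximum, `idxmax()` returns only the first, so `nlargest(1, 'minimum_rental', keep='all')` may be the safer choice on the real data.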


In [32]:
# Who has no (zero) availability with a review count of 100 or more?
no_avail = air_bnb[(air_bnb['availability_365'] == 0) & (air_bnb['number_of_reviews'] >= 100)]

# no_avail is already filtered, so only the columns of interest need selecting
no_avail[['host_name', 'number_of_reviews', 'availability_365']]

Unnamed: 0,host_name,number_of_reviews,availability_365
8,MaryEllen,118,0
94,Christiana,168,0
132,Sol,193,0
174,Coral,114,0
180,Doug,206,0
...,...,...,...
29581,Kathleen,103,0
30461,Janet,119,0
31250,Albert,102,0
32670,Stephany,131,0


In [33]:
# If each host rents all of their properties for the night, which 5 make the most that night?
# Summing 'price' per host gives the revenue for one fully booked night
air_bnb.groupby(['host_id', 'host_name']).sum(numeric_only=True).nlargest(5, 'price')

host_id,host_name,id,latitude,longitude,price,minimum_nights,number_of_reviews,reviews_per_month,calculated_host_listings_count,availability_365,min_price,minimum_rental
219517861,Sonder (NYC),10885561678,13316.25823,-24198.18856,82795,4353,1281,397.56,106929,98588,950453,950453
107434423,Blueground,7210036953,9451.60418,-17166.13165,70331,7470,29,6.04,53824,58884,2258580,2258580
156158778,Sally,332529233,488.73929,-887.71735,37097,12,1,1.0,144,776,37097,37097
205031545,Red Awning,1415225676,1996.92821,-3624.34656,35294,750,127,21.21,2401,10796,525450,525450
30283594,Kara,1611854192,4931.41347,-8952.50779,33581,3767,65,3.94,14641,37924,1164243,1164243


In [39]:
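# When did Danielle from Queens last receive a review?
# Filter on both the host name and the neighbourhood group
danielle_queens = air_bnb[(air_bnb['host_name'] == 'Danielle') & (air_bnb['neighbourhood_group'] == 'Queens')]
danielle_queens[['host_name', 'last_review']].sort_values('last_review', ascending=False)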

air_bnb

Unnamed: 0,id,name,host_id,host_name,neighbourhood_group,neighbourhood,latitude,longitude,room_type,price,minimum_nights,number_of_reviews,last_review,reviews_per_month,calculated_host_listings_count,availability_365,min_price,minimum_rental
0,2539,Clean & quiet apt home by the park,2787,John,Brooklyn,Kensington,40.64749,-73.97237,Private room,149,1,9,2018-10-19,0.21,6,365,149,149
1,2595,Skylit Midtown Castle,2845,Jennifer,Manhattan,Midtown,40.75362,-73.98377,Entire home/apt,225,1,45,2019-05-21,0.38,2,355,225,225
2,3647,THE VILLAGE OF HARLEM....NEW YORK !,4632,Elisabeth,Manhattan,Harlem,40.80902,-73.94190,Private room,150,3,0,,,1,365,450,450
3,3831,Cozy Entire Floor of Brownstone,4869,LisaRoxanne,Brooklyn,Clinton Hill,40.68514,-73.95976,Entire home/apt,89,1,270,2019-07-05,4.64,1,194,89,89
4,5022,Entire Apt: Spacious Studio/Loft by central park,7192,Laura,Manhattan,East Harlem,40.79851,-73.94399,Entire home/apt,80,10,9,2018-11-19,0.10,1,0,800,800
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
48890,36484665,Charming one bedroom - newly renovated rowhouse,8232441,Sabrina,Brooklyn,Bedford-Stuyvesant,40.67853,-73.94995,Private room,70,2,0,,,2,9,140,140
48891,36485057,Affordable room in Bushwick/East Williamsburg,6570630,Marisol,Brooklyn,Bushwick,40.70184,-73.93317,Private room,40,4,0,,,2,36,160,160
48892,36485431,Sunny Studio at Historical Neighborhood,23492952,Ilgar & Aysel,Manhattan,Harlem,40.81475,-73.94867,Entire home/apt,115,10,0,,,1,27,1150,1150
48893,36485609,43rd St. Time Square-cozy single bed,30985759,Taz,Manhattan,Hell's Kitchen,40.75751,-73.99112,Shared room,55,1,0,,,6,2,55,55
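
The output above is the full frame, so the date itself never appears. A sketch of one way to extract it follows: convert `last_review` to datetimes, filter on host name and neighbourhood group, and take the maximum. The frame `toy` and its dates are made up for illustration.

```python
import pandas as pd

# Toy stand-in for air_bnb; names and dates are made up.
toy = pd.DataFrame({
    "host_name": ["Danielle", "Danielle", "Danielle", "John"],
    "neighbourhood_group": ["Queens", "Queens", "Manhattan", "Queens"],
    "last_review": ["2019-07-06", "2019-06-20", "2019-07-08", None],
})

# Parse the date strings; missing values become NaT.
toy["last_review"] = pd.to_datetime(toy["last_review"])

# Keep only Danielle's listings in Queens.
mask = (toy["host_name"] == "Danielle") & (toy["neighbourhood_group"] == "Queens")

# max() on a datetime column returns the most recent date and skips NaT.
latest = toy.loc[mask, "last_review"].max()
print(latest.date())
```

More than one host may be named Danielle, so on the real data it may be worth checking `host_id` within the filtered rows before reporting a single date.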


In [None]:
# bar chart showing the number of room types in the Bronx
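
This cell was left empty. A minimal sketch of one way to fill it is to count room types for Bronx listings with `value_counts()` and plot the counts. The frame `toy` is made up, and the title and labels are assumptions.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Toy stand-in for air_bnb; the rows are made up for illustration.
toy = pd.DataFrame({
    "neighbourhood_group": ["Bronx", "Bronx", "Bronx", "Bronx", "Queens"],
    "room_type": ["Private room", "Private room", "Entire home/apt",
                  "Shared room", "Private room"],
})

# Count each room type among Bronx listings only.
bronx_rooms = toy.loc[
    toy["neighbourhood_group"] == "Bronx", "room_type"
].value_counts()

fig, ax = plt.subplots(figsize=(6, 4))
ax.bar(bronx_rooms.index, bronx_rooms.values, color="steelblue")
ax.set_title("Number of listings by room type in the Bronx")
ax.set_xlabel("Room type")
ax.set_ylabel("Number of listings")
fig.tight_layout()
# In a notebook the figure renders at the end of the cell;
# in a script, call plt.show() or fig.savefig(...) here.
```

Replacing `toy` with `air_bnb` should produce the chart for the real data.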


# Final Conclusion

In this cell, write your final conclusion for each of the questions asked.

Also, if you uncovered any details that were not asked about above, please describe them here.

-- Add your conclusion --

In [None]:
# Not bad. I definitely need to, and want to, play with this more later on.