![NYC Skyline](nyc.jpg)

Welcome to New York City, one of the most-visited cities in the world. Many Airbnb listings in New York City meet the high demand for temporary lodging from travelers, with stays ranging anywhere from a few nights to many months. In this project, we will take a closer look at the New York Airbnb market by combining data from multiple file types: `.csv`, `.tsv`, and `.xlsx`.

Recall that **CSV**, **TSV**, and **Excel** files are three common formats for storing data. 
Three files containing data on 2019 Airbnb listings are available to you:

**data/airbnb_price.csv**
This is a CSV file containing data on Airbnb listing prices and locations.
- **`listing_id`**: unique identifier of listing
- **`price`**: nightly listing price in USD
- **`nbhood_full`**: name of borough and neighborhood where listing is located

**data/airbnb_room_type.xlsx**
This is an Excel file containing data on Airbnb listing descriptions and room types.
- **`listing_id`**: unique identifier of listing
- **`description`**: listing description
- **`room_type`**: Airbnb has three types of rooms: shared rooms, private rooms, and entire homes/apartments

**data/airbnb_last_review.tsv**
This is a TSV file containing data on Airbnb host names and review dates.
- **`listing_id`**: unique identifier of listing
- **`host_name`**: name of listing host
- **`last_review`**: date when the listing was last reviewed
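
All three files share the `listing_id` column, so once loaded they can be joined into a single table. The following is a minimal sketch of that idea, not the project's own solution. It uses small in-memory stand-ins for the CSV and TSV files so that it runs without the `data/` folder; the sample rows and values are made up for illustration.

```python
import io

import pandas as pd

# Hypothetical sample rows standing in for data/airbnb_price.csv
price_csv = io.StringIO(
    'listing_id,price,nbhood_full\n'
    '1,225 dollars,"Manhattan, Midtown"\n'
    '2,89 dollars,"Brooklyn, Clinton Hill"\n'
)

# Hypothetical sample rows standing in for data/airbnb_last_review.tsv
review_tsv = io.StringIO(
    'listing_id\thost_name\tlast_review\n'
    '1\tJennifer\t2019-05-21\n'
    '2\tLisa\t2019-07-05\n'
)

# A TSV file is read with read_csv by setting the separator to a tab
price = pd.read_csv(price_csv)
last_review = pd.read_csv(review_tsv, sep='\t')

# Join the two tables on their shared key
listings = price.merge(last_review, on='listing_id', how='inner')
print(listings.columns.tolist())
```

An inner join keeps only listings that appear in both tables; `how='outer'` would keep unmatched listings as well, with missing values in the columns from the other file.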

In [3]:
# We've loaded your first package for you! You can add as many cells as you need.
import numpy as np
import pandas as pd

# Import the three files in the data folder
price = pd.read_csv('data/airbnb_price.csv')
# read_excel needs the optional openpyxl package to read .xlsx files
room_type = pd.read_excel('data/airbnb_room_type.xlsx')
last_review = pd.read_csv('data/airbnb_last_review.tsv', sep='\t')

# The dates of the earliest and most recent reviews:
# parse the review dates and keep only the calendar date
last_review['last_review'] = pd.to_datetime(last_review['last_review']).dt.date

first_reviewed = last_review['last_review'].min()
last_reviewed = last_review['last_review'].max()
print(f'The date of the earliest review is {first_reviewed} and the most recent is {last_reviewed}')

# Number of private room listings:
# normalize capitalization first so that all spellings of "private room" match
room_type['room_type'] = room_type['room_type'].str.title()
nb_private_rooms = (room_type['room_type'] == 'Private Room').sum()
print(f'There are {nb_private_rooms} listings of private rooms')

# Average listing price:
# Remove the ' dollars' suffix and convert to int. str.replace is used because
# str.strip treats its argument as a set of characters, not as a suffix.
price['price'] = price['price'].str.replace(' dollars', '', regex=False).astype(int)
avg_price = round(price['price'].mean(), 2)
print(f'The average listing price is {avg_price}')

# Collect the results in a one-row DataFrame
summary = {
    'first_reviewed': [first_reviewed],
    'last_reviewed': [last_reviewed],
    'nb_private_rooms': [nb_private_rooms],
    'avg_price': [avg_price],
}
review_dates = pd.DataFrame(summary)
print(review_dates)




ImportError: Missing optional dependency 'openpyxl'.  Use pip or conda to install openpyxl.
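
The `ImportError` above is raised by `pd.read_excel`, which relies on the optional `openpyxl` package to read `.xlsx` files. As a hedged sketch, the check below tests for the package before the Excel file is read, so that a missing dependency is reported up front. The helper name `has_module` is made up for illustration.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module called `name` can be found."""
    return importlib.util.find_spec(name) is not None


if has_module('openpyxl'):
    print('openpyxl is available; pd.read_excel can read .xlsx files.')
else:
    print('openpyxl is missing; install it with pip or conda first.')
```

After installing the package (for example with `pip install openpyxl`), the kernel may need a restart before the cell runs cleanly.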