# Lesson 5: Vacation planning using CSV files

In this lesson you'll learn to read in and work with data stored in CSV format. Data of this type looks like a table with rows and columns, and is referred to by programmers as **structured data**.

As always, begin by loading the helper functions you'll use:

In [32]:
# Imports
from helper_functions import get_llm_response, print_llm_response, display_table
from IPython.display import Markdown, display
import csv

Note that `import csv` here is new. Don't worry about the details for now, but this line of code will be used later to read in CSV data. You'll learn more about this code in Course 4.

## Loading data from a CSV file

You'll use the file ```itinerary.csv```, which has information about arrival and departure dates for each destination in a trip around the world.

Here is the code to load the file - the first part is the same as you've been using up to this point:

In [33]:
f = open("itinerary.csv", 'r')

The next part, where you read the data in from the file, is different because you are now reading in a CSV file:

In [34]:
csv_reader = csv.DictReader(f)
itinerary = []
for row in csv_reader:
    print(row)
    itinerary.append(row)

{'Arrival': 'July-01', 'Departure': 'July-08', 'City': 'New York', 'Country': 'USA'}
{'Arrival': 'July-09', 'Departure': 'July-16', 'City': 'Rio de Janeiro', 'Country': 'Brazil'}
{'Arrival': 'July-17', 'Departure': 'July-24', 'City': 'Cape Town', 'Country': 'South Africa'}
{'Arrival': 'July-25', 'Departure': 'August-01', 'City': 'Istanbul', 'Country': 'Turkey'}
{'Arrival': 'August-02', 'Departure': 'August-09', 'City': 'Paris', 'Country': 'France'}
{'Arrival': 'August-10', 'Departure': 'August-17', 'City': 'Tokyo', 'Country': 'Japan'}
{'Arrival': 'August-18', 'Departure': 'August-25', 'City': 'Sydney', 'Country': 'Australia'}


Now close the file:

In [None]:
f.close()
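Python also has a `with` statement that closes the file for you as soon as the indented block finishes. The sketch below is an illustration, not part of the lesson's code: it first writes a tiny two-row sample file (the name `sample_itinerary.csv` and the temporary folder are assumptions, chosen so the example runs on its own) and then reads it back the same way as above.

```python
import csv
import os
import tempfile

# Assumption: a small sample file created only for this illustration
sample_path = os.path.join(tempfile.mkdtemp(), "sample_itinerary.csv")
with open(sample_path, "w", newline="") as f:
    f.write("Arrival,Departure,City,Country\n")
    f.write("July-01,July-08,New York,USA\n")
    f.write("August-10,August-17,Tokyo,Japan\n")

# Read the rows; the file is closed automatically when the block ends
sample_itinerary = []
with open(sample_path, "r", newline="") as f:
    csv_reader = csv.DictReader(f)
    for row in csv_reader:
        sample_itinerary.append(row)

# Check that the file was closed without calling f.close()
print(f.closed)
print(sample_itinerary)
```

With this pattern there is no separate `f.close()` step to forget.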

You can print the itinerary to view its contents and use the `type` function to check the data type:

In [35]:
print(itinerary)

[{'Arrival': 'July-01', 'Departure': 'July-08', 'City': 'New York', 'Country': 'USA'}, {'Arrival': 'July-09', 'Departure': 'July-16', 'City': 'Rio de Janeiro', 'Country': 'Brazil'}, {'Arrival': 'July-17', 'Departure': 'July-24', 'City': 'Cape Town', 'Country': 'South Africa'}, {'Arrival': 'July-25', 'Departure': 'August-01', 'City': 'Istanbul', 'Country': 'Turkey'}, {'Arrival': 'August-02', 'Departure': 'August-09', 'City': 'Paris', 'Country': 'France'}, {'Arrival': 'August-10', 'Departure': 'August-17', 'City': 'Tokyo', 'Country': 'Japan'}, {'Arrival': 'August-18', 'Departure': 'August-25', 'City': 'Sydney', 'Country': 'Australia'}]


In [None]:
type(itinerary)
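No output was saved for the cell above. As a rough sketch of what to expect, the example below rebuilds two rows by hand (copied from the printed output earlier, and stored in a separate `sample_itinerary` variable so nothing in the lesson is overwritten) and checks the type and length. The first line it prints is `<class 'list'>`.

```python
# Two rows retyped from the printed itinerary (sample data for this sketch)
sample_itinerary = [
    {"Arrival": "July-01", "Departure": "July-08", "City": "New York", "Country": "USA"},
    {"Arrival": "July-09", "Departure": "July-16", "City": "Rio de Janeiro", "Country": "Brazil"},
]

# type tells you what kind of object the variable holds
itinerary_type = type(sample_itinerary)
print(itinerary_type)

# len tells you how many rows are in the list
number_of_stops = len(sample_itinerary)
print(number_of_stops)
```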

Now take a look at the first item:
* Remember that the first item in a list has index 0

In [36]:
# Print item 0 
print(itinerary[0])

{'Arrival': 'July-01', 'Departure': 'July-08', 'City': 'New York', 'Country': 'USA'}


This is a dictionary. You can access a particular value by passing in the key - let's look at the `Country` value in the first row of the itinerary:

In [37]:
print(itinerary[0]["Country"])

USA
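The keys of each row are the column names from the first line of the CSV file, so only those names work inside the square brackets. The sketch below uses a hand-typed copy of the first row (named `first_stop` here just for illustration) to show what happens when you pass in something that is a value rather than a key.

```python
# A hand-typed copy of the first row (sample data for this sketch)
first_stop = {"Arrival": "July-01", "Departure": "July-08", "City": "New York", "Country": "USA"}

# The keys are the column names
column_names = list(first_stop.keys())
print(column_names)

# A valid key returns its value
print(first_stop["City"])

# "USA" is a value, not a key, so this lookup raises a KeyError
try:
    print(first_stop["USA"])
except KeyError as error:
    print("No such key:", error)

# The get method returns None instead of raising an error
missing = first_stop.get("USA")
print(missing)
```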


## Try for yourself!

Pause the video and explore other rows in the itinerary list, or individual items in any destination. Modify the code below to explore this world tour!

In [38]:
print(itinerary[0])
print(itinerary[0]["Country"])

{'Arrival': 'July-01', 'Departure': 'July-08', 'City': 'New York', 'Country': 'USA'}
USA


<p style="background-color:#F5C780; padding:15px"> 🤖 <b>Use the Chatbot</b>:
    <br><br>
    Explain this code line by line:
    <br><br>f = open("itinerary.csv", 'r')
    <br>csv_reader = csv.DictReader(f)
    <br>itinerary = []
    <br>for row in csv_reader:
    <br>&nbsp;&nbsp;&nbsp;&nbsp;itinerary.append(row)
    <br><br>f.close()
</p>

## Structured Data

Let's visualize this itinerary in a more readable way.

* Use the ```display_table``` helper function:

In [39]:
display_table(itinerary)

Arrival,Departure,City,Country
July-01,July-08,New York,USA
July-09,July-16,Rio de Janeiro,Brazil
July-17,July-24,Cape Town,South Africa
July-25,August-01,Istanbul,Turkey
August-02,August-09,Paris,France
August-10,August-17,Tokyo,Japan
August-18,August-25,Sydney,Australia


Next, write code to filter the table based on some criterion - in this case if the country is Japan - and then add the information for that stop to a new list called `filtered_data`:

In [40]:
# Create an empty list to store the filtered data
filtered_data = []

# Filter by country
for trip_stop in itinerary:
    # For example: get the destinations located in "Japan"
    if trip_stop["Country"] == "Japan":
        filtered_data.append(trip_stop)

In [41]:
display_table(filtered_data)

Arrival,Departure,City,Country
August-10,August-17,Tokyo,Japan


Note that the `filtered_data` variable contains only one row, because Tokyo is the only stop in Japan.
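If you want to filter by different countries without rewriting the loop each time, one option is to wrap it in a function. This is a minimal sketch: the function name `filter_by_country` and the three hand-typed rows are assumptions made for illustration and are not part of the helper functions.

```python
# Three rows retyped from the itinerary table (sample data for this sketch)
sample_itinerary = [
    {"Arrival": "July-01", "Departure": "July-08", "City": "New York", "Country": "USA"},
    {"Arrival": "July-09", "Departure": "July-16", "City": "Rio de Janeiro", "Country": "Brazil"},
    {"Arrival": "August-10", "Departure": "August-17", "City": "Tokyo", "Country": "Japan"},
]

def filter_by_country(stops, country):
    """Return a new list with the stops whose Country matches."""
    matches = []
    for trip_stop in stops:
        if trip_stop["Country"] == country:
            matches.append(trip_stop)
    return matches

japan_stops = filter_by_country(sample_itinerary, "Japan")
print(japan_stops)

# A country that is not in the data gives an empty list
france_stops = filter_by_country(sample_itinerary, "France")
print(france_stops)
```

The comparison with `==` is exact, so `"japan"` in lowercase would not match `"Japan"`.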

## Using AI to suggest trip activities

Retrieve the first destination and then ask an LLM for suggestions of activities to do in that location during the dates of the visit:

In [42]:
# Select the first destination from the itinerary list (Hint: index=0)
trip_stop = itinerary[0]
print(trip_stop)

{'Arrival': 'July-01', 'Departure': 'July-08', 'City': 'New York', 'Country': 'USA'}


Create variables to store all the individual items from ```trip_stop```:

In [43]:
city = trip_stop["City"]
country = trip_stop["Country"]
arrival = trip_stop["Arrival"]
departure = trip_stop["Departure"]

Write a prompt to get activity suggestions for your trip destination:

In [44]:
prompt = f"""I will visit {city}, {country}, from {arrival} to {departure}. 
Please create a detailed daily itinerary."""

print(prompt)

I will visit New York, USA, from July-01 to July-08. 
Please create a detailed daily itinerary.
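Because every row has the same keys, the same f-string can be filled in for any stop on the trip. The sketch below builds one prompt per stop from two hand-typed rows (sample data, stored in `sample_itinerary`) and only prints the prompts; it does not call the LLM.

```python
# Two rows retyped from the itinerary (sample data for this sketch)
sample_itinerary = [
    {"Arrival": "July-01", "Departure": "July-08", "City": "New York", "Country": "USA"},
    {"Arrival": "August-10", "Departure": "August-17", "City": "Tokyo", "Country": "Japan"},
]

# Build one prompt for each stop
prompts = []
for trip_stop in sample_itinerary:
    city = trip_stop["City"]
    country = trip_stop["Country"]
    arrival = trip_stop["Arrival"]
    departure = trip_stop["Departure"]
    prompt = f"""I will visit {city}, {country}, from {arrival} to {departure}.
Please create a detailed daily itinerary."""
    prompts.append(prompt)

# Print the prompts with a separator between them
for prompt in prompts:
    print(prompt)
    print("---")
```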


Use Markdown to display the LLM response nicely in the Jupyter notebook:

In [45]:
# Store the LLM response
response = get_llm_response(prompt)

# Print in Markdown format
display(Markdown(response))

**New York City Itinerary: July 1 - July 8**

**Day 1: July 1 (Saturday)**
- **Morning:** Arrive in NYC, check into your hotel.
- **Afternoon:** Explore Times Square; grab lunch at a nearby café.
- **Evening:** Visit the Top of the Rock for sunset views; dinner at a restaurant in Midtown.

**Day 2: July 2 (Sunday)**
- **Morning:** Visit the Statue of Liberty and Ellis Island (book tickets in advance).
- **Afternoon:** Explore Battery Park; lunch in the Financial District.
- **Evening:** Walk across the Brooklyn Bridge; dinner in DUMBO with views of Manhattan.

**Day 3: July 3 (Monday)**
- **Morning:** Visit the 9/11 Memorial & Museum.
- **Afternoon:** Explore Wall Street; lunch at a local deli.
- **Evening:** Take a sunset cruise around Manhattan; dinner in the West Village.

**Day 4: July 4 (Tuesday)**
- **Morning:** Visit Central Park; rent a bike or take a leisurely walk.
- **Afternoon:** Explore the Metropolitan Museum of Art (The Met).
- **Evening:** Enjoy Independence Day fireworks (check location and time).

**Day 5: July 5 (Wednesday)**
- **Morning:** Visit the American Museum of Natural History.
- **Afternoon:** Lunch on the Upper West Side; stroll through Central Park.
- **Evening:** Catch a Broadway show (book tickets in advance).

**Day 6: July 6 (Thursday)**
- **Morning:** Explore the High Line and Chelsea Market for lunch.
- **Afternoon:** Visit the Whitney Museum of American Art.
- **Evening:** Dinner in the Meatpacking District; explore nightlife.

**Day 7: July 7 (Friday)**
- **Morning:** Visit the Museum of Modern Art (MoMA).
- **Afternoon:** Lunch in Midtown; shop along Fifth Avenue.
- **Evening:** Visit Rockefeller Center; enjoy dinner at a rooftop restaurant.

**Day 8: July 8 (Saturday)**
- **Morning:** Last-minute shopping or visit any missed attractions.
- **Afternoon:** Check out of your hotel; head to the airport.

**Tips:**
- Use the subway for efficient travel.
- Book tickets for popular attractions in advance.
- Stay hydrated and wear comfortable shoes.

## Extra Practice

In these exercises, you'll create an itinerary for another stop on the trip! 

### Exercise 1

First, create a filtered dataset for Brazil. You'll need to update the `if` statement to select the right country. 

In [46]:
# Create an empty list to store the filtered data
filtered_data = []

# Filter by country
for trip_stop in itinerary:
    # For example: get the destinations located in "Brazil"
    # Complete code on next line:
    if trip_stop["Country"] == "Brazil":
        filtered_data.append(trip_stop)

print(filtered_data)

[{'Arrival': 'July-09', 'Departure': 'July-16', 'City': 'Rio de Janeiro', 'Country': 'Brazil'}]


### Exercise 2

Next, update the variables that are passed into the prompt for the LLM. You'll need to modify the first line of code below to select the first item from `filtered_data` rather than an item from the whole `itinerary`.

In [47]:
trip_stop = filtered_data[0]

city = trip_stop["City"]
country = trip_stop["Country"]
arrival = trip_stop["Arrival"]
departure = trip_stop["Departure"]

print(f" The city is: {city}")
print(f" The country is: {country}")
print(f" The arrival date is: {arrival}")
print(f" The departure date is: {departure}")

 The city is: Rio de Janeiro
 The country is: Brazil
 The arrival date is: July-09
 The departure date is: July-16


Now, you can run the prompt to get a new itinerary!

In [48]:
prompt = f"""I will visit {city}, {country}, from {arrival} to {departure}. 
Please create a detailed daily itinerary."""

print_llm_response(prompt)

### Rio de Janeiro Itinerary (July 9 - July 16)

#### Day 1: July 9 (Arrival)
- **Morning:** Arrive in Rio de Janeiro, check into your hotel.
- **Afternoon:** Explore Copacabana Beach; relax and enjoy the sun.
- **Evening:** Dinner at a beachfront restaurant.

#### Day 2: July 10 (Christ the Redeemer & Santa Teresa)
- **Morning:** Visit Christ the Redeemer (early to avoid crowds).
- **Afternoon:** Explore Santa Teresa neighborhood; visit the Selarón Steps.
- **Evening:** Dinner at a local restaurant in Santa Teresa.

#### Day 3: July 11 (Sugarloaf Mountain & Botafogo)
- **Morning:** Take the cable car to Sugarloaf Mountain; enjoy panoramic views.
- **Afternoon:** Stroll through Botafogo Beach and visit the shopping mall.
- **Evening:** Dinner in Botafogo; try traditional Brazilian cuisine.

#### Day 4: July 12 (Tijuca National Park)
- **Morning:** Hike in Tijuca National Park; visit the Cascatinha Taunay waterfall.
- **Afternoon:** Picnic in the park or lunch at a nearby restaurant.
- 

### Challenge exercise!

Complete the code below so that it will **print out the country of every destination** in the `itinerary.csv` file. Ask the chatbot for help if you need it!

In [54]:
f = open("itinerary.csv", "r")
csv_reader = csv.DictReader(f)
itinerary = []
for row in csv_reader:
    print(row)
    itinerary.append(row)
f.close()

# Complete the next two lines to print the country:
for trip_stop in itinerary:
    print(trip_stop['Country'])

{'Arrival': 'July-01', 'Departure': 'July-08', 'City': 'New York', 'Country': 'USA'}
{'Arrival': 'July-09', 'Departure': 'July-16', 'City': 'Rio de Janeiro', 'Country': 'Brazil'}
{'Arrival': 'July-17', 'Departure': 'July-24', 'City': 'Cape Town', 'Country': 'South Africa'}
{'Arrival': 'July-25', 'Departure': 'August-01', 'City': 'Istanbul', 'Country': 'Turkey'}
{'Arrival': 'August-02', 'Departure': 'August-09', 'City': 'Paris', 'Country': 'France'}
{'Arrival': 'August-10', 'Departure': 'August-17', 'City': 'Tokyo', 'Country': 'Japan'}
{'Arrival': 'August-18', 'Departure': 'August-25', 'City': 'Sydney', 'Country': 'Australia'}
USA
Brazil
South Africa
Turkey
France
Japan
Australia
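A note on which variable to loop over after `f.close()`: `csv_reader` fetches rows from the file on demand, so it stops working once the file is closed, while the `itinerary` list keeps the rows it already stored. The sketch below shows the difference. It uses `io.StringIO` as a stand-in for a real file and a shortened two-row table, both of which are assumptions so the example runs without `itinerary.csv`.

```python
import csv
import io

# Assumption: StringIO stands in for an open file with two sample rows
f = io.StringIO("City,Country\nParis,France\nTokyo,Japan\n")
csv_reader = csv.DictReader(f)
itinerary = []
for row in csv_reader:
    itinerary.append(row)
f.close()

# The reader can no longer fetch rows because the file is closed
reader_failed = False
try:
    for trip_stop in csv_reader:
        print(trip_stop["Country"])
except ValueError as error:
    reader_failed = True
    print("Reader failed:", error)

# The list still holds every row that was appended
countries = []
for trip_stop in itinerary:
    countries.append(trip_stop["Country"])
    print(trip_stop["Country"])
```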