# RICE-VIRT-DATA-PT-05-2022-U-B-MW Module 6 Challenge

# Deliverable 1: Retrieve Weather Data
## Code Summary
- **Purpose  :** Collect and analyze weather data across cities worldwide, using 500 or more unique & random cities
- **Created  :** 2022 Jun 19 22:41:05 UTC (Meghan E. Hull)
- **Modified :** 2022 Jun 20 05:45:44 UTC (Meghan E. Hull)

## Dependencies

In [7]:
# Import the dependencies.
import pandas as pd
import time
import matplotlib.pyplot as plt
import numpy as np
from citipy import citipy
from datetime import datetime
import requests
from scipy.stats import linregress

## Inputs & API keys

In [3]:
# Add the directory above current directory to list of directories where Python will look for modules
import sys; sys.path.insert(0, '..')

# Import the API key from main directory
from config import weather_api_key

# Specify the output file (CSV)
output_data_file = "WeatherPy_Database.csv"

# Retrieve Weather Data for 500+ Random Cities
## 1. Generate 2,000 random latitudes and longitudes

In [4]:
# Parameters for latitude & longitude ranges
lat_low=-90
lat_high=90
lng_low=-180
lng_high=180
no_gen=2000

# Create a set of random latitude and longitude combinations.
lats = np.random.uniform(low=lat_low, high=lat_high, size=no_gen)
lngs = np.random.uniform(low=lng_low, high=lng_high, size=no_gen)
lat_lngs = zip(lats, lngs)

# Add the latitudes and longitudes to a list.
coordinates = list(lat_lngs)
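
Because the coordinates above are drawn without a seed, every run yields a different city list. The following is a minimal sketch of a reproducible variant, assuming NumPy's `default_rng` generator is acceptable in place of `np.random.uniform`. The function name `random_coordinates` and the `seed` parameter are illustrative and not part of the original notebook.

```python
import numpy as np


def random_coordinates(count, seed=None):
    """Return `count` (lat, lng) pairs drawn uniformly from the valid ranges.

    Hypothetical helper: passing the same seed reproduces the same pairs.
    """
    rng = np.random.default_rng(seed)
    lats = rng.uniform(low=-90, high=90, size=count)
    lngs = rng.uniform(low=-180, high=180, size=count)
    return list(zip(lats, lngs))


# Usage: the same seed gives the same coordinates on every run.
coordinates_a = random_coordinates(2000, seed=42)
coordinates_b = random_coordinates(2000, seed=42)
```

With a fixed seed, the downstream city list (and therefore the API calls) can be repeated when debugging.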

## 2. List the nearest city to the latitudes and longitudes

In [32]:
# Create a list for holding the cities.
cities = []

# Identify the nearest city for each latitude and longitude combination.
for coordinate in coordinates:
    city = citipy.nearest_city(coordinate[0], coordinate[1]).city_name

    # If the city is unique, then we will add it to the cities list.
    if city not in cities:
        cities.append(city)

# Print the city count to confirm sufficient count.
len(cities)

743
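
The `if city not in cities` check above scans the whole list for every coordinate. That is fine for 2,000 points, but an order-preserving alternative is sketched below, assuming only the standard library. The helper name `unique_in_order` is hypothetical, and the sample names are taken from the log output later in this notebook.

```python
def unique_in_order(names):
    """Drop duplicate names while keeping the order of first appearance."""
    # dict keys are unique and keep insertion order (Python 3.7+).
    return list(dict.fromkeys(names))


# Usage with a few city names, including repeats.
sample_names = ["rikitea", "alofi", "rikitea", "barrow", "alofi"]
unique_names = unique_in_order(sample_names)
```

In the notebook this would be applied to the list of `city_name` values returned by `citipy.nearest_city`.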

## 3. Use OpenWeatherMap API to request current weather data from each unique city in cities list

In [33]:
# Create the base URL for the OpenWeatherMap API call.
url = "https://api.openweathermap.org/data/2.5/weather?units=Imperial&APPID=" + weather_api_key

# Create an empty list to hold the weather data.
city_data = []

# Print the beginning of the logging.
print("Beginning Data Retrieval for " + str(len(cities)) + " Cities")
print("-----------------------------")

# Set start time (seconds)
t_o=time.time()

# Create counters.
record_count = 1
set_count = 1

# Loop through all the cities in the list.
for i, city in enumerate(cities):

    # Group cities in sets of 50 for logging purposes.
    if (i % 50 == 0 and i >= 50):
        set_count += 1
        record_count = 1
        time.sleep(60)

    # Create endpoint URL with each city.
    city_url = url + "&q=" + city
    
    # Log the URL, record, and set numbers and the city.
    print(f"Processing Record {record_count} of Set {set_count} | {city}")
    
    # Add 1 to the record count.
    record_count += 1
    
    # Run an API request for each of the cities.
    try:
        # 4. Parse the JSON and retrieve data.
        city_weather = requests.get(city_url).json()
        # Parse out the needed data.
        city_lat = city_weather["coord"]["lat"]
        city_lng = city_weather["coord"]["lon"]
        city_max_temp = city_weather["main"]["temp_max"]
        city_humidity = city_weather["main"]["humidity"]
        city_clouds = city_weather["clouds"]["all"]
        city_wind = city_weather["wind"]["speed"]
        city_country = city_weather["sys"]["country"]
        city_weather_desc = city_weather["weather"][0]["description"]
        # Convert the date to ISO standard.
        city_date = datetime.utcfromtimestamp(city_weather["dt"]).strftime('%Y-%m-%d %H:%M:%S')
        # Append the city information into city_data list.
        city_data.append({"City": city.title(),
                          "Country": city_country,
                          "Lat": city_lat,
                          "Lng": city_lng,
                          "Max Temp": city_max_temp,
                          "Humidity": city_humidity,
                          "Cloudiness": city_clouds,
                          "Wind Speed": city_wind,
                          "Current Description": city_weather_desc})

    # If an error is experienced, skip the city.
    except Exception:
        print("City not found. Skipping...")

# Set end time (seconds)
t_f=time.time()

# Find elapsed time (seconds)
t_del=t_f-t_o
t_del_hr=int(t_del/3600)
t_del_min=int(t_del/60)-t_del_hr*60
t_del_sec=round(t_del-t_del_hr*3600-t_del_min*60,2)
t_del_str=str(t_del_hr) + ":" + str(t_del_min) + ":" + str(t_del_sec) + " (" + str(round(t_del,2)) + " sec)"

# Indicate that Data Loading is complete.
print("-----------------------------")
print("Data Retrieval Completed     ")
print("-----------------------------")
print("Time Elapsed: " + t_del_str)
print("Data found for " + str(len(city_data)) + " of " + str(len(cities)) + " cities")
print("-----------------------------")


Beginning Data Retrieval for 743 Cities
-----------------------------
Processing Record 1 of Set 1 | bulgan
Processing Record 2 of Set 1 | codrington
Processing Record 3 of Set 1 | rikitea
Processing Record 4 of Set 1 | alofi
Processing Record 5 of Set 1 | hermanus
Processing Record 6 of Set 1 | barrow
Processing Record 7 of Set 1 | muyezerskiy
Processing Record 8 of Set 1 | airai
Processing Record 9 of Set 1 | illoqqortoormiut
City not found. Skipping...
Processing Record 10 of Set 1 | cairns
Processing Record 11 of Set 1 | concordia
Processing Record 12 of Set 1 | port elizabeth
Processing Record 13 of Set 1 | anloga
Processing Record 14 of Set 1 | jamestown
Processing Record 15 of Set 1 | taolanaro
City not found. Skipping...
Processing Record 16 of Set 1 | busselton
Processing Record 17 of Set 1 | nizhneyansk
City not found. Skipping...
Processing Record 18 of Set 1 | ushuaia
Processing Record 19 of Set 1 | diffa
Processing Record 20 of Set 1 | geraldton
Processing Record 21 of Set

Processing Record 38 of Set 4 | okhotsk
Processing Record 39 of Set 4 | viligili
City not found. Skipping...
Processing Record 40 of Set 4 | jalu
Processing Record 41 of Set 4 | salalah
Processing Record 42 of Set 4 | muros
Processing Record 43 of Set 4 | mezen
Processing Record 44 of Set 4 | banamba
Processing Record 45 of Set 4 | chifeng
Processing Record 46 of Set 4 | kruisfontein
Processing Record 47 of Set 4 | along
Processing Record 48 of Set 4 | rio gallegos
Processing Record 49 of Set 4 | kaoma
Processing Record 50 of Set 4 | lazaro cardenas
Processing Record 1 of Set 5 | meulaboh
Processing Record 2 of Set 5 | port blair
Processing Record 3 of Set 5 | udachnyy
Processing Record 4 of Set 5 | karaul
City not found. Skipping...
Processing Record 5 of Set 5 | ardistan
City not found. Skipping...
Processing Record 6 of Set 5 | wenatchee
Processing Record 7 of Set 5 | isangel
Processing Record 8 of Set 5 | ondjiva
Processing Record 9 of Set 5 | kelo
Processing Record 10 of Set 5 | t

Processing Record 27 of Set 8 | xianyang
Processing Record 28 of Set 8 | hernani
Processing Record 29 of Set 8 | tigil
Processing Record 30 of Set 8 | sarrebourg
Processing Record 31 of Set 8 | niesky
Processing Record 32 of Set 8 | shubarshi
Processing Record 33 of Set 8 | tazovskiy
Processing Record 34 of Set 8 | sofiysk
City not found. Skipping...
Processing Record 35 of Set 8 | kemijarvi
Processing Record 36 of Set 8 | dayong
Processing Record 37 of Set 8 | tsihombe
City not found. Skipping...
Processing Record 38 of Set 8 | tiarei
Processing Record 39 of Set 8 | starkville
Processing Record 40 of Set 8 | marawi
Processing Record 41 of Set 8 | shepsi
Processing Record 42 of Set 8 | ous
Processing Record 43 of Set 8 | bratsk
Processing Record 44 of Set 8 | luderitz
Processing Record 45 of Set 8 | bubaque
Processing Record 46 of Set 8 | belomorsk
Processing Record 47 of Set 8 | sandakan
Processing Record 48 of Set 8 | nioro
Processing Record 49 of Set 8 | voh
Processing Record 50 of 

Processing Record 13 of Set 12 | moindou
Processing Record 14 of Set 12 | malmyzh
Processing Record 15 of Set 12 | yantai
Processing Record 16 of Set 12 | dien bien
City not found. Skipping...
Processing Record 17 of Set 12 | miles city
Processing Record 18 of Set 12 | walvis bay
Processing Record 19 of Set 12 | broken hill
Processing Record 20 of Set 12 | kalmunai
Processing Record 21 of Set 12 | formoso do araguaia
City not found. Skipping...
Processing Record 22 of Set 12 | mineros
Processing Record 23 of Set 12 | inyonga
Processing Record 24 of Set 12 | chantada
Processing Record 25 of Set 12 | inuvik
Processing Record 26 of Set 12 | emerald
Processing Record 27 of Set 12 | phan thiet
Processing Record 28 of Set 12 | cervo
Processing Record 29 of Set 12 | ayan
Processing Record 30 of Set 12 | amberley
Processing Record 31 of Set 12 | mandiana
Processing Record 32 of Set 12 | grand gaube
Processing Record 33 of Set 12 | balabac
Processing Record 34 of Set 12 | portland
Processing Re
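
The request loop above builds each URL by string concatenation and treats any exception as "city not found". The sketch below shows one way to separate those two steps so they can be checked without a network call. It rests on several assumptions: the helper names `build_city_url` and `parse_city_weather` are hypothetical, the sample payloads are hand-written stand-ins shaped like the fields the loop reads, and the `"Date"` key is an addition (the loop computes `city_date` but does not store it).

```python
from datetime import datetime, timezone
from urllib.parse import urlencode

BASE_URL = "https://api.openweathermap.org/data/2.5/weather"


def build_city_url(city, api_key, units="Imperial"):
    """Return the request URL for one city with an encoded query string."""
    # urlencode escapes spaces and other special characters in city names.
    query = urlencode({"units": units, "APPID": api_key, "q": city})
    return f"{BASE_URL}?{query}"


def parse_city_weather(city, payload):
    """Return one row for city_data, or None if an expected field is missing."""
    try:
        # Convert the Unix timestamp to a UTC date string.
        date = datetime.fromtimestamp(payload["dt"], tz=timezone.utc)
        return {
            "City": city.title(),
            "Country": payload["sys"]["country"],
            "Date": date.strftime("%Y-%m-%d %H:%M:%S"),  # assumption: not stored in the original
            "Lat": payload["coord"]["lat"],
            "Lng": payload["coord"]["lon"],
            "Max Temp": payload["main"]["temp_max"],
            "Humidity": payload["main"]["humidity"],
            "Cloudiness": payload["clouds"]["all"],
            "Wind Speed": payload["wind"]["speed"],
            "Current Description": payload["weather"][0]["description"],
        }
    except (KeyError, IndexError, TypeError):
        # Only missing or malformed fields are treated as "city not found".
        return None


# Hand-written sample payloads (assumed shapes, not real API responses).
sample_payload = {
    "coord": {"lat": 48.8125, "lon": 103.5347},
    "main": {"temp_max": 72.9, "humidity": 42},
    "clouds": {"all": 52},
    "wind": {"speed": 5.46},
    "sys": {"country": "MN"},
    "weather": [{"description": "broken clouds"}],
    "dt": 1655683200,
}
error_payload = {"cod": "404", "message": "city not found"}

sample_url = build_city_url("port elizabeth", "YOUR_KEY")
sample_row = parse_city_weather("bulgan", sample_payload)
missing_row = parse_city_weather("taolanaro", error_payload)
```

Catching only the lookup errors means that network failures or an invalid API key would surface as real errors instead of being logged as missing cities.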

# Add Weather Data to a DataFrame

In [34]:
# Convert the array of dictionaries to a Pandas DataFrame.
city_data_df = pd.DataFrame(city_data)

# # Reorder columns
# new_column_order=["City","Country","Date","Lat","Lng","Max Temp","Humidity","Cloudiness","Wind Speed"]
# city_data_df=city_data_df[new_column_order]

# Check the DataFrame
city_data_df.head(10)

|   | City | Country | Lat | Lng | Max Temp | Humidity | Cloudiness | Wind Speed | Current Description |
|---|------|---------|-----|-----|----------|----------|------------|------------|---------------------|
| 0 | Bulgan | MN | 48.8125 | 103.5347 | 72.9 | 42 | 52 | 5.46 | broken clouds |
| 1 | Codrington | AU | -38.2667 | 141.9667 | 54.19 | 80 | 100 | 20.4 | overcast clouds |
| 2 | Rikitea | PF | -23.1203 | -134.9692 | 71.58 | 58 | 76 | 19.71 | broken clouds |
| 3 | Alofi | NU | -19.0595 | -169.9187 | 76.89 | 94 | 75 | 4.61 | broken clouds |
| 4 | Hermanus | ZA | -34.4187 | 19.2345 | 44.8 | 84 | 0 | 13.2 | clear sky |
| 5 | Barrow | US | 71.2906 | -156.7887 | 55.42 | 71 | 0 | 11.5 | clear sky |
| 6 | Muyezerskiy | RU | 63.9333 | 31.65 | 49.41 | 74 | 64 | 8.46 | broken clouds |
| 7 | Airai | TL | -8.9266 | 125.4092 | 72.23 | 43 | 65 | 4.29 | broken clouds |
| 8 | Cairns | AU | -16.9167 | 145.7667 | 83.07 | 67 | 100 | 16.11 | overcast clouds |
| 9 | Concordia | AR | -31.393 | -58.0209 | 38.79 | 100 | 9 | 5.06 | clear sky |


# Store Data in an Output CSV

In [36]:
# Export city_data_df to a CSV file, labeling the index column City_ID.
city_data_df.to_csv(output_data_file, index_label="City_ID")
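
To illustrate what `index_label="City_ID"` does, here is a small round-trip sketch. It assumes an in-memory buffer in place of `WeatherPy_Database.csv` and uses two rows from the table above. The names `sample_df`, `csv_text`, and `restored_df` are illustrative.

```python
import io

import pandas as pd

# Two rows copied from the DataFrame preview, standing in for city_data_df.
sample_df = pd.DataFrame(
    [
        {"City": "Bulgan", "Country": "MN", "Max Temp": 72.9},
        {"City": "Codrington", "Country": "AU", "Max Temp": 54.19},
    ]
)

# Write to an in-memory buffer; index_label names the index column in the header.
buffer = io.StringIO()
sample_df.to_csv(buffer, index_label="City_ID")
csv_text = buffer.getvalue()

# Read it back, using City_ID as the index so no unnamed column appears.
restored_df = pd.read_csv(io.StringIO(csv_text), index_col="City_ID")
```

Reading the file back with `index_col="City_ID"` keeps the IDs as the index, so later deliverables that load the CSV see the same columns as `city_data_df`.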