## Background
This Python script visualizes the weather of 500+ cities around the world at varying distances from the equator. It uses a simple Python library (citipy), the OpenWeatherMap API, and a little common sense to build a representative model of weather across world cities.

The visualization is a series of scatter plots that showcase the following relationships:
* Temperature (F) vs. Latitude
* Humidity (%) vs. Latitude
* Cloudiness (%) vs. Latitude
* Wind Speed (mph) vs. Latitude

The script accomplishes the following:
* Randomly select at least 500 unique (non-repeat) cities based on latitude and longitude.
* Perform a weather check on each of the cities using a series of successive API calls.
* Include a print log of each city as it's being processed with the city number and city name.
* Save both a CSV of all data retrieved and png images for each scatter plot.

## Three observable trends based on the data

* Not surprisingly, temperature increases as you get closer to the equator. However, the temperature peaks at around 20 degrees latitude and not at the equator.

* Cloudiness and humidity don't show strong correlations with latitude. However, humidity appears to dip the most at two separate troughs, around 20 degrees and -20 degrees latitude.

* Wind speed appears to increase slightly the farther you go from the equator. Reaching a definitive conclusion would require an additional variable to analyze.
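
One candidate for such an additional variable is absolute latitude, which measures distance from the equator in degrees regardless of hemisphere. The sketch below is an illustration only: `pearson` is a hypothetical helper, and the sample values are made up to show the calls, not taken from the retrieved data.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Made-up illustrative values, NOT results from this analysis
latitudes = [-60.0, -30.0, 0.0, 30.0, 60.0]
wind_speeds = [12.0, 6.0, 0.0, 6.0, 12.0]

# Distance from the equator in degrees, ignoring hemisphere
abs_latitudes = [abs(lat) for lat in latitudes]

# A pattern that is symmetric about the equator is hidden by signed latitude
r_signed = pearson(latitudes, wind_speeds)
r_absolute = pearson(abs_latitudes, wind_speeds)
print(f"signed latitude r = {r_signed:.2f}, absolute latitude r = {r_absolute:.2f}")
```

With the notebook's DataFrame, the equivalent check would be along the lines of `weatherData["Lat"].abs().corr(weatherData["Wind Speed"])`.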

In [1]:
# Dependencies and Setup
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np
import requests
import time

# Import API key
import api_keys

# Incorporated citipy to determine city based on latitude and longitude
from citipy import citipy

# Output File (CSV)
output_data_file = "output_data/cities.csv"

# Range of latitudes and longitudes
lat_range = (-90, 90)
lng_range = (-180, 180)

ModuleNotFoundError: No module named 'api_keys'
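
The `ModuleNotFoundError` above indicates that `api_keys.py` was not on the import path when this cell ran. The following is a minimal sketch of a fallback, assuming `api_keys.py` is a local file that defines `api_key` (as the later `api_keys.api_key` reference implies). The function name `load_api_key` and the environment variable name `OPENWEATHER_API_KEY` are hypothetical choices for illustration.

```python
import os

def load_api_key(env_var="OPENWEATHER_API_KEY"):
    """Return the key from api_keys.py if available, else from the environment."""
    try:
        import api_keys  # local module assumed to define `api_key`
        return api_keys.api_key
    except (ImportError, AttributeError):
        # Fall back to an environment variable (hypothetical name)
        key = os.environ.get(env_var)
        if not key:
            raise RuntimeError(
                f"No API key found: create api_keys.py or set {env_var}"
            )
        return key
```

Keeping the key in an untracked file or an environment variable also keeps it out of the shared notebook.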

## Generate Cities List

In [None]:
# List for holding lat_lngs and cities
lat_lngs = []
cities = []

# Create a set of random lat and lng combinations
lats = np.random.uniform(low=-90.000, high=90.000, size=1500)
lngs = np.random.uniform(low=-180.000, high=180.000, size=1500)
lat_lngs = zip(lats, lngs)

# Identify nearest city for each lat, lng combination
for lat_lng in lat_lngs:
    city = citipy.nearest_city(lat_lng[0], lat_lng[1]).city_name
    
    # If the city is unique, then add it to our cities list
    if city not in cities:
        cities.append(city)

# Print the city count to confirm sufficient count
len(cities)
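
The uniqueness check above scans the whole list for every candidate. As a sketch of an alternative, a set can track the names already seen while a list preserves first-seen order. The names `unique_cities`, `random_coords`, and `fake_lookup` are hypothetical; `fake_lookup` is a stand-in so the example runs without citipy.

```python
import random

def unique_cities(coords, lookup):
    """Return city names in first-seen order, skipping repeats."""
    seen = set()
    cities = []
    for lat, lng in coords:
        city = lookup(lat, lng)
        if city not in seen:  # set membership is O(1) on average
            seen.add(city)
            cities.append(city)
    return cities

def random_coords(n, seed=None):
    """Draw n random (lat, lng) pairs over the full coordinate ranges."""
    rng = random.Random(seed)
    return [(rng.uniform(-90, 90), rng.uniform(-180, 180)) for _ in range(n)]

# Stand-in lookup for demonstration only. In the notebook this would be
# lambda lat, lng: citipy.nearest_city(lat, lng).city_name
def fake_lookup(lat, lng):
    return "north" if lat >= 0 else "south"

cities = unique_cities(random_coords(1500, seed=42), fake_lookup)
print(len(cities))
```

If the resulting count falls short of 500, drawing more coordinate pairs is the simplest remedy.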

### Perform API Calls
* Perform a weather check on each city using a series of successive API calls.
* Include a print log of each city as it's being processed (with the city number and city name).

In [None]:
# OpenWeatherMap API Key
api_key = api_keys.api_key

# URL for Weather Map API call 
url = "http://api.openweathermap.org/data/2.5/weather?units=Imperial&APPID=" + api_key 
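
The URL above is assembled by string concatenation, and the city is appended later in the loop. As a hedged alternative, the query string can be built in one place with the standard library so that city names containing spaces are encoded explicitly. `BASE_URL` and `build_weather_url` are hypothetical names; the endpoint and the `units`, `APPID`, and `q` parameters are the ones already used in this notebook.

```python
from urllib.parse import urlencode

BASE_URL = "http://api.openweathermap.org/data/2.5/weather"

def build_weather_url(city, api_key, units="Imperial"):
    """Build a request URL with every query parameter percent-encoded."""
    query = urlencode({"units": units, "APPID": api_key, "q": city})
    return f"{BASE_URL}?{query}"

# "YOUR_KEY" is a placeholder, not a real key
print(build_weather_url("san francisco", "YOUR_KEY"))
```

With `requests`, passing the same dictionary as `requests.get(BASE_URL, params={...})` achieves the same encoding.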

In [None]:
# Create lists to hold response info
country = []
city_name = []
date = []
lat = []
lng = []
cloudiness = []
humidity = []
maxTemp = []
windSpeed = []

In [None]:
# Start counter
record = 1

# Print log file
print("Beginning Data Retrieval")
print("-----------------------------")

# Loop through the list of cities in the city list
for city in cities:
    
    # Try to append each field; a missing key means the city was not found
    try:
        response = requests.get(f"{url}&q={city}").json()
        city_name.append(response["name"])
        cloudiness.append(response["clouds"]["all"])
        country.append(response["sys"]["country"])
        date.append(response["dt"])
        humidity.append(response["main"]["humidity"])
        maxTemp.append(response["main"]["temp_max"])
        lat.append(response["coord"]["lat"])
        lng.append(response["coord"]["lon"])
        windSpeed.append(response["wind"]["speed"])
        cityRecord = response["name"]
        print(f"Processing Record {record} | {cityRecord}")
        print(f"{url}&q={city}")
        
        # Increase counter +1
        record = record + 1
        
        # Pause about one second between calls to stay within the API rate limit
        time.sleep(1.01)

    # If no record is found, skip to the next city
    except (KeyError, ValueError, requests.RequestException):
        print("The city is not found. Skipping...")
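
The loop above appends to nine separate lists, which stay aligned only because a missing city fails on the first key. A sketch of a more defensive variant extracts all fields into one record first, so a response missing any field is skipped as a whole. `parse_weather` is a hypothetical helper, and both sample payloads are made up; the not-found shape in particular is an assumption about what the API returns.

```python
def parse_weather(response):
    """Extract the fields used in this analysis, or return None if any is missing."""
    try:
        return {
            "City": response["name"],
            "Cloudiness": response["clouds"]["all"],
            "Country": response["sys"]["country"],
            "Date": response["dt"],
            "Humidity": response["main"]["humidity"],
            "Lat": response["coord"]["lat"],
            "Lng": response["coord"]["lon"],
            "Max Temp": response["main"]["temp_max"],
            "Wind Speed": response["wind"]["speed"],
        }
    except (KeyError, TypeError):
        return None

# Hypothetical payloads containing only the fields the loop reads
found = {
    "name": "Sample City",
    "clouds": {"all": 40},
    "sys": {"country": "XX"},
    "dt": 1546300800,
    "main": {"humidity": 55, "temp_max": 70.0},
    "coord": {"lat": 10.0, "lon": 20.0},
    "wind": {"speed": 5.0},
}
not_found = {"cod": "404", "message": "city not found"}  # assumed error shape

parsed = (parse_weather(found), parse_weather(not_found))
records = [record for record in parsed if record is not None]
print(records)
```

A list of such records can be passed directly to `pd.DataFrame(records)` to obtain the same columns as the dictionary of lists.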

In [None]:
# Create a dictionary from the generated lists
WeatherPyDict = {
    "City":city_name,
    "Cloudiness":cloudiness,
    "Country":country,
    "Date":date,
    "Humidity":humidity,
    "Lat":lat,
    "Lng":lng,
    "Max Temp":maxTemp,
    "Wind Speed":windSpeed
}

In [None]:
# Create a dataframe from the dictionary 
weatherData = pd.DataFrame(WeatherPyDict)

# Count of weatherData values
weatherData.count()

In [None]:
# Save the dataframe 'weatherData' to CSV
weatherData.to_csv('output_data/weatherData.csv')
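
`to_csv` does not create missing folders, so the `output_data` directory has to exist before the call above. A minimal sketch of a guard using the standard library follows; `ensure_parent_dir` is a hypothetical helper, and the temporary directory is used only so the demonstration leaves no files behind.

```python
import tempfile
from pathlib import Path

def ensure_parent_dir(path):
    """Create the parent folder of `path` if needed and return it as a Path."""
    target = Path(path)
    target.parent.mkdir(parents=True, exist_ok=True)
    return target

# Demonstration inside a throwaway directory
with tempfile.TemporaryDirectory() as tmp:
    csv_path = ensure_parent_dir(Path(tmp) / "output_data" / "weatherData.csv")
    created = csv_path.parent.is_dir()

print(created)
```

In the notebook this could be used as `weatherData.to_csv(ensure_parent_dir("output_data/weatherData.csv"))`, and the same guard applies to the `output_images` folder used by `plt.savefig`.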

In [None]:
weatherData.head()

## Plotting the Data
* Use proper labeling of the plots using plot titles (including date of analysis) and axes labels.
* Save the plotted figures as .pngs.
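
The plot titles in the cells that follow do not yet include the date of analysis. A small sketch of one way to add it is to derive the date from the Unix timestamps stored in the `Date` column. `title_with_date` and `image_path` are hypothetical helpers, and the `%m/%d/%y` format is an arbitrary choice.

```python
from datetime import datetime, timezone

def title_with_date(metric, unix_timestamp):
    """Return a plot title that includes the analysis date (UTC)."""
    moment = datetime.fromtimestamp(unix_timestamp, tz=timezone.utc)
    return f"City Latitude vs. {metric} ({moment.strftime('%m/%d/%y')})"

def image_path(metric, folder="output_images"):
    """Return a .png path that follows the notebook's file naming pattern."""
    return f"{folder}/Latitude_vs_{metric.replace(' ', '')}.png"

# 1546300800 is an example timestamp, not a value from the retrieved data
print(title_with_date("Max Temperature", 1546300800))
print(image_path("Wind Speed"))
```

In a plotting cell this would read, for example, `plt.title(title_with_date("Humidity", int(weatherData["Date"].max())))` followed by `plt.savefig(image_path("Humidity"))`.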

## Latitude vs. Temperature Plot

In [None]:
# Create a scatter plot
plt.scatter(weatherData["Lat"], weatherData["Max Temp"], marker="o",
            facecolors="lightskyblue", edgecolors="black", alpha=0.75, s=25)

# Incorporate the other graph properties
plt.title("City Latitude vs. Max Temperature")
plt.xlabel("Latitude")
plt.ylabel("Max Temperature (F)")
plt.grid()

# Save Image
plt.savefig("output_images/Latitude_vs_MaxTemperature")
plt.show()

## Latitude vs. Humidity Plot

In [None]:
# Create a scatter plot
plt.scatter(weatherData["Lat"], weatherData["Humidity"], marker="o",
            facecolors="lightskyblue", edgecolors="black", alpha=0.75, s=25)

# Incorporate the other graph properties
plt.title("City Latitude vs. Humidity")
plt.xlabel("Latitude")
plt.ylabel("Humidity (%)")
plt.grid()

# Save Image
plt.savefig("output_images/Latitude_vs_Humidity")
plt.show()

## Latitude vs. Cloudiness Plot

In [None]:
# Create a scatter plot
plt.scatter(weatherData["Lat"], weatherData["Cloudiness"], marker="o",
            facecolors="lightskyblue", edgecolors="black", alpha=0.75, s=25)

# Incorporate the other graph properties
plt.title("City Latitude vs. Cloudiness")
plt.xlabel("Latitude")
plt.ylabel("Cloudiness (%)")
plt.grid()

# Save Image
plt.savefig("output_images/Latitude_vs_Cloudiness")
plt.show()

## Latitude vs. Wind Speed Plot

In [2]:
# Create a scatter plot
plt.scatter(weatherData["Lat"], weatherData["Wind Speed"], marker="o",
            facecolors="lightskyblue", edgecolors="black", alpha=0.75, s=25)

# Incorporate the other graph properties
plt.title("City Latitude vs. Wind Speed")
plt.xlabel("Latitude")
plt.ylabel("Wind Speed (mph)")
plt.grid()

# Save Image
plt.savefig("output_images/Latitude_vs_WindSpeed")
plt.show()

NameError: name 'weatherData' is not defined