# WeatherPy
----

#### Analysis
Observation: Northern Hemisphere - Max Temp vs. Latitude Linear Regression
Negative linear relationship: the maximum temperature decreases as latitude increases, that is, as we move farther from the equator.

Observation: Southern Hemisphere - Max Temp vs. Latitude Linear Regression
Positive linear relationship: the maximum temperature increases as latitude increases, that is, as we move toward the equator.

Observation: Northern and Southern Hemisphere - Humidity (%) and Cloudiness (%) vs. Latitude Linear Regression
Humidity shows no correlation with latitude, and cloudiness shows no strong relationship with latitude, in either hemisphere.


In [1]:
# Install the citipy module if it is not already installed
# pip install citipy

In [2]:
# Dependencies and Setup
import os
import matplotlib.pyplot as plt
import seaborn as sns
import pandas as pd
import numpy as np
import requests
import time
import scipy.stats as st
from scipy.stats import linregress

# Import API key
from config import weather_api_key

# Incorporated citipy to determine city based on latitude and longitude
from citipy import citipy

# Output File (CSV)
output_data_file = "../output_data/cities.csv"

# Range of latitudes and longitudes
lat_range = (-90, 90)
lng_range = (-180, 180)

## Generate Cities List

In [3]:
# List for holding lat_lngs and cities
lat_lngs = []
cities = []

# Create a set of random lat and lng combinations
lats = np.random.uniform(lat_range[0], lat_range[1], size=1500)
lngs = np.random.uniform(lng_range[0], lng_range[1], size=1500)
lat_lngs = zip(lats, lngs)

# Identify nearest city for each lat, lng combination
for lat_lng in lat_lngs:
    city = citipy.nearest_city(lat_lng[0], lat_lng[1]).city_name
    
    # If the city is unique, then add it to our cities list
    if city not in cities:
        cities.append(city)

# Print the city count to confirm sufficient count
len(cities)

611
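
The membership test `city not in cities` scans the whole list on every iteration. A minimal sketch of the same de-duplication using a set for the lookups, assuming the order of first appearance should be preserved. The helper `unique_in_order` and the sample names are hypothetical and not part of the notebook:

```python
def unique_in_order(names):
    """Return names with duplicates removed, keeping first-seen order."""
    seen = set()  # set membership checks do not scan every element
    unique = []
    for name in names:
        if name not in seen:
            seen.add(name)
            unique.append(name)
    return unique


# Made-up sample standing in for repeated citipy lookups
sample = ["kodiak", "tiksi", "kodiak", "castro", "tiksi"]
print(unique_in_order(sample))
```

For roughly 1,500 lookups the difference is small, so this matters mainly if the sample size grows.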

### Perform API Calls
* Perform a weather check on each city using a series of successive API calls.
* Include a print log of each city as it's being processed (with the city number and city name).
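
Several city names in this dataset contain spaces (for example "road town"), so it can help to encode the query parameters before calling the API. A minimal sketch using only the standard library; `build_query_url` is a hypothetical helper and `DEMO_KEY` is a placeholder, not a working key:

```python
from urllib.parse import urlencode

BASE_URL = "http://api.openweathermap.org/data/2.5/weather"


def build_query_url(city, api_key, units="imperial"):
    """Build a current-weather query URL with encoded parameters."""
    params = {"q": city, "units": units, "appid": api_key}
    return f"{BASE_URL}?{urlencode(params)}"


# "DEMO_KEY" is a placeholder for illustration only
print(build_query_url("road town", "DEMO_KEY"))
```

`urlencode` replaces the space with `+`, so the city name reaches the API as a single parameter value.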


In [4]:
# Get weather data from open weather map
# https://openweathermap.org/current

url = "http://api.openweathermap.org/data/2.5/weather?"
units = "imperial"
process_record = 0
set_counter = 1
cities_weather = []

for city in cities:
    
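    # Increase the record counter by one
    process_record += 1
    print(f"Processing Record {process_record} of Set {set_counter} | {city}")

    # Start a new set after every 50 records
    if process_record == 50:
        set_counter += 1
        process_record = 0

    # url already ends with "?", so the first parameter needs no leading "&"
    query_url = f"{url}appid={weather_api_key}&q={city}&units={units}"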

    try:
        response = requests.get(query_url).json()
        cities_weather.append({'City':city,
                               'Lat':response['coord']['lat'],
                               'Lng':response['coord']['lon'],
                               'Max Temp':response['main']['temp_max'],
                               'Humidity':response['main']['humidity'],
                               'Cloudiness':response['clouds']['all'],
                               'Wind Speed':response['wind']['speed'],
                               'Country':response['sys']['country'],
                               'Date':response['dt']})
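    except (KeyError, ValueError, requests.exceptions.RequestException):
        # A missing key means the API returned an error payload instead of weather data
        print('City not found. Skipping...')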
        
    
print('-----------------------------')
print('Data Retrieval Complete')      
print('-----------------------------')

Processing Record 1 of Set 1 | barentsburg
City not found. Skipping...
Processing Record 2 of Set 1 | nakusp
City not found. Skipping...
Processing Record 3 of Set 1 | mindyak
City not found. Skipping...
Processing Record 4 of Set 1 | kaitangata
City not found. Skipping...
Processing Record 5 of Set 1 | illoqqortoormiut
City not found. Skipping...
Processing Record 6 of Set 1 | road town
City not found. Skipping...
Processing Record 7 of Set 1 | kodiak
City not found. Skipping...
Processing Record 8 of Set 1 | alta floresta
City not found. Skipping...
Processing Record 9 of Set 1 | tiksi
City not found. Skipping...
Processing Record 10 of Set 1 | ariano irpino
City not found. Skipping...
Processing Record 11 of Set 1 | tura
City not found. Skipping...
Processing Record 12 of Set 1 | busselton
City not found. Skipping...
Processing Record 13 of Set 1 | ray
City not found. Skipping...
Processing Record 14 of Set 1 | castro
City not found. Skipping...
Processing Record 15 of Set 1 | sasky

City not found. Skipping...
Processing Record 23 of Set 3 | vestmannaeyjar
City not found. Skipping...
Processing Record 24 of Set 3 | chodziez
City not found. Skipping...
Processing Record 25 of Set 3 | bambous virieux
City not found. Skipping...
Processing Record 26 of Set 3 | bluff
City not found. Skipping...
Processing Record 27 of Set 3 | zhigansk
City not found. Skipping...
Processing Record 28 of Set 3 | lolua
City not found. Skipping...
Processing Record 29 of Set 3 | san cristobal
City not found. Skipping...
Processing Record 30 of Set 3 | coquimbo
City not found. Skipping...
Processing Record 31 of Set 3 | beloha
City not found. Skipping...
Processing Record 32 of Set 3 | nikolskoye
City not found. Skipping...
Processing Record 33 of Set 3 | la rioja
City not found. Skipping...
Processing Record 34 of Set 3 | sira
City not found. Skipping...
Processing Record 35 of Set 3 | zambezi
City not found. Skipping...
Processing Record 36 of Set 3 | mahebourg
City not found. Skipping..

City not found. Skipping...
Processing Record 44 of Set 5 | synya
City not found. Skipping...
Processing Record 45 of Set 5 | shache
City not found. Skipping...
Processing Record 46 of Set 5 | ancud
City not found. Skipping...
Processing Record 47 of Set 5 | djenne
City not found. Skipping...
Processing Record 48 of Set 5 | ust-maya
City not found. Skipping...
Processing Record 49 of Set 5 | turayf
City not found. Skipping...
Processing Record 50 of Set 5 | padang
City not found. Skipping...
Processing Record 1 of Set 6 | mullaitivu
City not found. Skipping...
Processing Record 2 of Set 6 | norman wells
City not found. Skipping...
Processing Record 3 of Set 6 | codrington
City not found. Skipping...
Processing Record 4 of Set 6 | meulaboh
City not found. Skipping...
Processing Record 5 of Set 6 | nabire
City not found. Skipping...
Processing Record 6 of Set 6 | hunza
City not found. Skipping...
Processing Record 7 of Set 6 | rabaul
City not found. Skipping...
Processing Record 8 of Set

City not found. Skipping...
Processing Record 16 of Set 8 | yulara
City not found. Skipping...
Processing Record 17 of Set 8 | sabzevar
City not found. Skipping...
Processing Record 18 of Set 8 | eldorado
City not found. Skipping...
Processing Record 19 of Set 8 | saldanha
City not found. Skipping...
Processing Record 20 of Set 8 | auxerre
City not found. Skipping...
Processing Record 21 of Set 8 | chiredzi
City not found. Skipping...
Processing Record 22 of Set 8 | kaseda
City not found. Skipping...
Processing Record 23 of Set 8 | alyangula
City not found. Skipping...
Processing Record 24 of Set 8 | abu zabad
City not found. Skipping...
Processing Record 25 of Set 8 | teguldet
City not found. Skipping...
Processing Record 26 of Set 8 | havelock
City not found. Skipping...
Processing Record 27 of Set 8 | bantva
City not found. Skipping...
Processing Record 28 of Set 8 | humaita
City not found. Skipping...
Processing Record 29 of Set 8 | puerto colombia
City not found. Skipping...
Proce

City not found. Skipping...
Processing Record 35 of Set 10 | lapao
City not found. Skipping...
Processing Record 36 of Set 10 | nantucket
City not found. Skipping...
Processing Record 37 of Set 10 | bada
City not found. Skipping...
Processing Record 38 of Set 10 | batagay-alyta
City not found. Skipping...
Processing Record 39 of Set 10 | ulladulla
City not found. Skipping...
Processing Record 40 of Set 10 | omboue
City not found. Skipping...
Processing Record 41 of Set 10 | menongue
City not found. Skipping...
Processing Record 42 of Set 10 | edson
City not found. Skipping...
Processing Record 43 of Set 10 | santa rosa
City not found. Skipping...
Processing Record 44 of Set 10 | safford
City not found. Skipping...
Processing Record 45 of Set 10 | george
City not found. Skipping...
Processing Record 46 of Set 10 | carpentras
City not found. Skipping...
Processing Record 47 of Set 10 | maragogi
City not found. Skipping...
Processing Record 48 of Set 10 | kamenka
City not found. Skipping.

City not found. Skipping...
Processing Record 6 of Set 13 | emerald
City not found. Skipping...
Processing Record 7 of Set 13 | chernyshevskiy
City not found. Skipping...
Processing Record 8 of Set 13 | sabinas
City not found. Skipping...
Processing Record 9 of Set 13 | paveh
City not found. Skipping...
Processing Record 10 of Set 13 | sobolevo
City not found. Skipping...
Processing Record 11 of Set 13 | lilongwe
City not found. Skipping...
-----------------------------
Data Retrieval Complete
-----------------------------
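
Because the loop prints the same message for any failure, the log cannot distinguish a genuinely unknown city from another problem such as a rejected API key. A minimal sketch of inspecting the decoded payload before indexing into it. It assumes error responses carry `cod` and `message` fields; that shape, the helper `classify_payload`, and the sample payloads are assumptions for illustration:

```python
def classify_payload(payload):
    """Label a decoded JSON payload as 'ok', 'not_found', or 'error: ...'."""
    # Assumption: the status is in a `cod` field that may be an int or a string
    code = str(payload.get("cod", ""))
    if code == "200":
        return "ok"
    if code == "404":
        return "not_found"
    return f"error: {payload.get('message', 'unknown')}"


# Hand-written payloads for illustration only
print(classify_payload({"cod": 200, "main": {"temp_max": 71.6}}))
print(classify_payload({"cod": "404", "message": "city not found"}))
print(classify_payload({"cod": 401, "message": "Invalid API key"}))
```

Printing the returned label instead of a fixed message would show whether every request is failing for the same reason.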


In [5]:
cities = pd.DataFrame(cities_weather)
cities.head()

### Convert Raw Data to DataFrame
* Export the city data into a .csv.
* Display the DataFrame

In [6]:
cities.count()

Series([], dtype: int64)

In [7]:
# Display the DataFrame
cities = pd.DataFrame(cities_weather)
cities

In [8]:
# Export the data to csv
cities.to_csv(output_data_file)

## Inspect the data and remove the cities where the humidity > 100%.
----
Skip this step if there are no cities that have humidity > 100%. 
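
The filtering rule itself is a single comparison per row. A minimal sketch in plain Python that mirrors the index-based drop used in the cells below; `drop_humidity_outliers` and the rows are made up for illustration:

```python
def drop_humidity_outliers(records, limit=100):
    """Keep only the records whose humidity does not exceed the limit."""
    return [row for row in records if row["Humidity"] <= limit]


# Made-up rows; the 104 reading is deliberately out of range
rows = [
    {"City": "a", "Humidity": 82},
    {"City": "b", "Humidity": 104},
    {"City": "c", "Humidity": 100},
]
clean = drop_humidity_outliers(rows)
print([row["City"] for row in clean])
```

A reading of exactly 100 is kept, matching the `> 100` condition in the notebook.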

In [None]:
# Display a statistical overview of the DataFrame 
cities.describe()

In [None]:
# Cities that have humidity > 80%.
midHumidity = cities.index[cities['Humidity'] > 80].tolist()
len(midHumidity)

In [None]:
# Cities that have humidity > 100%.
maxHumidity = cities.index[cities['Humidity'] > 100].tolist()
len(maxHumidity)

In [None]:
# Make a new DataFrame from the city data with all humidity outliers dropped by index.
# Passing "inplace=False" returns a copy of the cities DataFrame, which we call "city_weatherData".
city_weatherData = cities.drop(maxHumidity, inplace=False)
city_weatherData.head()

* As no city had a humidity greater than 100%, the number of cities in the DataFrame remains the same.

In [None]:
# Check the number of cities
city_weatherData.count()

In [None]:
# Export the City_Data into a csv
city_weatherData.to_csv("../output_data/city_weatherData.csv")

## Plotting the Data
* Label the plots with titles (including the date of analysis) and axis labels.
* Save the plotted figures as .pngs.
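
The plot titles call `time.strftime` with no timestamp, so they show the date the cell is run. If the title should instead reflect when the observations were taken, the `Date` column could be formatted directly. A minimal sketch, assuming `Date` holds Unix timestamps in seconds (UTC); `format_observation_date` is a hypothetical helper:

```python
import time


def format_observation_date(unix_seconds):
    """Format a Unix timestamp (seconds, UTC) as MM/DD/YYYY."""
    return time.strftime("%m/%d/%Y", time.gmtime(unix_seconds))


print(format_observation_date(0))  # the Unix epoch: 01/01/1970
```

Passing, for example, the first value of the `Date` column would then date the title by the data and not by the run.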

## Latitude vs. Temperature Plot

In [None]:
# Assigning a column to hue maps its levels to the color of the points;
# a palette only takes effect when hue is set, so it is omitted here.
sns.scatterplot(data=city_weatherData, x="Lat", y="Max Temp")

In [None]:
# Latitude Vs Temperature Scatter Plot 
plt.scatter(city_weatherData['Lat'],city_weatherData['Max Temp'],facecolors = "magenta", edgecolors ="black", marker ="o")
plt.title(f'City Latitude vs. Max Temperature ({time.strftime("%m/%d/%Y")})')
plt.xlabel("Latitude")
plt.ylabel("Max Temperature (F)")
plt.grid(linestyle='-', linewidth=1, alpha = 0.5)

# Save Figure
plt.savefig("../output_data/City_Latitude_vs_Temperature.png")

plt.show()

## Latitude vs. Humidity Plot

In [None]:
# Latitude vs. Humidity Scatter Plot 
plt.scatter(city_weatherData["Lat"],city_weatherData["Humidity"],facecolors = "cyan", edgecolors ="black",marker ="o")
plt.title(f'City Latitude vs. Humidity ({time.strftime("%m/%d/%Y")})')
plt.xlabel("Latitude")
plt.ylabel("Humidity (%)")
plt.grid(linestyle='-', linewidth=1, alpha = 0.5)

# Save Figure
plt.savefig("../output_data/City_Latitude_vs_Humidity.png")

plt.show()

## Latitude vs. Cloudiness Plot

In [None]:
# Latitude vs. Cloudiness Scatter Plot
plt.scatter(city_weatherData['Lat'],city_weatherData['Cloudiness'],facecolors = "plum", edgecolors ="black", marker ="o")
plt.title(f'City Latitude vs. Cloudiness ({time.strftime("%m/%d/%Y")})')
plt.xlabel("Latitude")
plt.ylabel("Cloudiness (%)")
plt.grid(linestyle='-', linewidth=1, alpha = 0.5)

# Save Figure
plt.savefig("../output_data/City_Latitude_vs_Cloudiness.png")

plt.show()

## Latitude vs. Wind Speed Plot

In [None]:
# Latitude vs. Wind Speed Scatter Plot 
plt.scatter(city_weatherData["Lat"],city_weatherData["Wind Speed"],facecolors = "steelblue", edgecolors ="black", marker ="o")
plt.title(f'City Latitude vs. Wind Speed ({time.strftime("%m/%d/%Y")})')
plt.xlabel("Latitude")
plt.ylabel("Wind Speed (mph)")
plt.grid(linestyle='-', linewidth=1, alpha = 0.5)

# Save Figure
plt.savefig("../output_data/City_Latitude_vs_WindSpeed.png")

plt.show()

## Linear Regression

In [None]:
def plot_linregress(X, y, title):
    # Fit the regression once and reuse its outputs
    (slope, intercept, rvalue, pvalue, stderr) = linregress(X, y)
    # rvalue is Pearson's r; square it to report r-squared
    print(f"The r-squared is: {round(rvalue**2, 2)}")
    regress_values = X * slope + intercept
    line_eq = "y = " + str(round(slope,2)) + "x + " + str(round(intercept,2))
    plt.scatter(X, y)
    plt.plot(X,regress_values,"r-")
    plt.title(title + '\n' + line_eq)
    plt.xlabel(X.name)
    plt.ylabel(y.name)
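
Pearson's r and r-squared are easy to confuse: r carries the sign of the slope, while r-squared is its square and is never negative. A minimal pure-Python sketch of an ordinary least-squares fit that returns both ingredients; `fit_line` and the sample points are made up for illustration and are not the notebook's data:

```python
import math


def fit_line(xs, ys):
    """Return (slope, intercept, r) for a least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)  # Pearson correlation coefficient
    return slope, intercept, r


# Made-up points with a downward trend
lats = [0, 10, 20, 30, 40]
temps = [90, 82, 75, 61, 52]
slope, intercept, r = fit_line(lats, temps)
print(f"r = {r:.2f}, r-squared = {r * r:.2f}")
```

For a downward trend like this one, r is negative while r-squared stays positive, which is why the two should be labeled separately in the printed output.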


In [None]:
# Create Northern and Southern Hemisphere DataFrames
northern_hemisphere_df = city_weatherData.loc[city_weatherData['Lat'] >= 0]
print('northern_hemisphere_df.shape', northern_hemisphere_df.shape)

southern_hemisphere_df = city_weatherData.loc[city_weatherData['Lat'] < 0]
print('southern_hemisphere_df.shape', southern_hemisphere_df.shape)

####  Northern Hemisphere - Max Temp vs. Latitude Linear Regression

In [None]:
plot_linregress(northern_hemisphere_df['Lat'], northern_hemisphere_df['Max Temp'], 'Northern Hemisphere - Max Temp vs. Latitude Linear Regression')

####  Southern Hemisphere - Max Temp vs. Latitude Linear Regression

In [None]:
plot_linregress(southern_hemisphere_df['Lat'], southern_hemisphere_df['Max Temp'], 'Southern Hemisphere - Max Temp vs. Latitude Linear Regression')

####  Northern Hemisphere - Humidity (%) vs. Latitude Linear Regression

In [None]:
plot_linregress(northern_hemisphere_df['Lat'], northern_hemisphere_df['Humidity'], 'Northern Hemisphere - Humidity (%) vs. Latitude Linear Regression')

####  Southern Hemisphere - Humidity (%) vs. Latitude Linear Regression

In [None]:
plot_linregress(southern_hemisphere_df['Lat'], southern_hemisphere_df['Humidity'], 'Southern Hemisphere - Humidity (%) vs. Latitude Linear Regression')

####  Northern Hemisphere - Cloudiness (%) vs. Latitude Linear Regression

In [None]:
plot_linregress(northern_hemisphere_df['Lat'], northern_hemisphere_df['Cloudiness'], 'Northern Hemisphere - Cloudiness (%) vs. Latitude Linear Regression')

####  Southern Hemisphere - Cloudiness (%) vs. Latitude Linear Regression

In [None]:
plot_linregress(southern_hemisphere_df['Lat'], southern_hemisphere_df['Cloudiness'], 'Southern Hemisphere - Cloudiness (%) vs. Latitude Linear Regression')

####  Northern Hemisphere - Wind Speed (mph) vs. Latitude Linear Regression

In [None]:
plot_linregress(northern_hemisphere_df['Lat'], northern_hemisphere_df['Wind Speed'], 'Northern Hemisphere - Wind Speed (mph) vs. Latitude Linear Regression')

####  Southern Hemisphere - Wind Speed (mph) vs. Latitude Linear Regression

In [None]:
plot_linregress(southern_hemisphere_df['Lat'], southern_hemisphere_df['Wind Speed'], 'Southern Hemisphere - Wind Speed (mph) vs. Latitude Linear Regression')