## Create Latitude and Longitude Combinations

In [1]:
# Import the dependencies.
import pandas as pd
import matplotlib.pyplot as plt
import numpy as np

* The latitudes and longitudes need to be stored so that we can access them later. Since we are creating arrays of latitudes and longitudes, we'll declare each array as a variable.
* In the next cell, add the code that generates the random latitudes.
* A similar snippet generates the longitudes. To make sure we end up with enough coordinates, we'll start with 1,500 of each. We'll then pair the latitudes (lats) and longitudes (lngs) by zipping them into lat_lngs with the zip() function.

In [2]:
# Create a set of random latitude and longitude combinations.
lats = np.random.uniform(low=-90.000, high=90.000, size=1500)
lngs = np.random.uniform(low=-180.000, high=180.000, size=1500)
lat_lngs = zip(lats, lngs)
lat_lngs

<zip at 0x7f839f237240>

The zip object packs each pair of lats and lngs having the same index in their respective array into a tuple. If there are 1,500 latitudes and longitudes, there will be 1,500 tuples of paired latitudes and longitudes, where each latitude and longitude in a tuple can be accessed by the index of 0 and 1, respectively.
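As a small illustration of the pairing described above, the sketch below zips three latitudes with three longitudes. The hand-picked values are an assumption made for brevity; they stand in for the 1,500 random values in the notebook.

```python
# Hand-picked values stand in for the random NumPy arrays (an assumption for brevity).
lats = [42.36, -46.60, 7.56]
lngs = [-71.06, 168.33, -9.27]

lat_lngs = zip(lats, lngs)

# Consuming the zip object yields one (latitude, longitude) tuple per index.
coordinates = list(lat_lngs)
first_lat, first_lng = coordinates[0][0], coordinates[0][1]
print(coordinates)
print(first_lat, first_lng)

# A zip object is a one-shot iterator: a second pass over it yields nothing.
leftover = list(lat_lngs)
print(leftover)
```

The empty `leftover` list shows why the pairs are worth storing in a list before they are used more than once.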

Let's unpack our lat_lngs zip object into a list. Because the list can be reused, we only need to create the set of random latitudes and longitudes once.

In [3]:
# Add the latitudes and longitudes to a list.
coordinates = list(lat_lngs)

# 6.1.5 Generate Random World Cities

* With our list of random latitudes and longitudes, we'll use each pair in our coordinates list to find the nearest city using Python's citipy module.

In [4]:
!pip install citipy



In [5]:
# Use the citipy module to determine city based on latitude and longitude.
from citipy import citipy

In [6]:
# Create a list for holding the cities.
cities = []
# Identify the nearest city for each latitude and longitude combination.
for coordinate in coordinates:
    city = citipy.nearest_city(coordinate[0], coordinate[1]).city_name

    # If the city is unique, then we will add it to the cities list.
    if city not in cities:
        cities.append(city)
# Print the city count to confirm sufficient count.
len(cities)


631
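The `if city not in cities` check scans the whole list on every iteration. For 1,500 coordinates that is fine, but a set makes the membership test faster while a list keeps the original order. The sketch below shows that variation on a short, made-up list of names; the helper `unique_in_order` is hypothetical and not part of the notebook or of citipy.

```python
def unique_in_order(names):
    """Return names with duplicates removed, keeping first-seen order."""
    seen = set()      # fast membership checks
    unique = []       # preserves the order in which names first appear
    for name in names:
        if name not in seen:
            seen.add(name)
            unique.append(name)
    return unique

# Made-up sample standing in for the city names returned for each coordinate.
sample = ["bluff", "padang", "bluff", "kapaa", "padang"]
unique_cities = unique_in_order(sample)
print(unique_cities)
print(len(unique_cities))
```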

# 6.2.1 Understanding APIs

# 6.2.2 Get Started with OpenWeatherMap API

In [7]:
# Import the requests library.
import requests

# Import the API key.
from config import weather_api_key

In [8]:
# Starting URL for Weather Map API Call.
url = "http://api.openweathermap.org/data/2.5/weather?units=Imperial&APPID=" + weather_api_key

In [9]:
# Create an endpoint URL for a city.
city_url = url + "&q=" + "Boston"
print(city_url)

http://api.openweathermap.org/data/2.5/weather?units=Imperial&APPID=5b080f7f22dbf268c1c5ab4257892315&q=Boston
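String concatenation works for a simple city name such as Boston. As an alternative sketch, the standard library's `urllib.parse.urlencode` can assemble the query string and escape spaces or other special characters in one step. The helper name `build_city_url` and the placeholder key `YOUR_API_KEY` are assumptions for illustration only.

```python
from urllib.parse import urlencode

BASE_URL = "http://api.openweathermap.org/data/2.5/weather"

def build_city_url(city, api_key, units="Imperial"):
    """Build an endpoint URL; urlencode escapes the values (spaces become '+')."""
    query = urlencode({"units": units, "APPID": api_key, "q": city})
    return f"{BASE_URL}?{query}"

# "YOUR_API_KEY" is a placeholder, not a real key.
example_url = build_city_url("sao felix do xingu", "YOUR_API_KEY")
print(example_url)
```

Keeping the real key in a separate config file, as the notebook does, avoids hard-coding it in the URL-building code.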


## The JavaScript Object Notation Format for API Data

* When we retrieve data from a website, we have to make a "request," which returns data in a text format, not in a tab- or comma-separated file.
* One format we can use to parse data is JavaScript Object Notation (JSON).
* The JSON format is also referred to as an "object" or "JSON object."
* The data inside a JSON object opens and closes with curly braces, much like a Python dictionary.
* Inside the JSON object is a collection of dictionaries and arrays.
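To make the points above concrete, the sketch below parses a short JSON string with Python's built-in `json` module. The text is a trimmed, hand-written version of a weather response, used here only as an example.

```python
import json

# A trimmed, hand-written JSON document (text, as it would arrive over the network).
raw_text = (
    '{"coord": {"lon": -71.0598, "lat": 42.3584}, '
    '"weather": [{"main": "Clouds"}], '
    '"name": "Boston"}'
)

parsed = json.loads(raw_text)

# The outer JSON object becomes a Python dictionary.
print(type(parsed).__name__)
# A JSON array becomes a Python list.
print(type(parsed["weather"]).__name__)
# Nested values are reached by chaining keys and indexes.
print(parsed["coord"]["lat"], parsed["weather"][0]["main"])
```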

## The Python Requests Library


* To request JSON data over the internet, we use the Requests Library in Python.
* The Anaconda installation comes with the Requests Library. The exact version depends on the Anaconda release, so check it with requests.__version__.

In [10]:
import requests
requests.__version__

'2.26.0'

# 6.2.5 Parse a Response from an API
Before we collect weather data from more than 500 cities, we'll walk through how to get the weather data from Boston.

In [11]:
# Create an endpoint URL for a city.
city_url = url + "&q=" + "Boston"
city_weather = requests.get(city_url)
city_weather.json()

{'coord': {'lon': -71.0598, 'lat': 42.3584},
 'weather': [{'id': 801,
   'main': 'Clouds',
   'description': 'few clouds',
   'icon': '02n'}],
 'base': 'stations',
 'main': {'temp': 6.75,
  'feels_like': -3.01,
  'temp_min': -0.36,
  'temp_max': 13.95,
  'pressure': 1022,
  'humidity': 77},
 'visibility': 10000,
 'wind': {'speed': 5.32, 'deg': 264, 'gust': 11.65},
 'clouds': {'all': 11},
 'dt': 1643606746,
 'sys': {'type': 2,
  'id': 2039376,
  'country': 'US',
  'sunrise': 1643630333,
  'sunset': 1643666182},
 'timezone': -18000,
 'id': 4930956,
 'name': 'Boston',
 'cod': 200}

### 1. In a new cell, let's assign the city_weather.json() data to the variable "boston_data" and run the cell.

In [12]:
# Get the JSON data.
boston_data = city_weather.json()
boston_data

{'coord': {'lon': -71.0598, 'lat': 42.3584},
 'weather': [{'id': 801,
   'main': 'Clouds',
   'description': 'few clouds',
   'icon': '02n'}],
 'base': 'stations',
 'main': {'temp': 6.75,
  'feels_like': -3.01,
  'temp_min': -0.36,
  'temp_max': 13.95,
  'pressure': 1022,
  'humidity': 77},
 'visibility': 10000,
 'wind': {'speed': 5.32, 'deg': 264, 'gust': 11.65},
 'clouds': {'all': 11},
 'dt': 1643606746,
 'sys': {'type': 2,
  'id': 2039376,
  'country': 'US',
  'sunrise': 1643630333,
  'sunset': 1643666182},
 'timezone': -18000,
 'id': 4930956,
 'name': 'Boston',
 'cod': 200}

### 2. Next, using the sys key to get the corresponding value, we type boston_data["sys"] in a new cell and run the cell. The output is another dictionary, as shown below.

In [13]:
boston_data["sys"]

{'type': 2,
 'id': 2039376,
 'country': 'US',
 'sunrise': 1643630333,
 'sunset': 1643666182}

### 3. If we add the country key in brackets after the sys key, and run the cell again, 'US' will be returned in the output.

In [14]:
boston_data["sys"]["country"]

'US'

If we want to retrieve the date in the weather data, we would add the dt key to the boston_data variable like this: boston_data["dt"].

In [15]:
boston_data["dt"]

1643606746

* Using syntax similar to what we used to get the date, we can get the latitude, longitude, maximum temperature, humidity, percent cloudiness, and wind speed.

Add the following code to a new cell and run the cell.

In [16]:
lat = boston_data["coord"]["lat"]
lng = boston_data["coord"]["lon"]
max_temp = boston_data["main"]["temp_max"]
humidity = boston_data["main"]["humidity"]
clouds = boston_data["clouds"]["all"]
wind = boston_data["wind"]["speed"]
print(lat, lng, max_temp, humidity, clouds, wind)

42.3584 -71.0598 13.95 77 11 5.32
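The same lookups can be wrapped in a small function, which also makes it easy to see what happens when a key is missing. This is only a sketch: `extract_weather_fields` is a hypothetical helper, `boston_data` is a trimmed copy of the response shown earlier, and the shape of the error payload is an assumption.

```python
# A trimmed copy of the Boston response shown earlier.
boston_data = {
    "coord": {"lon": -71.0598, "lat": 42.3584},
    "main": {"temp_max": 13.95, "humidity": 77},
    "wind": {"speed": 5.32},
    "clouds": {"all": 11},
}

def extract_weather_fields(data):
    """Return the six values of interest; raises KeyError if a key is missing."""
    return {
        "Lat": data["coord"]["lat"],
        "Lng": data["coord"]["lon"],
        "Max Temp": data["main"]["temp_max"],
        "Humidity": data["main"]["humidity"],
        "Cloudiness": data["clouds"]["all"],
        "Wind Speed": data["wind"]["speed"],
    }

fields = extract_weather_fields(boston_data)
print(fields)

# Hypothetical error payload (its exact shape is an assumption): no "coord" key.
error_payload = {"cod": "404", "message": "city not found"}
try:
    extract_weather_fields(error_payload)
    lookup_failed = False
except KeyError:
    lookup_failed = True
print(lookup_failed)
```

A response without the expected keys raises `KeyError`, which is the kind of failure a `try`/`except` around these lookups has to handle.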


## Convert the Date Timestamp

* If we want to convert the timestamp to an International Organization for Standardization (ISO) style format such as YYYY-MM-DD HH:MM:SS, we need to use the Python datetime module.

Let's convert the date from the Boston weather data in the JSON format to the ISO format.

In [17]:
# Import the datetime module from the datetime library.
from datetime import datetime
# Get the date from the JSON file.
date = boston_data["dt"]
# Convert the UTC date to a date format with year, month, day, hours, minutes, and seconds.
datetime.utcfromtimestamp(date)

datetime.datetime(2022, 1, 31, 5, 25, 46)

We can convert this datetime object to a string such as 2022-01-31 05:25:46 using the Python string format method strftime() and adding how we want the string to look inside the parentheses.

In [18]:
datetime.utcfromtimestamp(date).strftime('%Y-%m-%d %H:%M:%S')

'2022-01-31 05:25:46'
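`datetime.utcfromtimestamp()` is deprecated as of Python 3.12. A minimal alternative, assuming you are free to change the call, is `datetime.fromtimestamp()` with an explicit UTC time zone. The sketch below reuses the Boston timestamp from above and produces the same formatted string.

```python
from datetime import datetime, timezone

# The "dt" value from the Boston response shown earlier.
date = 1643606746

# Build a timezone-aware datetime in UTC instead of a naive one.
aware_utc = datetime.fromtimestamp(date, tz=timezone.utc)

# Format it exactly as before.
formatted = aware_utc.strftime('%Y-%m-%d %H:%M:%S')
print(formatted)
```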

# 6.2.6 Get the City Weather Data

## Import Dependencies, and Initialize an Empty List and Counters

* At the top of our code block, we'll declare an empty list, city_data = []; add a print statement that marks the beginning of the logging; and create two counters: one for the record number (1–50) and one for the set number.
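To see how the two counters relate to the position in the cities list, the sketch below computes them directly from a zero-based index with `divmod`. The function `log_position` is a hypothetical helper; the notebook keeps running counters instead, and this is only an equivalent way to reason about them.

```python
def log_position(i, batch_size=50):
    """Map a zero-based loop index to (record_count, set_count)."""
    sets_completed, offset = divmod(i, batch_size)
    # Both counters are reported starting from 1.
    return offset + 1, sets_completed + 1

for i in (0, 49, 50, 149, 150):
    record_count, set_count = log_position(i)
    print(f"index {i}: Record {record_count} of Set {set_count}")
```

Every 50th city starts a new set, so index 50 is logged as Record 1 of Set 2.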

We will now work in our WeatherPy.ipynb file. Before continuing, make sure the following tasks are completed:

* Import your Requests Library and the weather_api_key.
* Build the basic URL for the OpenWeatherMap with your weather_api_key added to the URL.
* Also, import the time library, as well as the datetime module using the following code:

In [19]:
# Import the time library and the datetime module from the datetime library 
import time
from datetime import datetime

Next, add the following code to a new cell, but don't run the cell. Instead, continue to add on to this code block.


In [20]:
# Create an empty list to hold the weather data.
city_data = []
# Print the beginning of the logging.
print("Beginning Data Retrieval     ")
print("-----------------------------")

# Create counters.
record_count = 1
set_count = 1

# Loop through all the cities in the list.
for i, city in enumerate(cities):

    # Group cities in sets of 50 for logging purposes.
    if (i % 50 == 0 and i >= 50):
        set_count += 1
        record_count = 1
        time.sleep(60)

    # Create endpoint URL with each city.
    city_url = url + "&q=" + city.replace(" ","+")

    # Log the URL, record, and set numbers and the city.
    print(f"Processing Record {record_count} of Set {set_count} | {city}")
    # Add 1 to the record count.
    record_count += 1
    
    # Run an API request for each of the cities.
    try:
        # Parse the JSON and retrieve data.
        city_weather = requests.get(city_url).json()
        # Parse out the needed data.
        city_lat = city_weather["coord"]["lat"]
        city_lng = city_weather["coord"]["lon"]
        city_max_temp = city_weather["main"]["temp_max"]
        city_humidity = city_weather["main"]["humidity"]
        city_clouds = city_weather["clouds"]["all"]
        city_wind = city_weather["wind"]["speed"]
        city_country = city_weather["sys"]["country"]
        # Convert the date to ISO standard.
        city_date = datetime.utcfromtimestamp(city_weather["dt"]).strftime('%Y-%m-%d %H:%M:%S')
        # Append the city information into city_data list.
        city_data.append({"City": city.title(),
                          "Lat": city_lat,
                          "Lng": city_lng,
                          "Max Temp": city_max_temp,
                          "Humidity": city_humidity,
                          "Cloudiness": city_clouds,
                          "Wind Speed": city_wind,
                          "Country": city_country,
                          "Date": city_date})

    # If an error is experienced, skip the city.
    # Catch Exception rather than using a bare except, so Ctrl+C can still stop the loop.
    except Exception:
        print("City not found. Skipping...")

# Indicate that Data Loading is complete.
print("-----------------------------")
print("Data Retrieval Complete      ")
print("-----------------------------")    

Beginning Data Retrieval     
-----------------------------
Processing Record 1 of Set 1 | severo-kurilsk
Processing Record 2 of Set 1 | bluff
Processing Record 3 of Set 1 | yomou
Processing Record 4 of Set 1 | padang
Processing Record 5 of Set 1 | mwene-ditu
Processing Record 6 of Set 1 | camocim
Processing Record 7 of Set 1 | sao felix do xingu
Processing Record 8 of Set 1 | meadow lake
Processing Record 9 of Set 1 | bambous virieux
Processing Record 10 of Set 1 | oussouye
Processing Record 11 of Set 1 | cherskiy
Processing Record 12 of Set 1 | port alfred
Processing Record 13 of Set 1 | siocon
Processing Record 14 of Set 1 | saint-georges
Processing Record 15 of Set 1 | thanh hoa
Processing Record 16 of Set 1 | punta arenas
Processing Record 17 of Set 1 | kapaa
Processing Record 18 of Set 1 | cidreira
Processing Record 19 of Set 1 | mokhsogollokh
Processing Record 20 of Set 1 | rikitea
Processing Record 21 of Set 1 | thinadhoo
Processing Record 22 of Set 1 | saint george
Processing 

Processing Record 35 of Set 4 | puerto escondido
Processing Record 36 of Set 4 | klaksvik
Processing Record 37 of Set 4 | egvekinot
Processing Record 38 of Set 4 | belyy yar
Processing Record 39 of Set 4 | san vicente
Processing Record 40 of Set 4 | coquimbo
Processing Record 41 of Set 4 | pacific grove
Processing Record 42 of Set 4 | khatanga
Processing Record 43 of Set 4 | kishtwar
Processing Record 44 of Set 4 | tiksi
Processing Record 45 of Set 4 | nome
Processing Record 46 of Set 4 | marcona
City not found. Skipping...
Processing Record 47 of Set 4 | westport
Processing Record 48 of Set 4 | georgetown
Processing Record 49 of Set 4 | bandarbeyla
Processing Record 50 of Set 4 | gafanha da encarnacao
Processing Record 1 of Set 5 | beringovskiy
Processing Record 2 of Set 5 | moranbah
Processing Record 3 of Set 5 | ahuimanu
Processing Record 4 of Set 5 | tuytepa
Processing Record 5 of Set 5 | port hedland
Processing Record 6 of Set 5 | atuona
Processing Record 7 of Set 5 | saint-philip

Processing Record 27 of Set 8 | penzance
Processing Record 28 of Set 8 | ust-ilimsk
Processing Record 29 of Set 8 | grand gaube
Processing Record 30 of Set 8 | livingstone
Processing Record 31 of Set 8 | bintulu
Processing Record 32 of Set 8 | bosaso
Processing Record 33 of Set 8 | karamea
City not found. Skipping...
Processing Record 34 of Set 8 | fontem
Processing Record 35 of Set 8 | kars
Processing Record 36 of Set 8 | koubia
Processing Record 37 of Set 8 | manama
Processing Record 38 of Set 8 | nizhneyansk
City not found. Skipping...
Processing Record 39 of Set 8 | pavino
Processing Record 40 of Set 8 | labe
Processing Record 41 of Set 8 | lagoa
Processing Record 42 of Set 8 | bontang
Processing Record 43 of Set 8 | bobonong
City not found. Skipping...
Processing Record 44 of Set 8 | baykit
Processing Record 45 of Set 8 | kyrylivka
Processing Record 46 of Set 8 | margate
Processing Record 47 of Set 8 | fortuna
Processing Record 48 of Set 8 | port moresby
Processing Record 49 of Se

Processing Record 14 of Set 12 | harrisonville
Processing Record 15 of Set 12 | belomorsk
Processing Record 16 of Set 12 | rungata
City not found. Skipping...
Processing Record 17 of Set 12 | boa vista
Processing Record 18 of Set 12 | hami
Processing Record 19 of Set 12 | horn lake
Processing Record 20 of Set 12 | aquiraz
Processing Record 21 of Set 12 | taltal
Processing Record 22 of Set 12 | semey
Processing Record 23 of Set 12 | inderborskiy
City not found. Skipping...
Processing Record 24 of Set 12 | burnie
Processing Record 25 of Set 12 | verkhnyaya toyma
Processing Record 26 of Set 12 | mingoyo
Processing Record 27 of Set 12 | sao gabriel
Processing Record 28 of Set 12 | palmer
Processing Record 29 of Set 12 | college
Processing Record 30 of Set 12 | yongzhou
Processing Record 31 of Set 12 | deep river
Processing Record 32 of Set 12 | karaul
City not found. Skipping...
Processing Record 33 of Set 12 | ha giang
Processing Record 34 of Set 12 | ambon
Processing Record 35 of Set 12 

In [25]:
len(city_data)

576

# 6.2.7 Create a DataFrame of City Weather Data

### Convert the array of dictionaries to a Pandas DataFrame

In [26]:
# Convert the array of dictionaries to a Pandas DataFrame.
city_data_df = pd.DataFrame(city_data)
city_data_df.head(10)

| | City | Lat | Lng | Max Temp | Humidity | Cloudiness | Wind Speed | Country | Date |
|---|---|---|---|---|---|---|---|---|---|
| 0 | Severo-Kurilsk | 50.6789 | 156.125 | 29.23 | 82 | 98 | 11.16 | RU | 2022-01-31 05:26:34 |
| 1 | Bluff | -46.6 | 168.3333 | 63.14 | 84 | 96 | 3.38 | NZ | 2022-01-31 05:26:35 |
| 2 | Yomou | 7.5603 | -9.2653 | 61.77 | 37 | 1 | 1.1 | GN | 2022-01-31 05:26:35 |
| 3 | Padang | -0.9492 | 100.3543 | 86.05 | 63 | 82 | 4.81 | ID | 2022-01-31 05:26:36 |
| 4 | Mwene-Ditu | -7.0 | 23.45 | 64.31 | 97 | 98 | 3.49 | CD | 2022-01-31 05:26:36 |
| 5 | Camocim | -2.9022 | -40.8411 | 75.76 | 90 | 99 | 7.96 | BR | 2022-01-31 05:26:37 |
| 6 | Sao Felix Do Xingu | -6.6447 | -51.995 | 69.15 | 97 | 100 | 0.29 | BR | 2022-01-31 05:26:37 |
| 7 | Meadow Lake | 34.8014 | -106.5436 | 38.41 | 53 | 70 | 2.1 | US | 2022-01-31 05:26:37 |
| 8 | Bambous Virieux | -20.3428 | 57.7575 | 86.22 | 79 | 75 | 11.5 | MU | 2022-01-31 05:26:38 |
| 9 | Oussouye | 12.485 | -16.5469 | 70.77 | 53 | 51 | 7.92 | SN | 2022-01-31 05:26:39 |
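The conversion works because each dictionary in the list becomes one row and each key becomes a column. The sketch below shows this with two shortened records taken from the table above; it assumes pandas is installed, as in the rest of the notebook.

```python
import pandas as pd

# Two shortened records; every dictionary uses the same keys.
city_data = [
    {"City": "Bluff", "Lat": -46.6, "Lng": 168.3333, "Max Temp": 63.14, "Country": "NZ"},
    {"City": "Padang", "Lat": -0.9492, "Lng": 100.3543, "Max Temp": 86.05, "Country": "ID"},
]

# Each dictionary becomes a row; the keys become the column labels.
city_data_df = pd.DataFrame(city_data)

print(list(city_data_df.columns))
print(city_data_df.shape)
```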


### Reorder columns in the DataFrame

In [31]:
# Reorder the columns. Each label must match an existing column name exactly.
city_data_df = city_data_df[["City", "Country", "Date", "Lat", "Lng", "Max Temp", "Humidity", "Cloudiness",
                             "Wind Speed"]]
city_data_df.head()

| | City | Country | Date | Lat | Lng | Max Temp | Humidity | Cloudiness | Wind Speed |
|---|---|---|---|---|---|---|---|---|---|
| 0 | Severo-Kurilsk | RU | 2022-01-31 05:26:34 | 50.6789 | 156.125 | 29.23 | 82 | 98 | 11.16 |
| 1 | Bluff | NZ | 2022-01-31 05:26:35 | -46.6 | 168.3333 | 63.14 | 84 | 96 | 3.38 |
| 2 | Yomou | GN | 2022-01-31 05:26:35 | 7.5603 | -9.2653 | 61.77 | 37 | 1 | 1.1 |
| 3 | Padang | ID | 2022-01-31 05:26:36 | -0.9492 | 100.3543 | 86.05 | 63 | 82 | 4.81 |
| 4 | Mwene-Ditu | CD | 2022-01-31 05:26:36 | -7.0 | 23.45 | 64.31 | 97 | 98 | 3.49 |


### Save the DataFrame as a CSV file in a new folder

In [33]:
# Create the output file (CSV).
output_data_file = "weather_data/cities.csv"
# Export the City_Data into a CSV.
city_data_df.to_csv(output_data_file, index_label="City_ID")
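`to_csv()` does not create missing folders, so the export fails if the `weather_data` folder does not exist yet. One way to guard against that, sketched below with the standard library only, is to create the parent folder before writing. The temporary directory is an assumption that keeps the demo from touching your project files; in the notebook you would apply `os.makedirs` to the folder part of `output_data_file` directly.

```python
import os
import tempfile

# A temporary directory stands in for the project folder (assumption for the demo).
project_dir = tempfile.mkdtemp()
output_data_file = os.path.join(project_dir, "weather_data", "cities.csv")

# Create the parent folder if it is missing; exist_ok avoids an error on reruns.
os.makedirs(os.path.dirname(output_data_file), exist_ok=True)

folder_ready = os.path.isdir(os.path.dirname(output_data_file))
print(folder_ready)
```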