# Deliverable 1. Retrieve Weather Data.
---
1. Create a folder called `Weather_Database` to save all the files related to this deliverable.

2. Save the `Weather_Database_starter_code.ipynb` starter code to the `Weather_Database` folder and rename it as `Weather_Database.ipynb`.

3. Use the `np.random.uniform` function to generate a new set of 2,000 random latitudes and 2,000 longitudes.

4. Use the `citipy` module to get the nearest city for each latitude and longitude combination.

5. Import your OpenWeatherMap API key and assemble the API call URL as a string variable. Remember to edit the `config.py` file to add your API key, and never publish the key to your GitHub repository.

6. Retrieve the following information from the API call:

    * Latitude and longitude

    * Maximum temperature

    * Percent humidity

    * Percent cloudiness

    * Wind speed

    * Weather description (for example, clouds, fog, light rain, clear sky)

7. Add the weather data to a new DataFrame.

8. Export the DataFrame as a CSV file, and save it as `WeatherPy_Database.csv` in the `Weather_Database` folder.
---

In [11]:
# Import the dependencies used throughout the notebook
import pandas as pd
import numpy as np
import timeit
from citipy import citipy
import time
from datetime import datetime
import requests
import matplotlib.pyplot as plt

## Use the `np.random.uniform` function to generate a new set of 2,000 random latitudes and 2,000 longitudes.

In [7]:
# Create a set of random latitude and longitude combinations
place_count = 2000 
lats = np.random.uniform(low=-90.000, high=90.000, size=place_count)
lngs = np.random.uniform(low=-180.000, high=180.000, size=place_count)
coordinates = zip(lats, lngs)
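
One detail worth noting: in Python 3, `zip` returns a one-shot iterator, so `coordinates` can be looped over only once, and re-running the next cell without re-running this one would find no coordinates left. The sketch below illustrates the effect and one possible workaround (wrapping the result in `list`). It uses the standard `random` module in place of NumPy, an assumption made only to keep the example dependency-free.

```python
import random

# Stand-in for the NumPy arrays above: five random latitudes and longitudes.
place_count = 5
lats = [random.uniform(-90.0, 90.0) for _ in range(place_count)]
lngs = [random.uniform(-180.0, 180.0) for _ in range(place_count)]

# zip() yields each pair once; a second pass finds nothing left.
one_shot = zip(lats, lngs)
first_pass = [pair for pair in one_shot]
second_pass = [pair for pair in one_shot]

# Materializing the pairs in a list allows repeated iteration.
reusable = list(zip(lats, lngs))

print(len(first_pass), len(second_pass), len(reusable))  # 5 0 5
```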

## Use the `citipy` module to get the nearest city for each latitude and longitude combination.

In [8]:
# Create a list for holding the cities.
cities = []
# Identify the nearest city for each latitude and longitude combination.
for coordinate in coordinates:
    city = citipy.nearest_city(coordinate[0], coordinate[1]).city_name

    # If the city is unique, then we will add it to the cities list.
    if city not in cities:
        cities.append(city)
# Print the city count to confirm sufficient count.
len(cities)

763
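
The `city not in cities` check scans the whole list on every pass. That is fine for 2,000 points. As a hedged alternative, `dict.fromkeys` removes duplicates in one step while keeping first-seen order. The names in `nearest_names` are sample values standing in for the `citipy` lookups.

```python
# Sample city names standing in for citipy.nearest_city(...).city_name results.
nearest_names = ["kabalo", "kindu", "kabalo", "rikitea", "kindu", "kapaa"]

# dict keys are unique and keep insertion order (Python 3.7+).
cities = list(dict.fromkeys(nearest_names))

print(cities)       # ['kabalo', 'kindu', 'rikitea', 'kapaa']
print(len(cities))  # 4
```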

## Import your OpenWeatherMap API key and assemble the API call URL as a string variable. Remember to edit the `config.py` file to add your API key.

In [10]:
# Import the requests library
import requests

# Import the time library
import time

# Import the datetime module from the datetime library
from datetime import datetime

# Import the OpenWeatherMap API key
from config import weather_api_key
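
Listing `config.py` in `.gitignore` keeps the key out of commits. Another option, sketched below, is to read the key from an environment variable so that it never lives in a file inside the repository. The function `load_api_key` and the variable name `OPENWEATHER_API_KEY` are hypothetical choices for this sketch, not part of the starter code.

```python
import os

def load_api_key(env_var="OPENWEATHER_API_KEY"):
    """Return the API key stored in an environment variable.

    The default variable name is a hypothetical choice for this sketch.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable first.")
    return key
```

With the variable set in the shell, `weather_api_key = load_api_key()` could replace the `config` import.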

In [12]:
# Assemble the OpenWeatherMap API call (HTTPS keeps the API key out of plain-text traffic)
url = "https://api.openweathermap.org/data/2.5/weather?units=Imperial&APPID=" + weather_api_key
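
String concatenation works here, but city names can contain spaces and other characters that need escaping. A minimal sketch of an alternative, assuming the same endpoint and query parameters as the notebook, is to let `urllib.parse.urlencode` build the query string. The helper `build_city_url`, the HTTPS base URL, and the `"demo-key"` placeholder are assumptions of this sketch.

```python
from urllib.parse import urlencode

# Same endpoint as the notebook; HTTPS is this sketch's choice.
BASE_URL = "https://api.openweathermap.org/data/2.5/weather"

def build_city_url(city, api_key, units="Imperial"):
    """Return the request URL for one city, with every parameter URL-encoded."""
    query = urlencode({"units": units, "APPID": api_key, "q": city})
    return f"{BASE_URL}?{query}"

# "demo-key" is a placeholder, not a real key.
print(build_city_url("cabo san lucas", "demo-key"))
# https://api.openweathermap.org/data/2.5/weather?units=Imperial&APPID=demo-key&q=cabo+san+lucas
```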

## Retrieve the following information from the API call.

- The latitude and longitude
- The maximum temperature
- The % humidity
- The % cloudiness
- The wind speed
- The weather description, e.g., cloudy, fog, light rain, clear sky

In [21]:
# Create an empty list to hold weather data for each city
city_data = []

# Print a message to indicate that the data retrieval starts
print("Beginning Data Retrieval     ")
print("-----------------------------")

# Create counters and set them to 1
record_count = 1
set_count = 1

# Loop through all the cities in our list to fetch weather data for each city
for i, city in enumerate(cities):
        
    # Group cities in sets of 50 for logging purposes
    if (i % 50 == 0 and i >= 50):
        set_count += 1
        record_count = 1
        time.sleep(60)

    # Create an endpoint URL for each city
    city_url = url + "&q=" + city.replace(" ","+")
    
    # Log the url, record, and set numbers
    print(f"Processing Record {record_count} of Set {set_count} | {city}")

    # Add 1 to the record count
    record_count += 1

    # Run an API request for each of the cities
    try:
        # Request the current weather once and decode the JSON response
        city_weather = requests.get(city_url).json()
        # Parse out the needed data.
        city_lat = city_weather["coord"]["lat"]
        city_lng = city_weather["coord"]["lon"]
        city_max_temp = city_weather["main"]["temp_max"]
        city_humidity = city_weather["main"]["humidity"]
        city_clouds = city_weather["clouds"]["all"]
        city_wind = city_weather["wind"]["speed"]
        city_country = city_weather["sys"]["country"]
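        # "main" is the broad category (e.g. "Clouds"); the finer text (e.g. "light rain") is under "description"
        city_description = city_weather["weather"][0]['main']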
        # Convert the date to ISO standard.
        city_date = datetime.utcfromtimestamp(city_weather["dt"]).strftime('%Y-%m-%d %H:%M:%S')
        # Append the city information into city_data list.
        city_data.append({"City": city.title(),
                          "Lat": city_lat,
                          "Lng": city_lng,
                          "Max Temp": city_max_temp,
                          "Humidity": city_humidity,
                          "Cloudiness": city_clouds,
                          "Wind Speed": city_wind,
                          "Country": city_country,
                          "Date": city_date,
                          "City Weather":city_description})     
    
    # If the request fails or an expected field is missing, skip the city
    except Exception:
        print("City not found. Skipping...")

# Indicate that the data retrieval is complete 
print("-----------------------------")
print("Data Retrieval Complete      ")
print("-----------------------------")

Beginning Data Retrieval     
-----------------------------
Processing Record 1 of Set 1 | attawapiskat
City not found. Skipping...
Processing Record 2 of Set 1 | kabalo
Processing Record 3 of Set 1 | kamenka
Processing Record 4 of Set 1 | tuktoyaktuk
Processing Record 5 of Set 1 | umm lajj
Processing Record 6 of Set 1 | nouadhibou
Processing Record 7 of Set 1 | kindu
Processing Record 8 of Set 1 | rikitea
Processing Record 9 of Set 1 | kapaa
Processing Record 10 of Set 1 | kuche
City not found. Skipping...
Processing Record 11 of Set 1 | nexo
Processing Record 12 of Set 1 | cabo san lucas
Processing Record 13 of Set 1 | atuona
Processing Record 14 of Set 1 | ushuaia
Processing Record 15 of Set 1 | san rafael
Processing Record 16 of Set 1 | gayny
Processing Record 17 of Set 1 | bethel
Processing Record 18 of Set 1 | portobelo
Processing Record 19 of Set 1 | clyde river
Processing Record 20 of Set 1 | mikuni
Processing Record 21 of Set 1 | batagay-alyta
Processing Record 22 of Set 1 | s
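
The parsing step inside the loop can also be pulled into a small function, which makes it possible to check the field extraction without calling the API. The sketch below is one way to do that, not the notebook's own code: `parse_city_weather` is a hypothetical helper, and both payloads are made-up values shaped like the fields the loop reads.

```python
from datetime import datetime, timezone

def parse_city_weather(city, payload):
    """Return one city_data row, or None when the payload lacks an expected field."""
    try:
        stamp = datetime.fromtimestamp(payload["dt"], tz=timezone.utc)
        return {
            "City": city.title(),
            "Lat": payload["coord"]["lat"],
            "Lng": payload["coord"]["lon"],
            "Max Temp": payload["main"]["temp_max"],
            "Humidity": payload["main"]["humidity"],
            "Cloudiness": payload["clouds"]["all"],
            "Wind Speed": payload["wind"]["speed"],
            "Country": payload["sys"]["country"],
            "Date": stamp.strftime("%Y-%m-%d %H:%M:%S"),
            "City Weather": payload["weather"][0]["main"],
        }
    except (KeyError, IndexError, TypeError):
        return None

# Made-up payload for illustration only; dt=0 is the Unix epoch.
sample = {
    "coord": {"lat": 1.5, "lon": -2.5},
    "main": {"temp_max": 70.0, "humidity": 50},
    "clouds": {"all": 10},
    "wind": {"speed": 3.0},
    "sys": {"country": "XX"},
    "weather": [{"main": "Clouds", "description": "few clouds"}],
    "dt": 0,
}

row = parse_city_weather("sample city", sample)
missing = parse_city_weather("nowhere", {"message": "illustrative error payload"})

print(row["City"], row["Date"])  # Sample City 1970-01-01 00:00:00
print(missing)                   # None
```

Inside the loop, a `None` result would correspond to the "City not found. Skipping..." branch.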

In [22]:
# Print the length of the city_data list to verify how many cities you have
len(city_data)

692

## Add the weather data to a new DataFrame.

In [23]:
# Use the city_data list to create a new pandas DataFrame.
city_data_df = pd.DataFrame(city_data)


In [24]:
# Display sample data
city_data_df.head(10)

|  | City | Lat | Lng | Max Temp | Humidity | Cloudiness | Wind Speed | Country | Date | City Weather |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | Kabalo | -6.05 | 26.9167 | 70.45 | 96 | 100 | 4.14 | CD | 2023-02-19 21:37:03 | Clouds |
| 1 | Kamenka | 51.3223 | 42.7678 | 28.18 | 96 | 100 | 8.86 | RU | 2023-02-19 21:37:04 | Clouds |
| 2 | Tuktoyaktuk | 69.4541 | -133.0374 | -25.6 | 68 | 75 | 5.75 | CA | 2023-02-19 21:37:04 | Clouds |
| 3 | Umm Lajj | 25.0213 | 37.2685 | 71.71 | 62 | 81 | 5.68 | SA | 2023-02-19 21:37:05 | Clouds |
| 4 | Nouadhibou | 20.931 | -17.0347 | 66.18 | 72 | 87 | 13.8 | MR | 2023-02-19 21:37:06 | Clouds |
| 5 | Kindu | -2.95 | 25.95 | 73.96 | 86 | 100 | 2.86 | CD | 2023-02-19 21:37:06 | Clouds |
| 6 | Rikitea | -23.1203 | -134.9692 | 79.77 | 74 | 13 | 15.9 | PF | 2023-02-19 21:37:07 | Clouds |
| 7 | Kapaa | 22.0752 | -159.319 | 80.58 | 85 | 100 | 4.0 | US | 2023-02-19 21:37:08 | Rain |
| 8 | Nexo | 55.0607 | 15.1306 | 39.29 | 83 | 100 | 24.47 | DK | 2023-02-19 21:37:09 | Clouds |
| 9 | Cabo San Lucas | 22.8909 | -109.9124 | 80.62 | 50 | 0 | 14.97 | MX | 2023-02-19 21:34:37 | Clear |


In [25]:
# Display the DataFrame's column names using the pandas columns attribute
city_data_df.columns

Index(['City', 'Lat', 'Lng', 'Max Temp', 'Humidity', 'Cloudiness',
       'Wind Speed', 'Country', 'Date', 'City Weather'],
      dtype='object')

In [26]:
# Create a list to reorder the column names as follows:
# "City", "Country", "Lat", "Lng", "Max Temp", "Humidity", "Cloudiness", "Wind Speed", "City Weather"
new_column_order = ['City', 'Country', 'Lat', 'Lng', 
                    'Max Temp', 'Humidity', 'Cloudiness',
                    'Wind Speed', 'City Weather']

# Recreate the DataFrame by using the new column order
city_data_df = city_data_df[new_column_order]

In [27]:
# Display sample data
city_data_df

|  | City | Country | Lat | Lng | Max Temp | Humidity | Cloudiness | Wind Speed | City Weather |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | Kabalo | CD | -6.0500 | 26.9167 | 70.45 | 96 | 100 | 4.14 | Clouds |
| 1 | Kamenka | RU | 51.3223 | 42.7678 | 28.18 | 96 | 100 | 8.86 | Clouds |
| 2 | Tuktoyaktuk | CA | 69.4541 | -133.0374 | -25.60 | 68 | 75 | 5.75 | Clouds |
| 3 | Umm Lajj | SA | 25.0213 | 37.2685 | 71.71 | 62 | 81 | 5.68 | Clouds |
| 4 | Nouadhibou | MR | 20.9310 | -17.0347 | 66.18 | 72 | 87 | 13.80 | Clouds |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 687 | Tabas | IR | 33.5959 | 56.9244 | 46.62 | 42 | 0 | 2.30 | Clear |
| 688 | Olafsvik | IS | 64.8945 | -23.7142 | 36.55 | 90 | 100 | 28.72 | Clouds |
| 689 | San Fernando | PH | 15.0286 | 120.6898 | 78.53 | 88 | 96 | 0.34 | Clouds |
| 690 | Carutapera | BR | -1.1950 | -46.0200 | 74.17 | 96 | 100 | 7.65 | Clouds |


In [29]:
# Display the data type of each column using the pandas dtypes attribute
city_data_df.dtypes

City             object
Country          object
Lat             float64
Lng             float64
Max Temp        float64
Humidity          int64
Cloudiness        int64
Wind Speed      float64
City Weather     object
dtype: object

## Export the DataFrame as a CSV file, and save it as `WeatherPy_Database.csv` in the `Weather_Database` folder.

In [36]:
# Set the output file name
output_data_file = r'C:\Users\ssala\Class\World_Weather_Analysis\Weather_Database/WeatherPy_Database.csv'

# Export the city_data_df DataFrame into a CSV file
# (index_label="None" is a string, so the index column's header is written literally as "None")
city_data_df.to_csv(output_data_file, index_label="None")
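
The absolute path above only resolves on the author's machine. As a hedged alternative, `pathlib` can assemble the same file name relative to the working directory with the correct separator for the operating system. The folder and file names come from the deliverable; treating them as relative to the working directory is an assumption of this sketch.

```python
from pathlib import Path

# Folder and file names come from the deliverable; the relative location is an assumption.
output_dir = Path("Weather_Database")
output_data_file = output_dir / "WeatherPy_Database.csv"

print(output_data_file.name)    # WeatherPy_Database.csv
print(output_data_file.parent)  # Weather_Database
```

pandas `to_csv` accepts a path object, so `output_data_file` can be passed to it unchanged. If the folder might not exist yet, `output_dir.mkdir(exist_ok=True)` creates it first.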