<a href="https://colab.research.google.com/github/Iamcuriousity/Alokita-Jha/blob/main/Assignment_7_1_Alokitaipynb.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>


## Exploratory Data Analysis: Surface Water Quantification Dataset

### Dataset Overview
The surface water quantification dataset reports the historical and near-real-time surface extent of inland water bodies. It is derived from satellite-based remote sensing, which allows continuous observation, detection of long-term changes, and identification of flooded and water-short areas. Policymakers, researchers, and environmentalists can use it to understand surface water dynamics and to plan for sustainable water use and ecosystem preservation.

### Dataset Attributes
1. **Asset ID:** The unique identifier for the water body, also known as an asset.
2. **Recorded At:** The date for which the data is reported, ranging from 2018-01-01 to 2023-08-01.
3. **Area (in km²):** The surface area of the water body.
4. **Perimeter (in km):** The surface perimeter of the water body.
5. **Name:** Name of the water body.
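
The attributes above can be pictured as one typed record per observation. The following is a minimal sketch, not the provider's schema: the class name `WaterRecord` and its field names are hypothetical, chosen only to mirror the list, and the API response used later in this notebook exposes `datetime`, `area`, and `perimeter` instead. The example values come from the first Tungabhadra Lake row printed later in this notebook.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class WaterRecord:
    """One observation of a water body (class and field names are hypothetical)."""

    asset_id: str        # Asset ID: unique identifier of the water body
    recorded_at: date    # Recorded At: date the observation refers to
    area_km2: float      # Area in km²
    perimeter_km: float  # Perimeter in km
    name: str            # Name of the water body

    def __post_init__(self) -> None:
        # Area and perimeter are physical measurements and cannot be negative.
        if self.area_km2 < 0 or self.perimeter_km < 0:
            raise ValueError("area and perimeter must be non-negative")


# Values taken from the first Tungabhadra Lake row in this notebook's output.
record = WaterRecord(
    asset_id="b63a9300-c5c4-4f5d-b0f3-0c1edad76bde",
    recorded_at=date(2018, 1, 2),
    area_km2=198.69,
    perimeter_km=3.32,
    name="Tungabhadra Lake",
)
print(record)
```

Making the record frozen and validating it on construction is one possible design choice; it catches obviously invalid measurements before they reach any plot or summary.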

In [None]:
import pandas as pd
import requests
import urllib.parse
import plotly.express as px
from IPython.display import display

# Step 1: Set the API key (hard-coded here; prefer an environment variable in shared notebooks)
API_KEY = "54ef4248-dda2-52a3-87f1-7dbca0b71100"

# Step 2: Define Asset IDs and Names
assets_info = {
    "b63a9300-c5c4-4f5d-b0f3-0c1edad76bde": "Tungabhadra Lake",
    "811b9803-dbb2-474e-964c-11ed7ad9d36a": "Sriram Sagar Lake",
    "31c6f1d9-3adc-4eab-8e33-e2d00cc780ff": "Nath Sagar Lake",
    "5623361f-dd29-44ad-aeef-2616172fdeca": "Govind Vallabh Pant Sagar",
    "421cfb73-5ff0-40e4-bf89-380a3604b1ea": "Red Hills Reservoir",
    "4d9c30fd-8a91-4421-8089-c525b2c24133": "Manchar Lake",
    "dbe69056-6a63-4b64-b848-bb683afff678": "Panta de Sau",
    "246e7438-8473-44b6-9581-5ae8d2ec60a5": "Lake Mead",
    "ae096c2b-8231-4d79-9634-0f97ef4c93cc": "Lake Oroville",
    "7615d8f5-fedb-4cd7-8e0b-28418cb7f04f": "Folsom Lake",
    "5283bf5b-5148-43e3-ba06-607a07edac15": "Theewaterskloof Dam",
}

# Step 3: Fetch Data and Conduct Initial Exploratory Data Analysis
for asset_id, asset_name in assets_info.items():
    # API endpoint or url
    API = f"https://gateway.blueskyhq.io/api/water-quantification/{asset_id}"

    # Define date range
    START_DATE = "2018-01-01T00:00:00.000Z"
    END_DATE = "2022-12-31T23:59:59.999Z"
    TIME_BUCKET = "1d"

    # Request parameters
    PARAMS = {
        "api-key": API_KEY,
        "start_date": START_DATE,
        "end_date": END_DATE,
        "time_bucket": TIME_BUCKET,
    }
    HEADERS = {
        "Content-Type": "application/json",
    }

    # API request (requests URL-encodes a params dict itself)
    response = requests.get(API, headers=HEADERS, params=PARAMS)

    if response.status_code == 200:
        # Convert response json to DataFrame
        df = pd.DataFrame(response.json()["data"])
        # Datetime string to datetime object
        df["datetime"] = pd.to_datetime(df["datetime"])

        # Initial Exploratory Data Analysis
        print(f"Initial Exploratory Data Analysis for {asset_name}")
        display(df.head())  # Display first few rows for exploration

        # Forward-fill null area values. This does not add rows for dates
        # that are absent from the response.
        df["area"] = df["area"].ffill()

        # Plot interactive time series graph with Plotly Express
        fig = px.line(df, x="datetime", y="area", title=f"Time Series Graph for {asset_name}")
        fig.update_xaxes(title_text="Date")
        fig.update_yaxes(title_text="Surface Area (km²)")
        fig.show()

    else:
        print(response.text)




Initial Exploratory Data Analysis for Tungabhadra Lake


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-01-02 00:00:00+00:00 | 198.69 | 3.32 |
| 1 | 2018-01-07 00:00:00+00:00 | 190.52 | 3.36 |
| 2 | 2018-01-17 00:00:00+00:00 | 163.02 | 3.16 |
| 3 | 2018-01-22 00:00:00+00:00 | 150.46 | 2.99 |
| 4 | 2018-02-26 00:00:00+00:00 | 81.44 | 2.12 |


Initial Exploratory Data Analysis for Sriram Sagar Lake


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-01-17 00:00:00+00:00 | 221.89 | 7.24 |
| 1 | 2018-01-22 00:00:00+00:00 | 200.27 | 5.89 |
| 2 | 2018-02-26 00:00:00+00:00 | 145.33 | 4.02 |
| 3 | 2018-09-29 00:00:00+00:00 | 276.82 | 4.42 |
| 4 | 2018-11-28 00:00:00+00:00 | 177.79 | 4.26 |


Initial Exploratory Data Analysis for Nath Sagar Lake


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-01-20 00:00:00+00:00 | 308.74 | 3.03 |
| 1 | 2018-01-30 00:00:00+00:00 | 304.52 | 3.04 |
| 2 | 2018-02-09 00:00:00+00:00 | 298.41 | 3.07 |
| 3 | 2018-02-19 00:00:00+00:00 | 294.46 | 2.98 |
| 4 | 2018-04-10 00:00:00+00:00 | 259.61 | 2.92 |


Initial Exploratory Data Analysis for Govind Vallabh Pant Sagar


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-01-11 00:00:00+00:00 | 385.38 | 538.87 |
| 1 | 2018-01-16 00:00:00+00:00 | 335.41 | 396.91 |
| 2 | 2018-01-31 00:00:00+00:00 | 358.29 | 526.3 |
| 3 | 2018-02-15 00:00:00+00:00 | 310.77 | 370.01 |
| 4 | 2018-03-02 00:00:00+00:00 | 343.8 | 588.15 |


Initial Exploratory Data Analysis for Red Hills Reservoir


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-02-25 00:00:00+00:00 | 13.51 | 0.29 |
| 1 | 2018-03-02 00:00:00+00:00 | 13.61 | 0.3 |
| 2 | 2018-09-13 00:00:00+00:00 | 9.29 | 0.24 |
| 3 | 2019-02-05 00:00:00+00:00 | 9.55 | 0.25 |
| 4 | 2019-02-15 00:00:00+00:00 | 8.93 | 0.22 |


Initial Exploratory Data Analysis for Manchar Lake


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2020-01-09 00:00:00+00:00 | 237.06 | 138.05 |
| 1 | 2020-01-17 00:00:00+00:00 | 236.3 | 140.26 |
| 2 | 2020-01-24 00:00:00+00:00 | 237.6 | 132.19 |
| 3 | 2020-01-27 00:00:00+00:00 | 233.59 | 130.17 |
| 4 | 2020-01-29 00:00:00+00:00 | 236.77 | 135.85 |


Initial Exploratory Data Analysis for Panta de Sau


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2019-04-13 00:00:00+00:00 | 3.67 | 32.45 |
| 1 | 2019-08-21 00:00:00+00:00 | 3.64 | 31.91 |
| 2 | 2019-08-26 00:00:00+00:00 | 3.64 | 31.99 |
| 3 | 2020-08-20 00:00:00+00:00 | 3.84 | 32.88 |
| 4 | 2020-08-25 00:00:00+00:00 | 3.85 | 32.81 |


Initial Exploratory Data Analysis for Lake Mead


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-01-21 00:00:00+00:00 | 325.11 | 603.08 |
| 1 | 2018-02-05 00:00:00+00:00 | 336.64 | 611.09 |
| 2 | 2018-04-21 00:00:00+00:00 | 331.63 | 609.56 |
| 3 | 2018-06-25 00:00:00+00:00 | 316.35 | 642.38 |
| 4 | 2018-06-30 00:00:00+00:00 | 299.81 | 594.1 |


Initial Exploratory Data Analysis for Lake Oroville


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-01-20 00:00:00+00:00 | 26.79 | 3.25 |
| 1 | 2018-03-11 00:00:00+00:00 | 31.92 | 5.22 |
| 2 | 2018-03-26 00:00:00+00:00 | 38.72 | 3.59 |
| 3 | 2018-04-20 00:00:00+00:00 | 42.12 | 3.87 |
| 4 | 2018-04-25 00:00:00+00:00 | 43.53 | 4.65 |


Initial Exploratory Data Analysis for Folsom Lake


| | datetime | area | perimeter |
|---|---|---|---|
| 0 | 2018-01-20 00:00:00+00:00 | 30.34 | 2.08 |
| 1 | 2018-02-19 00:00:00+00:00 | 33.37 | 5.25 |
| 2 | 2018-03-11 00:00:00+00:00 | 30.33 | 1.72 |
| 3 | 2018-03-26 00:00:00+00:00 | 39.96 | 1.78 |
| 4 | 2018-04-20 00:00:00+00:00 | 37.74 | 1.69 |


{"message":"Unauthorized"}
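
The previews above show that observations arrive at irregular intervals even though the request uses a daily time bucket, so filling null values does not cover dates that are missing from the response. As a rough sketch of how the gaps could be measured, the snippet below computes the number of days between consecutive observations. The helper `gaps_in_days` and the 30-day threshold are illustrative assumptions, and the dates are copied from the Tungabhadra Lake preview.

```python
from datetime import date

# Observation dates copied from the Tungabhadra Lake preview above.
observed = [
    date(2018, 1, 2),
    date(2018, 1, 7),
    date(2018, 1, 17),
    date(2018, 1, 22),
    date(2018, 2, 26),
]


def gaps_in_days(dates):
    """Return the number of days between each pair of consecutive dates."""
    ordered = sorted(dates)
    return [(later - earlier).days for earlier, later in zip(ordered, ordered[1:])]


gaps = gaps_in_days(observed)
print(gaps)  # [5, 10, 5, 35]

# Flag gaps longer than a threshold; 30 days is an arbitrary illustrative choice.
MAX_GAP_DAYS = 30
long_gaps = [g for g in gaps if g > MAX_GAP_DAYS]
print(long_gaps)  # [35]
```

On the full DataFrame, `df["datetime"].diff()` would give the same gaps as timedeltas, which is useful to inspect before interpreting any apparent trend in the time series plots.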


Analysis: