## Sea Level Predictor

You will analyze a dataset of the global average sea level change since 1880. You will use the data to predict the sea level change through the year 2050.

Use the data to complete the following tasks:

- Use Pandas to import the data from epa-sea-level.csv.
- Use matplotlib to create a scatter plot using the "Year" column as the x-axis and the "CSIRO Adjusted Sea Level" column as the y-axis.

- Use the linregress function from scipy.stats to get the slope and y-intercept of the line of best fit.
- Plot the line of best fit over the top of the scatter plot. Make the line go through the year 2050 to predict the sea level rise in 2050.

- Plot a new line of best fit just using the data from year 2000 through the most recent year in the dataset. Make the line also go through the year 2050 to predict the sea level rise in 2050 if the rate of rise continues as it has since the year 2000.

- The x label should be "Year", the y label should be "Sea Level (inches)", and the title should be "Rise in Sea Level".
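
As a minimal sketch of the fit-and-extrapolate step described above, the snippet below runs `linregress` on a small synthetic dataset (made-up values, not taken from epa-sea-level.csv) and extends the fitted line through 2050. The names `years`, `levels`, `predict`, and `years_extended` are illustrative assumptions.

```python
import numpy as np
from scipy.stats import linregress

# Synthetic, perfectly linear data (NOT from epa-sea-level.csv):
# level = 0.5 * (year - 2000) + 1.0
years = np.arange(2000, 2011)
levels = 0.5 * (years - 2000) + 1.0

# linregress returns a result object exposing .slope and .intercept
fit = linregress(years, levels)


def predict(year):
    """Evaluate the fitted line at the given year (hypothetical helper)."""
    return fit.slope * year + fit.intercept


# np.arange excludes its stop value, so stop at 2051 to include 2050
years_extended = np.arange(2000, 2051)
line = fit.slope * years_extended + fit.intercept
```

Note that the stop value is 2051 rather than 2050: `np.arange` leaves out its endpoint, so stopping at 2050 would end the line at 2049.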

In [None]:
import pandas as pd
import matplotlib.pyplot as plt
import numpy as np
from scipy.stats import linregress

def draw_plot():
    # Read data from file
    df = pd.read_csv('epa-sea-level.csv')
    df.Year = df.Year.astype(float)
    # Create scatter plot
    fig = plt.figure(figsize=(8, 6))
    ax = fig.add_subplot(111)

    ax.scatter(x='Year', y='CSIRO Adjusted Sea Level', data=df)

    # Create first line of best fit
    x = df['Year']
    y = df['CSIRO Adjusted Sea Level']
    slope, intercept, r_value, p_value, std_err = linregress(x, y)
    # np.arange excludes the stop value, so use 2051 to include the year 2050
    years_extended = np.arange(1880, 2051, 1)
    line = [slope * xi + intercept for xi in years_extended]

    plt.plot(years_extended, line, c='r')
    plt.xticks(np.arange(1850.0, 2100.0, 25.0))
    plt.yticks(range(0,16, 2))

    # Create second line of best fit
    df2 = df[df.Year >= 2000]
    x = df2['Year']
    y = df2['CSIRO Adjusted Sea Level']
    slope, intercept, r_value, p_value, std_err = linregress(x, y)
    # np.arange excludes the stop value, so use 2051 to include the year 2050
    years_extended = np.arange(2000, 2051, 1)
    line = [slope * xi + intercept for xi in years_extended]

    # Use a different color so the two trend lines can be told apart
    plt.plot(years_extended, line, c='g')

    # Add labels and title
    ax.set_title('Rise in Sea Level')
    ax.set_xlabel('Year')
    ax.set_ylabel('Sea Level (inches)')
    
    # Save plot and return data for testing
    plt.savefig('sea_level_plot.png')
    return plt.gca()
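
Because the function returns the current Axes "for testing", a caller can read the plotted lines back out of it. The sketch below shows one way this might be done on a stand-in plot. The line coefficients are made up for illustration, and the `Agg` backend is an assumption chosen so the example runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; an assumption for headless runs
import matplotlib.pyplot as plt
import numpy as np

# Stand-in for the Axes returned by draw_plot(); coefficients are made up
fig, ax = plt.subplots()
xs = np.arange(1880, 2051)
ax.plot(xs, 0.06 * xs - 119.0, c='r')

# Axes.get_lines() returns the Line2D objects in the order they were plotted
first_line = ax.get_lines()[0]
x_data = first_line.get_xdata()
y_data = first_line.get_ydata()

# The last x value shows how far the line was extended
last_year = x_data[-1]
plt.close(fig)
```

Checking `last_year` this way is a quick sanity test that a trend line really reaches 2050.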