# 10.16 Intro to Data Science: Time Series and Simple Linear Regression 

**This file includes the Self Check snippets which continue from the section body.**

### Time Series
### Simple Linear Regression
### Linear Relationships

In [None]:
%matplotlib inline
c = lambda f: 5 / 9 * (f - 32)

In [None]:
temps = [(f, c(f)) for f in range(0, 101, 10)]

In [None]:
import pandas as pd

In [None]:
temps_df = pd.DataFrame(temps, columns=['Fahrenheit', 'Celsius'])

In [None]:
axes = temps_df.plot(x='Fahrenheit', y='Celsius', style='.-')

y_label = axes.set_ylabel('Celsius')

In [None]:
# Extra cell added to keep subsequent snippet numbers the same as the chapter.
# Had to merge the two prior cells for use in the notebook.

### Components of the Simple Linear Regression Equation 
### SciPy’s `stats` Module
### Pandas
### Seaborn Visualization
### Getting Weather Data from NOAA


### Loading the Average High Temperatures into a `DataFrame` 

In [None]:
nyc = pd.read_csv('ave_hi_nyc_jan_1895-2018.csv')

In [None]:
nyc.head()

In [None]:
nyc.tail()

### Cleaning the Data

In [None]:
nyc.columns = ['Date', 'Temperature', 'Anomaly']

In [None]:
nyc.head(3)

In [None]:
nyc.Date.dtype

In [None]:
nyc.Date = nyc.Date.floordiv(100)

In [None]:
nyc.head(3)
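`Series.floordiv(100)` performs element-wise floor division, the same operation as Python's `//` operator. The sketch below shows that arithmetic in plain Python on a few made-up values in `YYYYMM` form. That encoding is an assumption about the raw `Date` column; the actual values come from the CSV file.

```python
# Hypothetical raw date values in YYYYMM form (assumed encoding).
raw_dates = [189501, 189601, 201801]

# Floor division by 100 drops the last two digits (the month).
years = [date // 100 for date in raw_dates]

print(years)  # [1895, 1896, 2018]
```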

### Calculating Basic Descriptive Statistics for the Dataset

In [None]:
pd.set_option('display.precision', 2)

In [None]:
nyc.Temperature.describe()

### Forecasting Future January Average High Temperatures

In [None]:
from scipy import stats

In [None]:
linear_regression = stats.linregress(x=nyc.Date,
                                     y=nyc.Temperature)

In [None]:
linear_regression.slope

In [None]:
linear_regression.intercept

In [None]:
linear_regression.slope * 2019 + linear_regression.intercept

In [None]:
linear_regression.slope * 1850 + linear_regression.intercept

### Plotting the Average High Temperatures and a Regression Line 

In [None]:
import seaborn as sns

In [None]:
sns.set_style('whitegrid')

In [None]:
axes = sns.regplot(x=nyc.Date, y=nyc.Temperature)

axes.set_ylim(10, 70)

In [None]:
# Extra cell added to keep subsequent snippet numbers the same as the chapter.
# Had to merge the two prior cells for use in the notebook.

### Getting Time Series Datasets

![Self Check Exercises check mark image](files/art/check.png)
# 10.16 Self Check

**1. _(Fill-In)_**  Time series `________` looks at existing time series data for patterns, helping data analysts understand the data. Time series `________` uses data from the past to predict the future. 

**Answer:** analysis, forecasting.

**2. _(True/False)_** In the formula `c = 5 / 9 * (f - 32)`, `f` (the Fahrenheit temperature) is the independent variable and `c` (the Celsius temperature) is the dependent variable.

**Answer:** True. 
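The linear relationship in this question can be checked numerically. The following sketch fits a least-squares line to the same Fahrenheit/Celsius pairs used earlier in this section, using plain Python rather than SciPy; the helper name `fit_line` is hypothetical. Because the points lie exactly on a line, the fit should recover a slope of 5/9 and an intercept of -160/9, up to floating-point rounding.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = sum of cross-deviations / sum of squared x-deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

fahrenheit = list(range(0, 101, 10))              # independent variable
celsius = [5 / 9 * (f - 32) for f in fahrenheit]  # dependent variable

fit_slope, fit_intercept = fit_line(fahrenheit, celsius)
print(fit_slope, fit_intercept)
```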

**3. _(IPython Session)_** Based on the slope and intercept values calculated in this section’s interactive session, in what year might the average January high temperature in New York City reach 40 degrees Fahrenheit?

**Answer:** 

In [None]:
year = 2019

In [None]:
slope = linear_regression.slope

In [None]:
intercept = linear_regression.intercept

In [None]:
temperature = slope * year + intercept

In [None]:
while temperature < 40.0:
    year += 1
    temperature = slope * year + intercept

In [None]:
year
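Because the regression line is straight, the loop's result can also be computed directly by solving `slope * year + intercept = 40` for `year`. The sketch below assumes a positive slope and uses made-up placeholder coefficients (`demo_slope` and `demo_intercept`, not the values computed from the NOAA data) so that it runs without the CSV file. The function name `first_year_at_or_above` is hypothetical.

```python
import math

def first_year_at_or_above(target, slope, intercept, start_year):
    """Return the first year >= start_year whose predicted value reaches target.

    Assumes slope > 0; with a zero or negative slope the line does not rise
    toward a higher target, and a search loop might never terminate.
    """
    if slope <= 0:
        raise ValueError('slope must be positive')
    # Solve slope * year + intercept = target, then round up to a whole year.
    exact_year = (target - intercept) / slope
    return max(start_year, math.ceil(exact_year))

# Placeholder coefficients for illustration only (assumed, not from the data).
demo_slope = 0.015
demo_intercept = 8.7

year_closed_form = first_year_at_or_above(40.0, demo_slope, demo_intercept, 2019)

# Cross-check with the same kind of loop used in the answer.
year_loop = 2019
while demo_slope * year_loop + demo_intercept < 40.0:
    year_loop += 1

print(year_closed_form, year_loop)
```

With the real coefficients, passing `linear_regression.slope` and `linear_regression.intercept` to the function should give the same year as the loop.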

In [None]:
##########################################################################
# (C) Copyright 2019 by Deitel & Associates, Inc. and                    #
# Pearson Education, Inc. All Rights Reserved.                           #
#                                                                        #
# DISCLAIMER: The authors and publisher of this book have used their     #
# best efforts in preparing the book. These efforts include the          #
# development, research, and testing of the theories and programs        #
# to determine their effectiveness. The authors and publisher make       #
# no warranty of any kind, expressed or implied, with regard to these    #
# programs or to the documentation contained in these books. The authors #
# and publisher shall not be liable in any event for incidental or       #
# consequential damages in connection with, or arising out of, the       #
# furnishing, performance, or use of these programs.                     #
##########################################################################
