### The simple linear regression model

To understand why these numbers are so important, we first have to understand the general form of linear regression, or more specifically, simple linear regression.  We have a simple linear regression model whenever our model takes the following form.

$$y = mx + b$$

These `coef_` and `intercept_` numbers show that our model is of that form.  Specifically, we can plug these numbers in like so.

$$tshirt\_sales = .38*ad\_spend + 153.26$$

Lining our example up with the simple linear regression formula, we see that:

* $y$ corresponds to `tshirt_sales`
* $x$ corresponds to `ad_spend`
* $m$ is .38
* $b$ is 153.26
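
As a minimal sketch of plugging these numbers into the formula, the snippet below hard-codes the coefficient and intercept from above. The function name `predict_tshirt_sales` is our own illustration, not part of any library.

```python
# Coefficient (m) and intercept (b) taken from the model above.
m = 0.38
b = 153.26

def predict_tshirt_sales(ad_spend):
    """Return predicted T-shirt sales using y = mx + b."""
    return m * ad_spend + b

print(round(predict_tshirt_sales(1000), 2))  # 533.26
```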

Now let's better understand the $y$, $m$ and $x$ components of simple linear regression, as these are the most important parts.  We'll discuss the $b$ component later.

### Dependent and Independent Variables

The way to understand $y = mx$ is to think of $x$ as the input and $y$ as the output.  So in our T-shirt example, we input advertising dollars spent, and the output is the T-shirt sales.

In linear regression, $y$ is called our **dependent variable**, as the output changes *depending* on the input.  Here our dependent variable is T-shirt sales, as it *depends* on how much we spend on advertising.

In linear regression, $x$ is called the **independent variable**, as it does not depend on anything in our model.  We can plug in any value for $x$ to get an output of $y$.  In our example, advertising spending is our independent variable.
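
To make the input and output idea concrete, here is a small sketch using the model without an intercept, $y = mx$. The spending amounts are arbitrary inputs chosen for illustration.

```python
m = 0.38  # coefficient from our model

# Independent variable: we are free to choose any ad spending amounts.
ad_spends = [1000, 1100, 1200]

# Dependent variable: predicted sales depend on the chosen input.
predicted_sales = [round(m * x, 2) for x in ad_spends]

print(predicted_sales)  # [380.0, 418.0, 456.0]
```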

Notice that adding an intercept of 100 affects every prediction.  Every prediction our model previously made also increases by 100.  So when we spend 1000 on advertising, we now predict sales of 480 instead of 380.  And when we spend 1100, we predict sales of 518 instead of 418.
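
The constant shift can be checked with a short sketch that compares the model without an intercept to the same model with $b = 100$. The helper name `predict` is hypothetical.

```python
def predict(ad_spend, m=0.38, b=0):
    """Predicted T-shirt sales for a line with slope m and intercept b."""
    return m * ad_spend + b

for ad_spend in [1000, 1100]:
    without_b = predict(ad_spend)
    with_b = predict(ad_spend, b=100)
    # The gap between the two predictions equals b, whatever the input.
    print(ad_spend, round(without_b), round(with_b), round(with_b - without_b))
```

Each printed row shows the ad spending, the prediction without the intercept, the prediction with it, and the difference between the two.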

Now let's plot the models $tshirt\_sales = .38*ad\_spend$ and $tshirt\_sales = .38*ad\_spend + 100$ side by side.

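```python
import plotly.plotly as py

# Hypothesis, inputs, data_trace, first_hyp, and layout are assumed to be defined earlier in the course.
third_hyp = Hypothesis(.38, 100, inputs)
py.plot([data_trace, first_hyp.trace(mode = 'lines'), third_hyp.trace(mode = 'lines')],  layout = layout, auto_open = True)
```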


Notice that this matches what we said above.  Including the value of $b$ increases our predicted T-shirt sales by the same amount, the value of $b$ (here 100), for every input.  And that's what we see.

This is different from changing our slope $m$, which changes the steepness of the line.  
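
A quick numeric sketch of the difference: the code below measures how much the prediction rises between two inputs. The alternative slope of 0.5 is an arbitrary value picked for comparison, and `rise` is a hypothetical helper.

```python
def rise(m, b, x_start=1000, x_end=1100):
    """Change in predicted output between two inputs for the line y = mx + b."""
    return (m * x_end + b) - (m * x_start + b)

print(round(rise(0.38, 0), 2))    # 38.0
print(round(rise(0.38, 100), 2))  # 38.0: a new intercept leaves the steepness alone
print(round(rise(0.5, 0), 2))     # 50.0: a larger slope makes the line steeper
```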

Try changing the slopes and intercepts in the code above to see how the line changes, or feel free to add another line with a different slope and intercept.

### Summary

In this lesson, we learned about the simple linear regression model.  A simple linear model has one input and one output.  The input is called the independent variable, $x$, and the output is called the dependent variable, $y$.  In our example above, T-shirt sales is the dependent variable and advertising spending is the independent variable.  This makes sense, as the T-shirt sales predicted by our model *depend* on the advertising spending.  The advertising spending does not depend on anything in our model, so it's independent.

We also discussed our coefficient, which is the number we multiply our independent variable by.  It is represented by $m$ in the formula $y = mx$.  We can interpret our coefficient as the impact that our independent variable $x$ has on our dependent variable $y$.  So in the model $tshirt\_sales = .38*ad\_spend$, our model predicts that a one dollar increase in ad spending increases predicted sales by .38.  The value of $m$ also determines the slope of our line: the further $m$ is from zero, the steeper the slope.
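
One way to see why $m$ is the change per one dollar increase is to compare the prediction at $x + 1$ with the prediction at $x$. This short derivation uses only the general form $y = mx + b$:

$$(m(x + 1) + b) - (mx + b) = m$$

With $m = .38$, each extra dollar of ad spending adds .38 to predicted sales, regardless of the starting value of $x$ or the value of $b$.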

Finally, we saw another component of our line, the intercept.  The intercept is the predicted output when our independent variable is zero.  So in our model of $tshirt\_sales = .38*ad\_spend + 153.26$, we say that even when the ad spending is zero, we expect sales of 153.  We also noticed that the intercept increases our predicted output by that value (here 153) for every input value.  So where we previously predicted an output of 380 with spending of 1000 dollars, the updated formula predicts 380 + 153 = 533.  And where we previously predicted sales of 418 with spending of 1100, the updated formula predicts 418 + 153 = 571.
