# Notes
* Linear regression with multiple input features is a direct extension of the single-feature case. Where a regression of $y$ on $x$ has the formula $\vec{y}=\beta_1 \vec{x} + \beta_0$, with multiple input features $x_1, x_2, x_3$ the formula becomes $\vec{y}=\beta_1\vec{x_1}+\beta_2\vec{x_2}+\beta_3\vec{x_3}+\beta_0$
* LASSO loss function is $\Sigma(\mathrm{residuals})^2+\alpha\Sigma|\beta|$, where $\alpha \ge 0$ sets the strength of the penalty
* The only difference between Ridge and LASSO regression is the regularization term: LASSO penalizes the absolute values of the coefficients ($\Sigma|\beta|$), whereas Ridge penalizes their squares ($\Sigma\beta^2$). This has a large impact on the model. Ridge shrinks large coefficients $\beta$ but does not set them exactly to zero; LASSO can set the coefficients of irrelevant features exactly to zero. You might therefore end up with fewer features in the model than you started with, which makes the model simpler and easier to interpret.
* When you set ```alpha = 0``` in LASSO, the penalty term vanishes and the model is equivalent to ordinary least squares (OLS) regression
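
The notes above can be checked with a minimal sketch on the same breast cancer data. Several choices here are assumptions made only for illustration and are not part of the original notebook: the features are standardized with `StandardScaler`, `Ridge` is fitted for comparison, `alpha=0.05` is an arbitrary untuned value, and `alpha=1e-4` stands in for "alpha close to 0". The binary target is treated as a regression target, as in the rest of these notes.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import Lasso, LinearRegression, Ridge
from sklearn.preprocessing import StandardScaler

# Load the data and standardize the features so that one penalty
# strength treats every coefficient on a comparable scale (assumption).
data = load_breast_cancer()
X = StandardScaler().fit_transform(data.data)
y = data.target

# alpha=0.05 is an arbitrary illustrative value, not a tuned one.
lasso = Lasso(alpha=0.05).fit(X, y)
ridge = Ridge(alpha=0.05).fit(X, y)
ols = LinearRegression().fit(X, y)

# Count coefficients that are exactly zero in each penalized model.
n_features = X.shape[1]
lasso_zeros = int(np.sum(lasso.coef_ == 0))
ridge_zeros = int(np.sum(ridge.coef_ == 0))
print(f"LASSO zeroed {lasso_zeros} of {n_features} coefficients")
print(f"Ridge zeroed {ridge_zeros} of {n_features} coefficients")

# With a very small alpha the LASSO training fit moves toward the OLS fit.
# max_iter is raised because weak penalties converge more slowly.
lasso_small = Lasso(alpha=1e-4, max_iter=100_000).fit(X, y)
print(f"R^2 LASSO (alpha=0.05): {lasso.score(X, y):.3f}")
print(f"R^2 LASSO (alpha=1e-4): {lasso_small.score(X, y):.3f}")
print(f"R^2 OLS:                {ols.score(X, y):.3f}")
```

The training $R^2$ of OLS is an upper bound for any LASSO fit on the same data, so the three scores should appear in increasing order. To get an unpenalized fit in practice, it is probably better to use `LinearRegression` directly than to pass `alpha=0` to `Lasso`.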

In [2]:
import math 
import matplotlib.pyplot as plt 
import pandas as pd
import numpy as np
from sklearn.linear_model import Lasso, LinearRegression
from sklearn.datasets import load_breast_cancer

In [10]:
cancer = load_breast_cancer()

X = cancer.data
Y = cancer.target

cancer_df = pd.DataFrame(cancer.data, columns=cancer.feature_names)
cancer_df['outcome'] = Y

     mean radius  mean texture  mean perimeter  mean area  mean smoothness  \
0          17.99         10.38          122.80     1001.0          0.11840   
1          20.57         17.77          132.90     1326.0          0.08474   
2          19.69         21.25          130.00     1203.0          0.10960   
3          11.42         20.38           77.58      386.1          0.14250   
4          20.29         14.34          135.10     1297.0          0.10030   
..           ...           ...             ...        ...              ...   
564        21.56         22.39          142.00     1479.0          0.11100   
565        20.13         28.25          131.20     1261.0          0.09780   
566        16.60         28.08          108.30      858.1          0.08455   
567        20.60         29.33          140.10     1265.0          0.11780   
568         7.76         24.54           47.92      181.0          0.05263   

     mean compactness  mean concavity  mean concave points  mea