# Linear Regression
You should build a machine learning pipeline using a linear regression model. In particular, you should do the following:
- Load the `housing` dataset using [Pandas](https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html). You can find this dataset in the datasets folder.
- Split the dataset into training and test sets using [Scikit-Learn](https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html). 
- Train and test a linear regression model using [Scikit-Learn](https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LinearRegression.html).
- Check the documentation to identify the most important hyperparameters, attributes, and methods of the model. Use them in practice.
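
As a rough illustration of the last point, the sketch below exercises one hyperparameter (`fit_intercept`), two fitted attributes (`coef_`, `intercept_`), and three methods (`fit`, `predict`, `score`) of `LinearRegression`. It uses small synthetic data, which is an assumption made for the example and not the housing dataset.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic, noise-free data (an assumption for illustration): y = 3*x0 - 2*x1 + 5.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(100, 2))
y_demo = 3.0 * X_demo[:, 0] - 2.0 * X_demo[:, 1] + 5.0

# Hyperparameters are passed to the constructor; fit_intercept=True is the default.
demo_model = LinearRegression(fit_intercept=True)

# Methods: fit learns the weights, predict applies them, score returns R^2.
demo_model.fit(X_demo, y_demo)
demo_predictions = demo_model.predict(X_demo[:3])
demo_r2 = demo_model.score(X_demo, y_demo)

# Attributes learned during fit carry a trailing underscore.
print("coef_:", demo_model.coef_)
print("intercept_:", demo_model.intercept_)
print("R^2:", demo_r2)
```

Because the synthetic target is an exact linear function of the inputs, the fitted coefficients and intercept should recover 3, -2, and 5 up to floating-point error.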

## Importing the Libraries


In [None]:
import pandas as pd
import sklearn.model_selection
import sklearn.linear_model
import sklearn.metrics

## Loading the Dataset


In [None]:
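# Load the housing data into a DataFrame.
df = pd.read_csv('housing.csv')
# 'id' stays a regular column here; it is dropped later when the features are built.
# (A bare df.set_index('id') returns a new DataFrame and would leave df unchanged.)
df.head()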

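|   | id | price | area | bedrooms | bathrooms | stories | stories.1 | guestroom | basement | hotwaterheating | airconditioning | parking | prefarea | furnishingstatus |
|---|----|-------|------|----------|-----------|---------|-----------|-----------|----------|-----------------|-----------------|---------|----------|------------------|
| 0 | 0 | 13300000 | 7420 | 4 | 2 | 3 | 1 | 0 | 0 | 0 | 1 | 2 | 1 | 1.0 |
| 1 | 1 | 12250000 | 8960 | 4 | 4 | 4 | 1 | 0 | 0 | 0 | 1 | 3 | 0 | 1.0 |
| 2 | 2 | 12250000 | 9960 | 3 | 2 | 2 | 1 | 0 | 1 | 0 | 0 | 2 | 1 | 0.5 |
| 3 | 3 | 12215000 | 7500 | 4 | 2 | 2 | 1 | 0 | 1 | 0 | 1 | 3 | 1 | 1.0 |
| 4 | 4 | 11410000 | 7420 | 4 | 1 | 2 | 1 | 1 | 1 | 0 | 1 | 2 | 0 | 1.0 |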


## Splitting the Dataset into Training and Test Sets


In [None]:
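# Features: every column except the identifier and the target.
# Leaving 'price' in x leaks the target into the features; the near-zero MSE
# recorded at the end of this notebook comes from a run where x still contained 'price'.
x = df.drop(['id', 'price'], axis=1)
y = df['price']
x_train, x_test, y_train, y_test = sklearn.model_selection.train_test_split(x, y)

print("df:", df.shape)
print("x_train:", x_train.shape)
print("x_test:", x_test.shape)
print("y_train:", y_train.shape)
print("y_test:", y_test.shape)

df: (545, 14)
x_train: (408, 12)
x_test: (137, 12)
y_train: (408,)
y_test: (137,)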
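
The split above relies on the defaults of `train_test_split`, so the rows that land in each set change from run to run. A minimal sketch of a reproducible split follows; the stand-in DataFrame, its column values, and the seed `42` are assumptions chosen for illustration, with only the row count (545) taken from the output above.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Stand-in frame with 545 rows; the values are placeholders, not the real housing data.
rng = np.random.default_rng(0)
demo_df = pd.DataFrame({
    "area": rng.integers(1000, 10000, size=545),
    "bedrooms": rng.integers(1, 6, size=545),
})
demo_df["price"] = 500 * demo_df["area"] + 100000 * demo_df["bedrooms"]

demo_features = demo_df.drop(columns=["price"])  # the target is excluded from the features
demo_target = demo_df["price"]

# test_size=0.25 makes the default explicit; random_state fixes the shuffle.
demo_x_train, demo_x_test, demo_y_train, demo_y_test = train_test_split(
    demo_features, demo_target, test_size=0.25, random_state=42
)
print(demo_x_train.shape, demo_x_test.shape)  # (408, 2) (137, 2)
```

With a fixed `random_state`, rerunning the cell yields the same training and test rows, which makes later metric values comparable between runs.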


## Training the Model

In [None]:
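# Ordinary least squares with the default hyperparameters (fit_intercept=True).
model = sklearn.linear_model.LinearRegression()
# Learn the coefficients and intercept from the training set.
model.fit(x_train, y_train)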

## Testing the Model

In [None]:
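# Predict prices for the held-out test set.
y_predicted = model.predict(x_test)
# Mean squared error on the test set, in squared price units.
mse = sklearn.metrics.mean_squared_error(y_test, y_predicted)
mse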

1.2108243240166583e-19
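
An MSE this close to zero on prices in the millions is what a linear model produces when the target column is also present among the features. The sketch below reproduces that effect on synthetic data (an assumption; the names `holdout_mse`, `X_honest`, and `X_leaky` are hypothetical and not part of the notebook) and compares it with a leakage-free fit.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

# Synthetic data (an assumption): price depends on area plus substantial noise.
rng = np.random.default_rng(0)
area = rng.uniform(1000, 10000, size=500)
price = 1000 * area + rng.normal(0, 500_000, size=500)

X_honest = area.reshape(-1, 1)             # features only
X_leaky = np.column_stack([area, price])   # target copied into the features

def holdout_mse(X, y):
    """Fit on a training split and return the MSE on the held-out split."""
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
    fitted = LinearRegression().fit(X_tr, y_tr)
    return mean_squared_error(y_te, fitted.predict(X_te))

mse_honest = holdout_mse(X_honest, price)
mse_leaky = holdout_mse(X_leaky, price)

print("MSE without leakage:", mse_honest)
print("MSE with leakage:", mse_leaky)
# RMSE is in the same units as the target, which is easier to interpret.
print("RMSE without leakage:", np.sqrt(mse_honest))
```

The leaky fit can simply copy the target column, so its error collapses to floating-point noise, while the honest fit keeps an error on the order of the noise variance.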