A small program to implement multiple linear regression with Sklearn library and on my own.
For this example I used a data set on the cost of houses for 1500 rows


- One-hot encoding for categorial features
- Divide the sample randomly into train and test data (test - 25% of the data)
- Fill nan - in numerical features from the sample train
- Standard scaling by Train sample
Train mse:
19306.85068414516
Test mse:
3.3287891454200028e+16
Train MSE GD_Regressor: 197151.96183913611
Test MSE GD_Regressor: 198865.79451044145



