Tale-O-Regression

Understanding regression analysis.

Introduction

Regression models are used to predict target variables on a continuous scale, which makes them attractive for addressing many questions in science as well as applications in industry, such as understanding relationships between variables, valuating trends, or making forecasts.

Our Approach

• Exploring and visualizing datasets
• Looking at different approaches to implement linear regression models
• Training regression models that are robust to outliers
• Evaluating regression models and diagnosing common problems
• Fitting regression models to nonlinear data

Dataset Used

We will use the Housing Dataset, which contains information about houses in the suburbs of Boston collected by D. Harrison and D.L. Rubinfeld in 1978. The Housing Dataset has been made freely available and can be downloaded from the UCI machine learning repository at https://archive.ics.uci.edu/ml/machine-learning-databases/housing/housing.data.

Features

• CRIM: This is the per capita crime rate by town
• ZN: This is the proportion of residential land zoned for lots larger than 25,000 sq.ft.
• INDUS: This is the proportion of non-retail business acres per town
• CHAS: This is the Charles River dummy variable (this is equal to 1 if tract bounds river; 0 otherwise)
• NOX: This is the nitric oxides concentration (parts per 10 million)
• RM: This is the average number of rooms per dwelling
• AGE: This is the proportion of owner-occupied units built prior to 1940
• DIS: This is the weighted distances to five Boston employment centers
• RAD: This is the index of accessibility to radial highways
• TAX: This is the full-value property-tax rate per $10,000
• PTRATIO: This is the pupil-teacher ratio by town
• B: This is calculated as 1000(Bk - 0.63)^2, where Bk is the proportion of people of African American descent by town
• LSTAT: This is the percentage lower status of the population
• MEDV: This is the median value of owner-occupied homes in $1000s

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
figures		figures
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
data.py		data.py
linear_model.py		linear_model.py
reg_analysis.py		reg_analysis.py
regression-sklearn.ipynb		regression-sklearn.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tale-O-Regression

Introduction

Our Approach

Dataset Used

Features

About

Releases

Packages

Languages

License

prakharchoudhary/Tale-O-Regression

Folders and files

Latest commit

History

Repository files navigation

Tale-O-Regression

Introduction

Our Approach

Dataset Used

Features

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages