This repository stores a Jupyter Notebook file used for the HDB resale price prediction mini-project in CB0494: Introduction to Data Science and Artificial Intelligence. The .csv file containing the dataset used is also included.
- Exploratory analysis of the dataset to investigate which variables were most predictive of resale price.
- Separating the dataset by the "town" variable, and keeping all other numeric variables.
- Training linear regression models for each dataset.
- Evaluating the performance of the linear regression models with test datasets.
- Utilizing the linear regression models to create a visual that displays if a prospective buyer can afford their desired HDB resale flat.
- Myself: data pre-processing, model training and evaluation, visual creation.
- Balaji: Exploratory analysis.