This repository contains the results of my first Exploratory Data Analysis (EDA) project, completed while attending the neuefische Data Science Bootcamp.
For this task we were given a data set containing records of house sales in King County, Washington (the Seattle area). It includes, among other fields, the sale price, the sale date, the size of the lot and living space, the average lot and living space of nearby houses, and the latitude and longitude of each house.
As described in assignment.md:
- Create a new repo using this template.
- Through EDA/statistical analysis above please come up with AT LEAST 3 insights regarding the overall data. One should be geographical.
- In addition also come up with AT LEAST 3 recommendations for your stakeholder.
The requirements.txt file is a modified version of the one in the ds-visualization repository; essentially, the SQL packages were removed.
To set up the environment, run the following commands:
pyenv local 3.9.8
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
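After running the commands above, a quick sanity check can confirm that the interpreter is the one from the virtual environment and that it is a Python 3.9 release. The snippet below is only a minimal sketch and is not part of this repository: the helper names `in_virtualenv` and `version_matches` are hypothetical, and it compares only the major and minor version rather than the exact 3.9.8 patch release.

```python
import sys

# Taken from `pyenv local 3.9.8`; only major.minor is compared (an assumption).
EXPECTED_VERSION = (3, 9)


def in_virtualenv(prefix=None, base_prefix=None):
    """Return True when the interpreter runs inside a venv-style environment.

    Inside a venv, sys.prefix points at the environment directory while
    sys.base_prefix still points at the base Python installation.
    """
    prefix = sys.prefix if prefix is None else prefix
    base_prefix = sys.base_prefix if base_prefix is None else base_prefix
    return prefix != base_prefix


def version_matches(version_info=None, expected=EXPECTED_VERSION):
    """Return True when the major.minor version equals the expected one."""
    version_info = sys.version_info if version_info is None else version_info
    return tuple(version_info[:2]) == tuple(expected)


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
    print("Python 3.9 in use:", version_matches())
```

If either line prints `False`, the most likely cause is that `source .venv/bin/activate` was not run in the current shell, or that pyenv selected a different Python version.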
A detailed description of my analysis can be found in EDA.ipynb, and the slides for my presentation to the stakeholder can be found in EDA_project_neuefische.pdf.