Skip to content

House Hocket Company data analysis and hypothesis validation project

License

Notifications You must be signed in to change notification settings

renankalfa/data-analysis-house-rocket

Repository files navigation

House Rocket - Data Analysis

House Hocket (fictional) Company data analysis and hypothesis validation.

house rocket wallpaper

1. Business Problem 🤵

House Rocket is a digital platform in which its business model is based on the purchase and sale of properties. As a data scientist, my mission is to validate business hypotheses and recommend the purchase of properties based on criteria, as well as set the sale price and profit.

This dataset contains properties prices for King County, one of 39 counties in the US state of Washington.

king country

2. Business Assumptions 👀

  • The data analyzed was from May 2014 to May 2015;
  • The seasons of the year, for simplicity, were defined as:
    • "Summer" starts in March and ends in August;
    • "Winter" starts in September and ends in February.

The criterion to recommend the properties to be acquired was based on the 'condition' attribute and the price:

  • The "condition" attribute must be greater than or equal to 3. This is equivalent to the "'condition_type" attribute being regular or good;
  • The property price must be less than the region's median price (zipcode).

3. Business Result

dash results

Result of the deal of 10579 properties:

  • Total cost: US$ 4.094.212.008,00;
  • Total sold: US$ 4.851.233.774,00;
  • Total profit: US$ 757.021.766,00.

4. Validation of Business Hypotheses

H1 - Properties with 3 bathrooms have a 15% MoM average growth.

newplot (11)

False! Average month-over-month growth for 3 bathroom properties is 23.28%.

For comparison purposes, follow the chart with all the properties:

MoM all

H2 - Properties without a basement have a total area (of the lot) 40% larger, on average, than properties with a basement.

average batch per basement

False! Properties without a basement have a total area (of the lot) 22,56% larger, on average, than properties with a basement.

H3 - Properties with water view are 20% more expensive on average.

average price per water view

False! Properties with water view are 212,64% more expensive on average.

H4 - Properties with a construction date less than 1955 are 20% cheaper on average.

average price per year built

False! Properties with a construction date less than 1955 are 0.78% cheaper on average, an irrelevant difference that can be seen in the graph.

5. Conclusion 📍

I consider that most of the objectives have been completed that have already determined more elaborate properties to be purchased, in addition to stipulating a sale value based on seasonality. As well as the validation of hypotheses.

I haven't completed all the goals, like deploying an online dashboard, to start a machine learning project right now, and I believe this will be more beneficial for me.

6. Next Steps 👣

  • Create and verify more business hypotheses;
  • Generate top 5 insights;
  • Analyze correlation in depth to gain insights;
  • Deploy a online dashboard to CEO.

7. Tools Used 🛠️

  • Python 3.8.12;
  • Jupyter Notebook;
  • Git/GitHub/GitHub Desktop;
  • Adobe Photoshop;
  • Notion;
  • Vegas Pro 15.

Back to top

About

House Hocket Company data analysis and hypothesis validation project

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published