This repository contains SQL code and data for analyzing a housing dataset. Each record describes an area by its location (longitude and latitude), housing median age, total rooms, total bedrooms, population, households, median house value, and ocean proximity. The goal of the project is to explore the relationships and patterns among these features.
The dataset used for this analysis contains the following columns:
- longitude: Represents the longitude coordinate of a specific area.
- latitude: Represents the latitude coordinate of a specific area.
- housing_median_age: Refers to the median age of houses in a specific area.
- total_rooms: Represents the total number of rooms in a specific area.
- total_bedrooms: Indicates the total number of bedrooms in a specific area.
- population: Represents the total population in a specific area.
- households: Refers to the number of households in a specific area.
- median_house_value: Represents the median value of houses in a specific area.
- ocean_proximity: Indicates the proximity of the housing to the ocean.
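
As a rough illustration of how these columns might be laid out in a table, here is a minimal sketch using Python's built-in `sqlite3` module. The table name `housing` and the column types are assumptions; this README only specifies the column names, and the database engine used for the actual analysis may differ.

```python
import sqlite3

# Assumed table name and column types (not taken from the repository);
# only the column names come from the list above.
SCHEMA = """
CREATE TABLE housing (
    longitude           REAL,
    latitude            REAL,
    housing_median_age  INTEGER,
    total_rooms         INTEGER,
    total_bedrooms      INTEGER,
    population          INTEGER,
    households          INTEGER,
    median_house_value  REAL,
    ocean_proximity     TEXT
)
"""


def create_housing_table(conn):
    """Create the table and return its column names in declaration order."""
    conn.execute(SCHEMA)
    # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk).
    return [row[1] for row in conn.execute("PRAGMA table_info(housing)")]


conn = sqlite3.connect(":memory:")
columns = create_housing_table(conn)
print(columns)
```

An in-memory database keeps the sketch self-contained; pointing `sqlite3.connect` at a file path would persist the table instead.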
To gain insights from the dataset, the following questions will be explored:
- What is the median house value in different ocean proximities?
- How does the number of households vary based on the ocean proximity?
- What is the population distribution across different ocean proximities?
- What is the average number of total bedrooms in each ocean proximity category?
- How does the number of total rooms differ based on the ocean proximity?
- What is the median age of housing in different ocean proximities?
- How does latitude affect the median house value?
- How does longitude impact the number of households?
- What is the relationship between median house value and population?
- How does the total number of bedrooms change with latitude and longitude?
- What is the average number of total rooms in each housing median age group?
- How does the median house value vary with the total number of rooms?
- What is the distribution of median house value across different latitude ranges?
- How does the population change with the housing median age?
- What is the impact of latitude and longitude on the total number of bedrooms?
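
Most of these questions reduce to grouping by a category or a numeric band and then aggregating. The sketch below shows that pattern for two of them, run through Python's built-in `sqlite3` module. The table name `housing`, the column types, the four sample rows, and the category labels are all invented for illustration; they are not taken from the dataset, and the queries in this repository may be written differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Assumed table name and types; column names follow the list in this README.
conn.execute("""
    CREATE TABLE housing (
        longitude REAL, latitude REAL, housing_median_age INTEGER,
        total_rooms INTEGER, total_bedrooms INTEGER, population INTEGER,
        households INTEGER, median_house_value REAL, ocean_proximity TEXT
    )
""")

# Made-up sample rows, used only to make the queries runnable.
sample_rows = [
    (-122.0, 37.5, 30, 1000, 200, 800, 300, 300000.0, "NEAR BAY"),
    (-122.1, 37.9, 20, 2000, 400, 1500, 500, 500000.0, "NEAR BAY"),
    (-119.5, 36.2, 15, 1500, 300, 1200, 400, 100000.0, "INLAND"),
    (-119.0, 35.4, 25, 500, 100, 400, 150, 200000.0, "INLAND"),
]
conn.executemany("INSERT INTO housing VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", sample_rows)

# Per-category aggregates: average house value, average bedrooms, total households.
by_proximity = conn.execute("""
    SELECT ocean_proximity,
           AVG(median_house_value) AS avg_value,
           AVG(total_bedrooms)     AS avg_bedrooms,
           SUM(households)         AS total_households
    FROM housing
    GROUP BY ocean_proximity
    ORDER BY ocean_proximity
""").fetchall()

# Latitude effect: truncate latitude to whole degrees and average within each band.
by_latitude = conn.execute("""
    SELECT CAST(latitude AS INTEGER) AS lat_band,
           AVG(median_house_value)   AS avg_value
    FROM housing
    GROUP BY lat_band
    ORDER BY lat_band
""").fetchall()

for row in by_proximity:
    print(row)
for row in by_latitude:
    print(row)
```

Banding a continuous column such as latitude before grouping is one simple way to summarize its effect; a finer band width or a correlation measure would be reasonable alternatives.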
The results of the analysis, including code and insights, can be found in `houding_data_analysis_queries.sql` in this repository. Each question is followed by the query that answers it and a short insight.
Contributions to this project are welcome. If you have any suggestions, ideas, or improvements, please open an issue or submit a pull request.
- The dataset used for this analysis was sourced from Here.