![Redfin](Redfin.png)

# Predicting Housing Market Trends Using Data Science

**Potential Insights:** This dataset provides insights into housing market trends across different regions. I chose this dataset because housing market data reflects economic patterns, consumer behavior, and financial stability. By analyzing price fluctuations, sales trends, and inventory changes, I aim to identify key indicators that predict market shifts. The structured approach from Kaggle competitions will be applied to extract meaningful patterns, focusing on how market dynamics evolve over time.     

Additionally, the dataset's inclusion of multiple region types allows for comparisons between national trends and city-specific behaviors, providing an opportunity to explore how different real estate markets respond to economic factors. By leveraging machine learning techniques learned from the competitions, I will analyze potential leading indicators of price changes and market cycles.

### Project Scope
**Objectives:** Identify key trends and patterns in the housing market, predict market fluctuations, and provide actionable insights into real estate trends.

**Deliverables:** Charts, graphs, statistical reports, predictive models, and a final research paper detailing insights and methodologies.      

**Milestones:**
- Data collection and cleaning (Week 1)    
- Exploratory Data Analysis (Weeks 2-3)    
- Model selection and training (Weeks 4-6)     
- Model validation and optimization (Weeks 7-8)    
- Final report and presentation (Weeks 9-10)     

**Tasks:**
- Data acquisition and preprocessing      
- Exploratory data analysis (EDA)      
- Feature engineering and selection      
- Model training and validation      
- Performance evaluation and optimization     
- Presentation of findings      

**Resources:** Python, Pandas, NumPy, Scikit-Learn, Matplotlib, Seaborn, Jupyter Notebook      

### Research Plan
**Techniques and Methods:** Time-series analysis, regression models, machine learning techniques, and cross-validation methods.

**Application:** Apply time-series forecasting models to predict housing price trends, use feature engineering techniques to identify key drivers of price fluctuations, and validate results using robust evaluation metrics.
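As a rough illustration of how the forecasting and validation steps could fit together, the sketch below builds lag features from a monthly price series and scores a linear model with scikit-learn's `TimeSeriesSplit`, which keeps every training fold strictly earlier than its test fold. The synthetic series, the three-lag feature set, and the choice of `LinearRegression` and mean absolute error are all assumptions made for illustration; the project has not yet fixed its models or metrics.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error
from sklearn.model_selection import TimeSeriesSplit

# Synthetic monthly series standing in for one region's median_sale_price
# (assumption: the real data would come from redfin_df).
rng = np.random.default_rng(0)
n_months = 60
price = pd.Series(300_000 + 1_500 * np.arange(n_months) + rng.normal(0, 2_000, n_months))

# Feature engineering: use the previous 1, 2, and 3 months as predictors.
features = pd.DataFrame({f"lag_{k}": price.shift(k) for k in (1, 2, 3)})
data = features.assign(target=price).dropna()
X = data.drop(columns="target").to_numpy()
y = data["target"].to_numpy()

# Expanding-window cross-validation: each fold trains on the past only.
splitter = TimeSeriesSplit(n_splits=5)
fold_mae = []
for train_idx, test_idx in splitter.split(X):
    model = LinearRegression().fit(X[train_idx], y[train_idx])
    pred = model.predict(X[test_idx])
    fold_mae.append(mean_absolute_error(y[test_idx], pred))

print("MAE per fold:", [round(m) for m in fold_mae])
```

A shuffled K-fold split would let the model see future months during training, which tends to make housing forecasts look better than they are; the ordered split avoids that leak.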

**Hypothesis**
- **Hypothesis Statement:** Housing market trends can be predicted by analyzing historical price fluctuations, inventory changes, and sales patterns. By applying machine learning techniques, we can identify early indicators of market shifts, allowing for better-informed real estate investment decisions.

# Imports

In [1]:
import wrangle as w

In [2]:
redfin_df = w.check_file_exists_gbq("data.csv", "service-account-key.json", "iu-thesis-project.Redfin_Monthly_Housing_Market_Data.Redfin")

⚠️ CSV file 'data.csv' not found. Fetching clean data from BigQuery...
Downloading: 100%|█████████████████████████████████████████████████████████████|
✅ Clean dataset fetched from BigQuery and saved locally as 'data.csv'.
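The `wrangle` module is not shown in this notebook, so the following is only a guess at the cache-or-fetch pattern that the messages above suggest: read the local CSV when it exists, otherwise call a fetch function and save the result. The name `load_or_fetch` and the `fetch` callable are hypothetical; in this notebook the fetch step would be a BigQuery query.

```python
import tempfile
from pathlib import Path
from typing import Callable

import pandas as pd


def load_or_fetch(csv_path: str, fetch: Callable[[], pd.DataFrame]) -> pd.DataFrame:
    """Hypothetical helper: return the cached CSV if present, else fetch and cache."""
    path = Path(csv_path)
    if path.exists():
        return pd.read_csv(path)
    df = fetch()                 # e.g. a BigQuery query in the real project
    df.to_csv(path, index=False)
    return df


# Usage with a stand-in fetch function that records how often it is called.
calls = []

def fake_fetch() -> pd.DataFrame:
    calls.append(1)
    return pd.DataFrame({"region": ["A"], "median_sale_price": [300000]})

with tempfile.TemporaryDirectory() as tmp:
    target = str(Path(tmp) / "data.csv")
    first = load_or_fetch(target, fake_fetch)    # file missing: fetches and saves
    second = load_or_fetch(target, fake_fetch)   # file present: reads the CSV
```

One caveat with this pattern: a plain `pd.read_csv` does not restore `datetime64` or nullable `Int64` dtypes, so a cached read may need `parse_dates` and `dtype` arguments to match the freshly fetched frame.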


In [3]:
redfin_df.dtypes

region                                   object
month_of_period_end         datetime64[ns, UTC]
median_sale_price                         Int64
median_sale_price_mom                   float64
median_sale_price_yoy                   float64
homes_sold                                Int64
homes_sold_mom                          float64
homes_sold_yoy                          float64
new_listings                              Int64
new_listings_mom                        float64
new_listings_yoy                        float64
inventory                                 Int64
inventory_mom                           float64
inventory_yoy                           float64
days_on_market                            Int64
days_on_market_mom                        Int64
days_on_market_yoy                        Int64
average_sale_to_list                    float64
average_sale_to_list_mom                float64
average_sale_to_list_yoy                float64
dtype: object
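The `_mom` and `_yoy` columns appear to hold month-over-month and year-over-year changes for each metric. The sketch below shows one way such columns could be derived per region with `pct_change`, assuming they are fractional changes (0.04 meaning +4%); the notebook does not state how Redfin actually computes them, and the two-region toy data is invented for illustration.

```python
import pandas as pd

# Toy data: two regions, 13 consecutive months each (values are made up).
months = pd.date_range("2022-01-01", periods=13, freq="MS")
toy = pd.concat(
    [
        pd.DataFrame({"region": "A", "month": months,
                      "median_sale_price": [300_000 + 1_000 * i for i in range(13)]}),
        pd.DataFrame({"region": "B", "month": months,
                      "median_sale_price": [200_000 + 2_000 * i for i in range(13)]}),
    ],
    ignore_index=True,
).sort_values(["region", "month"])

# Compute changes within each region so one region's rows never feed another's.
grouped = toy.groupby("region")["median_sale_price"]
toy["median_sale_price_mom"] = grouped.pct_change(periods=1)    # vs. previous month
toy["median_sale_price_yoy"] = grouped.pct_change(periods=12)   # vs. same month last year

print(toy.groupby("region").tail(1))
```

The first month of each region has no MoM value and the first twelve have no YoY value, so both columns start with missing entries.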

In [None]:
# pandas DataFrames have no .summary() method; .describe() returns summary statistics
redfin_df.describe()