### First, we will create a synthetic dataset for the business problem

**To simulate this scenario, we can generate a dataset with the following columns:**

- Date: First day of each month in the time series (monthly data from January 2010 to December 2023).
- City: Ontario city where the retirement houses are located.
- Property_Price: Average property price in that city for that month.
- Occupancy_Rate: Average occupancy rate of retirement houses in that city (percentage).
- Inflation_Rate: Inflation rate for that month, in percent (a proxy for pressure on the property market).
- Population_Age_65+: Number of residents aged 65 and over in the city (potential customers).

In [11]:
import pandas as pd
import numpy as np

# Parameters for the dataset
np.random.seed(42)
dates = pd.date_range(start='2010-01-01', end='2023-12-01', freq='MS')
cities = ['Toronto', 'Ottawa', 'Mississauga', 'Brampton', 'Hamilton', 'London', 'Markham', 'Vaughan', 'Kitchener', 'Windsor']

# Mean and standard deviation for property prices (values are clipped to 600K-1.2M below, i.e. mean +/- 3 std)
mean_price = 900000
std_dev_price = 100000

# Collect the rows in a plain list; the DataFrame is built from it afterwards
data = []

# Populate the dataset
for city in cities:
    for date in dates:
        property_price = round(np.random.normal(mean_price, std_dev_price))  # Normally distributed property prices
        # Ensure that prices stay within bounds of 600K to 1.2M
        property_price = max(600000, min(1200000, property_price))
        occupancy_rate = np.random.uniform(70, 95)  # Simulated occupancy rate percentage
        inflation_rate = np.random.uniform(1, 4)  # Inflation rate
        population_age_65 = round(np.random.normal(50000, 10000))  # Population aged 65+
        
        data.append([date, city, property_price, occupancy_rate, inflation_rate, population_age_65])

# Create the DataFrame
df = pd.DataFrame(data, columns=['Date', 'City', 'Property_Price', 'Occupancy_Rate', 'Inflation_Rate', 'Population_Age_65+'])

# Save the dataset (assumes the ../Data directory already exists)
df.to_csv('../Data/ontario_retirement_house_investment_data.csv', index=False)

print("Dataset created successfully.")


Dataset created successfully.
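
As an optional cross-check, the same column definitions can be sketched in vectorized form and validated with a few assertions. This is only a sketch under stated assumptions: `rng`, `index`, and `check_df` are names introduced here for illustration, and `np.random.default_rng(42)` is used instead of the global `np.random.seed(42)`, so the drawn values will not match the loop above row for row. Nothing is written to disk.

```python
import numpy as np
import pandas as pd

# Assumption: a local Generator replaces the global seed used in the loop version,
# so the individual random values differ from the saved CSV.
rng = np.random.default_rng(42)

dates = pd.date_range(start='2010-01-01', end='2023-12-01', freq='MS')
cities = ['Toronto', 'Ottawa', 'Mississauga', 'Brampton', 'Hamilton',
          'London', 'Markham', 'Vaughan', 'Kitchener', 'Windsor']

# One row per (city, date) pair, in the same city-major order as the nested loop
index = pd.MultiIndex.from_product([cities, dates], names=['City', 'Date'])
n = len(index)

check_df = pd.DataFrame({
    'Date': index.get_level_values('Date'),
    'City': index.get_level_values('City'),
    # Normal draws, rounded, then clipped to the 600K-1.2M bounds
    'Property_Price': np.clip(np.round(rng.normal(900_000, 100_000, n)),
                              600_000, 1_200_000).astype(int),
    'Occupancy_Rate': rng.uniform(70, 95, n),
    'Inflation_Rate': rng.uniform(1, 4, n),
    'Population_Age_65+': np.round(rng.normal(50_000, 10_000, n)).astype(int),
})

# Sanity checks on shape, value ranges, and uniqueness of (Date, City)
assert len(check_df) == len(cities) * len(dates)
assert check_df['Property_Price'].between(600_000, 1_200_000).all()
assert check_df['Occupancy_Rate'].between(70, 95).all()
assert check_df['Inflation_Rate'].between(1, 4).all()
assert not check_df.duplicated(subset=['Date', 'City']).any()
```

The same assertions could also be run against `df` from the cell above, since both frames share the same column names and bounds.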
