# HOPUS

HOPUS (**HO**using **P**ricing **U**tilitie**S**) contains a variety of routines used to predict real estate prices.

This notebook highlights what HOPUS can do, namely:
- clean the raw data,
- perform exploratory analysis of the data,
- train a variety of models for the prediction of real estate prices, and
- evaluate the performance of these models.

## Technical preliminaries

In [1]:
# Standard imports
import pandas as pd

In [2]:
# We clone the HOPUS repository to have access to all its data and routines
!git clone https://github.com/aremondtiedrez/hopus.git
%cd hopus

Cloning into 'hopus'...
remote: Enumerating objects: 194, done.
remote: Counting objects: 100% (194/194), done.
remote: Compressing objects: 100% (139/139), done.
remote: Total 194 (delta 102), reused 126 (delta 48), pack-reused 0 (from 0)
Receiving objects: 100% (194/194), 733.23 KiB | 10.63 MiB/s, done.
Resolving deltas: 100% (102/102), done.
/content/hopus


In [3]:
# Import requisite modules from HOPUS
import preprocessing

## Data cleaning

In [4]:
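# Load the home price index (HPI) and run its preprocessing step
hpi = preprocessing.home_price_index.load()
preprocessing.home_price_index.preprocess(hpi)

# Load the property listings and preprocess them using the HPI
listings_data = preprocessing.property_listings.load()
listings_data = preprocessing.property_listings.preprocess(listings_data, hpi)

# Display the cleaned listings
listings_data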

| | identification | identification | identification | identification | identification | identification | unused | identification | identification | unused | ... | predictionFeatures | predictionFeatures | predictionFeatures | unused | predictionFeatures | unused | unused | unused | unused | unused |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | id | formattedAddress | addressLine1 | addressLine2 | city | state | stateFips | zipCode | county | countyFips | ... | yearBuilt_nan | saleMonth | saleYear | trueValueHomePriceIndex | availableValueHomePriceIndex | trueMinusAvailableHomePriceIndex | monthAvgTrueMinusAvailableHomePriceIndex | predictedValueHomePriceIndex | pricePerSqFt | timeNormalizedPricePerSqFt |
| 0 | 136-S-Hancock-St,-Madison,-WI-53703 | 136 S Hancock St, Madison, WI 53703 | 136 S Hancock St | | Madison | WI | 55 | 53703 | Dane | 25.0 | ... | False | 12 | 2022 | 294.220 | 300.326 | -6.106 | -0.247974 | 300.078026 | 291.313559 | 0.990122 |
| 1 | 304-S-Baldwin-St,-Madison,-WI-53703 | 304 S Baldwin St, Madison, WI 53703 | 304 S Baldwin St | | Madison | WI | 55 | 53703 | Dane | 25.0 | ... | False | 1 | 2023 | 292.712 | 298.542 | -5.830 | -0.228921 | 298.313079 | 322.189922 | 1.100706 |
| 2 | 223-N-Baldwin-St,-Madison,-WI-53703 | 223 N Baldwin St, Madison, WI 53703 | 223 N Baldwin St | | Madison | WI | 55 | 53703 | Dane | 25.0 | ... | False | 1 | 2023 | 292.712 | 298.542 | -5.830 | -0.228921 | 298.313079 | 240.167919 | 0.820492 |
| 3 | 1317-E-Johnson-St,-Madison,-WI-53703 | 1317 E Johnson St, Madison, WI 53703 | 1317 E Johnson St | | Madison | WI | 55 | 53703 | Dane | 25.0 | ... | False | 2 | 2023 | 293.507 | 296.727 | -3.220 | 0.273500 | 297.000500 | 320.062451 | 1.090476 |
| 4 | 1151-E-Johnson-St,-Madison,-WI-53703 | 1151 E Johnson St, Madison, WI 53703 | 1151 E Johnson St | | Madison | WI | 55 | 53703 | Dane | 25.0 | ... | False | 2 | 2023 | 293.507 | 296.727 | -3.220 | 0.273500 | 297.000500 | 216.216216 | 0.736665 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 1777 | 1908-Arlington-Pl,-Madison,-WI-53726 | 1908 Arlington Pl, Madison, WI 53726 | 1908 Arlington Pl | | Madison | WI | 55 | 53726 | Dane | 25.0 | ... | False | 9 | 2024 | 324.710 | 325.309 | -0.599 | 1.146641 | 326.455641 | 510.204082 | 1.571261 |
| 1778 | 1602-Summit-Ave,-Madison,-WI-53726 | 1602 Summit Ave, Madison, WI 53726 | 1602 Summit Ave | | Madison | WI | 55 | 53726 | Dane | 25.0 | ... | False | 10 | 2024 | 324.029 | 325.631 | -1.602 | 0.363051 | 325.994051 | 364.990689 | 1.126414 |
| 1779 | 2415-Chamberlain-Ave,-Madison,-WI-53726 | 2415 Chamberlain Ave, Madison, WI 53726 | 2415 Chamberlain Ave | | Madison | WI | 55 | 53726 | Dane | 25.0 | ... | False | 10 | 2024 | 324.029 | 325.631 | -1.602 | 0.363051 | 325.994051 | 371.479928 | 1.146440 |
| 1780 | 2421-Chamberlain-Ave,-Madison,-WI-53726 | 2421 Chamberlain Ave, Madison, WI 53726 | 2421 Chamberlain Ave | | Madison | WI | 55 | 53726 | Dane | 25.0 | ... | False | 11 | 2024 | 323.745 | 325.076 | -1.331 | -0.026289 | 325.049711 | 374.313187 | 1.156198 |
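
The output above suggests that the cleaned listings use two-level column labels, where the top level (`identification`, `predictionFeatures`, `unused`) groups the columns by role. The sketch below is an illustration only and does not use HOPUS itself: it builds a hypothetical `toy` DataFrame from the first two displayed rows, selects one column group, and recomputes three derived columns. The formulas are inferred from the displayed values, not taken from the HOPUS source, so treat them as assumptions.

```python
import pandas as pd

# Two-level column labels (group, name), mirroring the layout displayed above.
columns = pd.MultiIndex.from_tuples(
    [
        ("identification", "id"),
        ("predictionFeatures", "saleMonth"),
        ("predictionFeatures", "saleYear"),
        ("unused", "trueValueHomePriceIndex"),
        ("predictionFeatures", "availableValueHomePriceIndex"),
        ("unused", "monthAvgTrueMinusAvailableHomePriceIndex"),
        ("unused", "pricePerSqFt"),
    ]
)

# Values copied from the first two rows of the output above.
toy = pd.DataFrame(
    [
        ["136-S-Hancock-St,-Madison,-WI-53703", 12, 2022,
         294.220, 300.326, -0.247974, 291.313559],
        ["304-S-Baldwin-St,-Madison,-WI-53703", 1, 2023,
         292.712, 298.542, -0.228921, 322.189922],
    ],
    columns=columns,
)

# Selecting a top-level label returns every column in that group.
features = toy["predictionFeatures"]
print(list(features.columns))

# A (group, name) tuple selects a single column.
true_hpi = toy[("unused", "trueValueHomePriceIndex")]
available_hpi = toy[("predictionFeatures", "availableValueHomePriceIndex")]
month_avg_gap = toy[("unused", "monthAvgTrueMinusAvailableHomePriceIndex")]
price_per_sqft = toy[("unused", "pricePerSqFt")]

# Assumed relationships, inferred from the displayed rows only.
true_minus_available = true_hpi - available_hpi
predicted_hpi = available_hpi + month_avg_gap
time_normalized = price_per_sqft / true_hpi

print(true_minus_available.round(3).tolist())
print(predicted_hpi.round(6).tolist())
print(time_normalized.round(6).tolist())
```

For these two rows, the recomputed values agree with the displayed `trueMinusAvailableHomePriceIndex`, `predictedValueHomePriceIndex`, and `timeNormalizedPricePerSqFt` columns to the precision shown. If the real `listings_data` has the same column layout, `listings_data["predictionFeatures"]` should select the model inputs in the same way.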
