current_model
: contains current model used in production.
N.B the two data contained in COLUMNS_ORDER.csv are mu
and std
which are used to normalize data.
dvf_nantes_cleaned.csv
: Original DVForigin_enriched_v2.csv
: DVF + POIorigin_enriched_v3.csv
: DVF + POI + Economy
DVF + POI + Economy. Postal code are fixe, we added Nantes district to get more details. Contains longitude and latitude
origin_enriched_v4_lonlat.csv
DVF + POI + Economy, data one-hot encoded, ready to use. The second datasets has longitude and latitude for debugging purposes
dataset_ready_v1.csv
dataset_ready_v1_lonlat.csv
DVF + POI + Economy, data one-hot encoded, ready to use. Postal code are fixe, we added Nantes district to get more details. Removed surface area < 25 m² Removed housing outside of this range: in 50k€ < price < 1M€ Removed housing where price are less than 1600€/m² Removed housing where they had same size, sold date, longitude and latitude. The second datasets has longitude and latitude for debugging purposes
dataset_ready_v2.csv
dataset_ready_v2_lonlat.csv
finance.csv
: Original datafinance_modified.csv
: Evolution of economy starting from 1st quarter of 2000. All other quarters are based on it.
code_postaux.csv
: Postal code associated to town (yes we have same postal code for differents town ¯\_(ツ)_/¯ )
“Ministry of Economy and Finance — Original data downloaded from https://www.data.gouv.fr/fr/datasets/demandes-de-valeurs-foncieres-geolocalisees/ updated on 25w April 2019”