Skip to content

Database for visualizing and understanding the impacts of declared disasters on real estate prices

License

Notifications You must be signed in to change notification settings

jsflevitt/DevEstates

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DevEstates

A database for visualizing and understanding the impacts of federal disasters on real estate prices.

Presentation Link.


Introduction

Large disasters are known to depress real estate prices for months or years. Some areas recover quickly while others do not. This database is designed to assist home buyers and sellers, investors, and researchers in understanding past temporospatial trends in real estate pricing while enabling them to make predictions regarding current and future events.

Architecture

An outline of the data pipeline architecture. Not every database requires geospark and therefore utilize Apache Spark. The ETL cluster can be reconfigured to optimize the processing of the incoming data. Please see EC2 Setups for scripts and more details.

Data Pipeline

Datasets

Real Estate Data Sources:

These consistute a small sample of what is available.

Private

Public

Disaster Data Sources:

These Federal Databases contain both geospatial and impact data, but frequently provide less information on exact dates.

Fire Perimeter Data

General Hazard Data

Engineering challenges

Challenges in creating this database came in two main areas:

  1. Working with the myriad different databases that store real estate sales and listing prices
    1. Creating a unified data model to connect the variety of data inputs
    2. Deploying customized ETL clusters to better handle varied data input sources
  2. Creating a database that works efficiently with data organized in both time and space
    1. A significant amount of preprocessing is done with geospark in order to speed up later database calls
    2. The database is deployed using both PostGIS and TimescaleDB to facilitate temporospatial indexing and deployment

About

Database for visualizing and understanding the impacts of declared disasters on real estate prices

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages