Skip to content
Tomer Sagi edited this page Nov 10, 2016 · 2 revisions

Welcome to the US-2014 dataset wiki

This wiki details the process required to generate the US-2014 dataset form it's sources - public DOT data collected from BTS.gov. A vertica database is assumed. For each component the DDL (create tables and views) is followed by the process required to generate the common data used by all scenarios and then by the specific steps required to generate the scenario-specific data. Throughout, we use two schemas. airlines for staging and scenarioDB_USA for the final dataset.

This process is done in two phases: In the first one we create the schema of the data tables and insert initial data into the tables, which will be the basis for creating scenarios.

In the second phase we create the scenarios and generate scenario-specific data.

  1. Creating Tables and Generating Base Data
  2. Creating Scenarios
  3. Creating Itineraries

Clone this wiki locally