#DOT Data Science Working Group @ Code for SF (CFA) : App for DoT Prompt 1 (Hazmat Incidents)
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
figure
random_forest_v2_files/figure-markdown_github
shiny_anomaly_detection
website
DOT.r
Oil_gas_statistics_code_book
README.md
hazmat_anomaly_detection_DS-JC.md
oil_gas_data_processing.R
random_forest_v2.Rmd
random_forest_v2.md

README.md

Data Science Working Group @ Code for SF (CFA) :

Applications and Model Building for DoT Prompt 1 (Hazmat Incidents)

Synopsis:

In order to address the Department of Transportation prompt, the team has developed two tools which assist in detecting and predicting traffic related incidents.

The first tool is a model to predict the number of incidents that involved hazardous materials for a month in a particular state. The model was developed taking into account various publicly available data sources such as records on land development applications, employment data in energy related sectors, oil prices, and more.

Data sources used:

A Random Forest model was built using features from employment data from BLS and Oil and gas statistics data to predict number of Hamzat incidences at the state level. Further details can be found in the markdown document which describes the model selecion and building process:

  • Markdown Report
  • The model uses features of employment from BLS and Oil and gas statistics

The model projection and actual data by state can be accessed and visualized through an interactive front-end tool.

The second tool monitors the number of fatalities as well as the number of reported incidences involving hazardous materials and flags breakouts in trends using an anomaly detection algorithm. This can be used in real time for early detection of accumulations of incidents and accidents to trigger a fast investigation on the underlying problem.

Outlook:

Both tools have proven basic capabilities and can be further enhanced through a more thorough development. For instance, predictive capability of the model can be improved through taking into account further detailed and more granular data sources as they become available. This project has identified features and areas that can be used to inform possible preventative measures that the Department of Transportation can take to prevent Hazmat related incidents.

Slides:

deck here

####Contributors:

  • Jude Cavillo
  • Rocio Ng
  • Wade Fuller
  • Tyler Field
  • Catherine Zhang
  • Daniel Schweigert
  • Ben Lucas