R project in Data Science for Business program X-HEC
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
R Rename trainpack/R/zzz.R to R/zzz.R Dec 17, 2018
data-raw Add files via upload Dec 17, 2018
data Add files via upload Dec 17, 2018
inst
man Rename trainpack/man/ypep.Rd to man/ypep.Rd Dec 17, 2018
tests Rename trainpack/tests/testthat/test-rev_geocoding.R to tests/testtha… Dec 17, 2018
vignettes Rename trainpack/vignettes/trainpack.Rmd to vignettes/trainpack.Rmd Dec 17, 2018
.Rbuildignore Rename trainpack/.Rbuildignore to .Rbuildignore Dec 17, 2018
DESCRIPTION Rename trainpack/DESCRIPTION to DESCRIPTION Dec 17, 2018
LICENSE Create LICENSE Dec 17, 2018
NAMESPACE Rename trainpack/NAMESPACE to NAMESPACE Dec 17, 2018
README.md Update README.md Oct 31, 2018
trainpack.Rproj Rename trainpack/trainpack.Rproj to trainpack.Rproj Dec 17, 2018

README.md

Rproject

R project in Data Science for Business program X-HEC

SUBJECT - "Win money when your train is late"

Situation

SNCF trains are reportedly making a habit of being late (30 min or more)

Task

We decided to deep dive into SNCF historic datasets and its real time API to gather some insights. We want to build a model predicting how often a train of a specific line / period of time.

Action Plan

Our objective is to build a model able to give us a confidence interval of the probability that a specific train on a specific line will be late by 30 min or more. Then, share those results to the public in the format of a dashboard.

Result

Creation of a basic MVP on which you can bet if the train is going to be late or not. Our system will automatically compute the quote of a specific train and compare it the the real data. If you bet correctly you get your money*1/probability