NYC MTA Turnstile Usage Dataset
We have a pool of questions and a dataset to use for answering them empirically.
- The dataset provides data on turnstile (i.e. tripod gate) usage in the NYC MTA (Ⓜ️ subway)
- It can be used as a proxy for subway usage and people's movements over time
- The date interval we work with is the first three months of 2013, Jan 1st to March 31st
- Some questions require temporal slices other than the whole interval
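As a minimal sketch of how the study interval and its sub-slices might be selected, the snippet below filters records by date using only the standard library. The record layout (dicts with `date` and `entries` keys), the sample rows, and the `filter_by_interval` helper are hypothetical and not taken from the project code; only the interval bounds come from this README.

```python
from datetime import date

# Study interval stated in this README: Jan 1st to March 31st, 2013 (inclusive).
STUDY_START = date(2013, 1, 1)
STUDY_END = date(2013, 3, 31)


def filter_by_interval(records, start=STUDY_START, end=STUDY_END):
    """Keep records whose 'date' falls within [start, end], inclusive.

    The record layout (a dict with a 'date' key) is an assumption
    made for illustration, not the dataset's actual schema.
    """
    return [r for r in records if start <= r["date"] <= end]


# Hypothetical sample rows, made up for illustration only.
sample = [
    {"date": date(2012, 12, 31), "entries": 10},
    {"date": date(2013, 1, 1), "entries": 20},
    {"date": date(2013, 2, 14), "entries": 30},
    {"date": date(2013, 4, 1), "entries": 40},
]

# Whole study interval.
whole_interval = filter_by_interval(sample)

# A narrower temporal slice, e.g. February only.
february_only = filter_by_interval(sample, date(2013, 2, 1), date(2013, 2, 28))
```

Keeping the bounds as defaults lets a question that needs a different slice pass its own `start` and `end` without touching the shared constants.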
The current report is available, see [html] | [ipynb]
Project structure:
├── data <- folder for datasets
│   ├── raw <- ... 1. raw data
│   ├── interim <- ... 2. auxiliary, generated, temporary, preprocessed data
│   ├── processed <- ... 3. final, ready-to-analysis data
│   └── external <- ... +. additional datasets
│
├── notebooks <- folder for *.ipynb files
│   └── *.ipynb 1 <- ... file 1
│
├── src <- folder for .py scripts
│   ├── *.py 1 <- ... file 1
│   ├── *.py 2 <- ... file 2
│   └── *.py 3 <- ... file 3
│
├── figures <- folder for charts and images to reports
│   ├── external <- ... 1. raw data
│   └── internal <- ... 2. raw data
│
├── reports <- folder for reports (i.e. *.pptx, *.html, *.ipynb)
│
├── docs <- folder for documentation files
│
├── README.md <- the top-level README for developers using this project
│
└── requirements.txt <- packages to build the python environment
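The folder layout above could be recreated with a short script like the following. This is a sketch only: the `scaffold` function and the `PROJECT_DIRS` list are hypothetical helpers, not part of the repository, and the directory names are copied from the tree above.

```python
from pathlib import Path

# Directories taken from the project structure in this README.
PROJECT_DIRS = [
    "data/raw",
    "data/interim",
    "data/processed",
    "data/external",
    "notebooks",
    "src",
    "figures/external",
    "figures/internal",
    "reports",
    "docs",
]


def scaffold(root):
    """Create the project folders under `root` and return their paths.

    Existing folders are left untouched (exist_ok=True), so the
    function is safe to run more than once.
    """
    root = Path(root)
    created = []
    for rel in PROJECT_DIRS:
        path = root / rel
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Calling `scaffold(".")` from the repository root would create any missing folders; files such as `README.md` and `requirements.txt` are not created by this sketch.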
Cookiecutter Data Science | tsdataclinic - SubwayCrowds
Placeholder
Ask before use.