Open Data Bikes Sharing Stations Analysis
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
articles
data
images
notebooks
sources
.gitignore
LICENSE
README.md
environment.yml
requirements.txt

README.md

Open Data Bikes Analysis

License: MIT

Analyze bikes sharing station data from Bordeaux and Lyon Open Data (French cities).

Use the Python 3 programming language in Jupyter notebooks and the following libraries: pandas, numpy, seaborn, matplotlib, scikit-learn, xgboost.

See the requirements.txt file for the dependencies. If you use conda and the conda environnement, you can just do: conda env create -f environment.yml and the source activate bikes.

Clustering

Higly inspired by the Usage Patterns Of Dublin Bikes Stations article and his great notebook.

Analyze the daily profile and plot a map with a color for each usage pattern.

Example of pattern

You can see the percentage of available bikes for 4 different daily profiles. Note the analysis only keep job days.

  • Blue profile: people who take bikes in the morning, roll them into 'green' stations and go back home in the evening.
  • Green profile: opposite of the blue profile.
  • Orange profile: not very used stations. Sometimes too far from city center. Sometimes very close the tramway stations.
  • Red profile: stations where people go in the evening

Bordeaux-Pattern

Maps

Bordeaux Map Clustering

Bordeaux-Map

Lyon Map Clustering

Lyon-Map

Predict (draft)

Play with some different models to predict the number of available bikes (or a kind of availability).

Prediction Map

From history data (two weeks), prediction at T+30 minutes for every station in Lyon (France).

  • Blue means there are several available bikes
  • Red means there are just a few available bikes

Lyon-Prediction-Map

Data

See the lyon.tar.gz and bordeaux.tar.gz.