Skip to content

Stacking of individual Ensemble methods to predict Bike Rental count at any hour of the year

Notifications You must be signed in to change notification settings

shashankvmaiya/Bike-Sharing-Demand-Prediction

Repository files navigation

Bike-Sharing-Demand-Prediction

In this project, we train a model to predict the number of bike rentals at any hour of the year given the weather conditions. The data set was obtained from the Capital Bikeshare program in Washington, D.C. which contained the historical bike usage pattern with weather data spanning two years.

We first build several individual regression models such as Linear, Ridge, Random Forest, Gradient Boost and Adaboost. We then use 'Stacking' approach where the predictions from these level 1 individual models are used as meta-features to build a second level model (Linear Regressor, Random Forest and Gradient Boost) to further enhance the predicting capabilities.

The key takeaway from the EDA and Modeling are

  • Bike rental count is strongly correlated with the time of the day, with weaker dependence on the temperature and humidity
  • There are 2 rental patterns across the day in bike rentals count
    • First on a Working Day where the rental count high at peak office hours (8am and 5pm) and
    • Second on a Non-working day where rental count is more or less uniform across the day with a peak at around noon.

Below figure shows the bike rental behavior and our model predictions on the test set data for a Random Forest Model

About

Stacking of individual Ensemble methods to predict Bike Rental count at any hour of the year

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published