Skip to content

Commit-Thomas/Mod4_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 

Repository files navigation

🚲 Bike Share Demand Analysis

🧩 Business Problem & Stakeholders

This project analyzes hourly bike-share usage data to identify key demand drivers and guide decisions across product, operations, and marketing teams.

Stakeholders:

  • Product Manager: Rider behavior & feature testing.
  • Operations Lead: Staffing, rebalancing, maintenance planning.
  • Marketing Lead: Promo timing, rider segmentation.
  • Policy & Ethics Advisor: Ensure equitable access and responsible insights.

🔬 Methods Summary

Exploratory Data Analysis (EDA)

  • Cleaned and transformed data using pandas in Python.
  • Visualized hourly ridership trends by time, weather, and user type.

📈 Key Trends & Insights

Screenshot 2025-10-08 at 2 50 49 PM
  • Peak hours are 8 a.m. and 5 p.m. with roughly 350 & 475 average hourly riders repectively
  • Rindership increases before and decrease after theses points
  • Ridership has a strong relation ship with people commuting to work
Screenshot 2025-10-08 at 2 51 24 PM
  • Less clear weather correlates with lower average ridership
  • Clear weather has >200 Average riders compared to heavy snow/rain with < 100
Screenshot 2025-10-08 at 2 53 36 PM
  • Most riders are from registered users with 81.2% comapared to casual users making up the remaining 18.8%
  • People who rode a bikes are likely to be repeat users so they register

Hypothesis Testing

Q1: Working vs. Non-Working Days

  • Hypotheses:

Null hypothesis (H0): Mean hourly rides on working and non-working days are equal.

Alternative hypothesis (H1): Mean hourly rides on working and non-working days differ.

📊 Hypothesis Test Results

Screenshot 2025-10-08 at 3 14 16 PM
  • Test: Welch’s t-test
  • Result: p-value = 0.00004
  • 95% CI: (6.15, 17.45)
  • Decision: Reject null hypothesis!
  • Practical Insight: There is a statistically and practically significant difference in hourly ride counts between working and non-working days. On average, working days see 6 to 17 more rides per hour, which can translate to hundreds of additional daily rides. This pattern reflects likely commuter behavior and should guide weekday-focused operational planning and targeted pricing or promotional strategies.

Q2: Seasonal Differences

Hypotheses:

Null hypothesis (H0): All seasonal means are equal (no difference in average hourly rides).

Alternative hypothesis (H1): At least one seasonal mean is different.

Screenshot 2025-10-08 at 3 23 23 PM
  • Test: One-way ANOVA

  • Post-hoc: TukeyHSD

  • Decision: Reject the null hypothesis!

  • Insight: Average hourly rides do significantly differ between at least some seasons. The size of these differences with such a small p-value have clear operational, financial, and strategic implications for pricing, resource allocation, and customer engagement. making it not only statistically significant but practically significant aswell.

  • The appropriate post-hoc approch would be a Tukey's HDS for pairwise comparisons between seasons to determine which specific pairs of seasons differ significantly. This would allow stakeholders to focus on seasons that may be underperforming.


🚴 A/B Test — Commuter Hour Ridership

Objective:
Measure the impact of an app feature (launched on 2012-09-01) on weekday evening (17:00–19:00) ridership during good weather.

Eligibility Criteria:

  • workingday == 1
  • hr ∈ {17, 18, 19}
  • weathersit ∈ {1, 2}
  • hum ≤ 0.70

Windows:

  • Pre (Group A): 2012-08-04 → 2012-08-31
  • Post (Group B): 2012-09-01 → 2012-09-28

Design:
Data was stratified by (weekday × hour) and matched by truncating to equal sample sizes across time slots. This ensured balanced groups and fair comparison.

Balance Check

Screenshot 2025-10-08 at 6 58 55 PM

Test:

  • Paired t-test
  • Null: No difference in avg. hourly rides
  • α = 0.05

Results:

  • p-value: 0.0006
  • 95% CI: [-63.6, -18.7]
  • Mean Increase: +41.15 rides/hour
  • Cohen’s d: 0.30 (small–medium)

Statistically and practically significant (~15% lift).
Suggests increased demand likely due to the feature — may warrant operational changes (e.g., staffing, fleet availability).


✅ Recommendations

👩‍💼 Product Manager

  • Invest in feature development that supports commuter behavior
    • A/B test showed a +41.15 rides/hour increase after feature launch.
    • Prioritize features aimed at weekday evening rides (17:00–19:00)
  • Support the registered user base
    • With ~81% of rides from registered users, optimize for retention (e.g., loyalty rewards, referrals)
  • Run additional experiments
    • Continue controlled A/B testing to evaluate product changes over time

🛠️ Operations Lead

  • Prioritize commuter peak hours
    • Focus staffing, bike rebalancing, and maintenance during 8 a.m. and 5 p.m. peaks.
  • Implement weather-based resource planning
    • Clear weather = high demand; adjust fleet and station support accordingly
  • Plan for seasonal fluctuations
    • Use seasonal trends to shift operational focus (e.g., more bikes in summer/fall)

📣 Marketing Lead

  • Target weekday commuters
    • Use promotions and messaging during high-traffic windows (morning/evening)
  • Design seasonal campaigns
    • Create special promotions in underperforming seasons identified via ANOVA results
  • Leverage weather
    • Run weather-sensitive promotions

⚖️ Ethics & Limitations

Limitations

  • Doesn't include pricing model
  • Dates only range from 2011 - 2012
  • season date range is incorrect

Ethics

  • dont over prioitize peak hour commuters
  • dont over prioritize registered users

📁 Files in This Repo

  • slides/: presention with findings and insights.
  • data/: Cleaned and raw datasets.
  • notebooks/: Jupyter notebooks for EDA, testing, and modeling.
  • README.md: Project overview and conclusions.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published