Skip to content

PatrickDorante/CAPSTONE-PROJECT-R-PROGRAMMING

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

Case Study: Bike-Share Analysis

The goal of this case study is to showcase the skills learned throughout the Google Data Analytics Proffessional Cerficate program on Coursera.

This project will demonstrate some data processing and visualization


2022data_pic


6 Phases of Data Analysis

Ask, Prepare, Process, Analyze, Share, Act

we'll learn the different techniques how processing each phase and exploring data analysis. Several software tools were used for this case study.

Company Mission

Since launched in 2016, Cyclistic has grown across 
Chicago with a fleet of 5,824 bicycles that are geo-tracked 
and locked into a network of 692 stations. 
Their systems have made it easy for riders to unlock bikes 
from one station and return to any other station at any time. 
The Cyclistic’ s marketing strategy success made possible 
because of the flexibility of its pricing plans: single-ride passes, 
full-day passes = casual riders, and annual memberships.

Ask Phase

Cyclistic’s marketing Problem

The Cyclistic’s marketing strategy is to predict customer interactions in the last 12 months. As members of the marketing team, our tasks are going to explore hypothesis and predictions by comparing how Cyclistic's bikes usage differ between annual and casual riders.

Ask questions related to your case study before solving the project

  • How can you help the stakeholders resolve their questions?
  • How can your insights drive business decisions?
Key tasks
  • Identify stakeholders of the project
  • Marketing Director/Manager
  • Executive Team
Deliverable Scenario

Provide insights with relevant data sources and cleaning, key findings, provide meaningful data visualization support and data driven decisions.

Prepare Phase

Data Extraction

Given real datasets available at Motivate International Inc. license here, we will use 12 months data from April 2020 to May 2021 for our fictional business, which aims at increasing revenues with their service's bikes share, by converting casual riders into annual members.

Process Phase

Keys tasks

Janitor, Tidyverse, and gglplot2 packages for data cleansing and visualization using in R. Data will be loaded & stored with different variables. Missing and non-null values to sort and filter out. Data missed key such as demographic, income, age not included.

Data cleaning

Inspection of data structures and data types for errors:

  • Total Rows: 4,358,611
  • Total Columns: 13
  • Total missing - NA or non-values: 431585
Data Transformation - Manipulation

The old datasets will change into a new data-frame including columns (members classification, bike types, week-days, months, ride-duration) for computational and descriptive analysis.

Analyze Phase

This part focus strictly on descriptive analysis. Comparing the charts, casual users ride the share-bikes more than annual member users during the weekend.

Data Visualization

avg_trips datetime Days_of_week percent_usrs average_time_month

Data Preparation Tools
  • R - analysis and visualization
  • Microsoft SQL Server & Excel
  • Tableau - create data visualization and reports
  • Jupyter Notebook - data analysis and visualizations

Resources

Stackoverflow community

Github community

bike-share-pic

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published