This project is a capstone part of the Google Data Analytics Professional Certificate on Coursera.
The purpoes of this project is to analyze Cyclistic's historical bike trip data in 2022 in order to identify how annual members and casual riders use Cyclistic bikes differently. It also aims to extract insights and develop the most appropriate marketing strategy that appeal to casual riders and encourage them to subscribe to annual memberships.
This project follows the six steps of the data analysis process, as follows:
- ASK PHASE: Focus on the question: How do annual members and casual riders use Cyclistic bikes differently?
- PREPARE PHASE: Download the trip data and check its itegrity
- PROCESS PHASE: Clean the trip data in order to be ready for the data analysis
- ANALYZE PHASE: Analyze the trip data using RStudio
- SHARE PHASE: Visualize the trip data and share the insights to the stakeholders using Tableau or Google Slides
- ACT PHASE: Make the top three recommendtions based on the analysis
- Cyclistic Bike-Sharing Data Analysis: The complete R Markdown that explains the whole process throughout this project.
- Cyclistic Bike-Sharing Presentation: The slide deck that reports the whole anlysis to stakeholders.
- Data Processing: The R code that provides the process of how I clean the data.
- Data Analysis and Visualization: The R code that provides the analysis and visualization of the data.
This project analyzes Cyclistic's historical bike trip data from January to December 2022 from the Divvy system data website.
Run the following scripts to download the trip data in 2022:
# Change to the 'code' directory
cd ./code/
# Run retrieve_data.sh
./retrieve_data.sh
RStudio (the latest version) to gather all the information, process it, explore the data, and use it to find bike usage's patterns and insights.
Here are the R packages I used for the project:
This project is licensed under the MIT License - see the LICENSE file for details.