# How Can a Wellness Technology Company Play It Smart?

## Case Study Overview

* This case study is a **capstone project** for the *Google Data Analytics Professional Certificate*.
* In this scenario, I am a **junior data analyst** on the **marketing analytics team** at **Bellabeat**, a high-tech manufacturer of health-focused products for women.
* Bellabeat is a successful small company, but they have the potential to become a larger player in the global smart device market.
* **Urška Sršen**, co-founder and Chief Creative Officer of Bellabeat, believes that analyzing smart device fitness data could help unlock new growth opportunities for the company.
* I was asked to focus on one of Bellabeat’s products and analyze smart device data to gain insight into how consumers use their smart devices. These insights will then help guide the company's marketing strategy.

## Bellabeat Products

* **Bellabeat App:** Central platform that syncs with Bellabeat devices to track activity, sleep, stress, menstrual cycle, and mindfulness habits.
* **Leaf:** Wearable wellness tracker (bracelet, necklace, or clip) that monitors activity, sleep, and stress.
* **Time:** Stylish wellness watch that tracks activity, sleep, and stress, combining classic design with smart features.
* **Spring:** Smart water bottle that tracks daily water intake to support proper hydration.
* **Bellabeat Membership:** Subscription service offering 24/7 personalized guidance on nutrition, activity, sleep, beauty, and mindfulness based on individual goals and lifestyle.

## Business Objective

Analyze smart device usage data in order to gain insight into how consumers use non-Bellabeat smart devices. Select and apply these insights to a Bellabeat product. These questions will guide my analysis:
1. What are some trends in smart device usage?
2. How could these trends apply to Bellabeat customers?
3. How could these trends help influence Bellabeat marketing strategy?

## Data Sources

* This case study uses the [FitBit Fitness Tracker Data](https://www.kaggle.com/datasets/arashnic/fitbit) (CC0: Public Domain, dataset made available through Mobius).
* This Kaggle dataset contains personal fitness tracker data from thirty Fitbit users who consented to the submission of their tracker data, including minute-level output for physical activity, heart rate, steps, and sleep monitoring that can be used to explore users’ habits.
* This Kaggle dataset has a usability score of 9.41, which indicates that it is well documented. The score supports, but does not by itself prove, the data's credibility and integrity.
* **Kaggle's R kernel** will be used to clean, analyze, and visualize the data.

Load R's 'tidyverse' package for this analysis.

In [None]:
library("tidyverse")

Create a dataframe named 'daily_activity' and read in one of the CSV files from the dataset.

In [None]:
daily_activity <- read.csv(
    "../input/fitbit/mturkfitbit_export_4.12.16-5.12.16/Fitabase Data 4.12.16-5.12.16/dailyActivity_merged.csv"
)

Create another dataframe named 'sleep_day' for the sleep data.

In [None]:
sleep_day <- read.csv(
    "../input/fitbit/mturkfitbit_export_4.12.16-5.12.16/Fitabase Data 4.12.16-5.12.16/sleepDay_merged.csv"
)

Take a look at the daily_activity data.

In [None]:
head(daily_activity)

Identify all the columns in the daily_activity data.

In [None]:
colnames(daily_activity)

Take a look at the sleep_day data.

In [None]:
head(sleep_day)

Identify all the columns in the sleep_day data.

In [None]:
colnames(sleep_day)

How many unique participants are there in each dataframe?

In [None]:
n_distinct(daily_activity$Id)
n_distinct(sleep_day$Id)

It looks like there may be more participants in the daily activity dataset than the sleep dataset.

How many observations are there in each dataframe?

In [None]:
nrow(daily_activity)
nrow(sleep_day)

## Data Preparation

* Loaded R's 'tidyverse' package for this analysis.
* Created a data frame named 'daily_activity' and read in one of the CSV files from the dataset.
* Created another data frame named 'sleep_day' for the sleep data.
* Took a look at the daily_activity data.
* Identified all the columns in the daily_activity data: 'Id', 'ActivityDate', 'TotalSteps', 'TotalDistance', 'TrackerDistance', 'LoggedActivitiesDistance', 'VeryActiveDistance', 'ModeratelyActiveDistance', 'LightActiveDistance', 'SedentaryActiveDistance', 'VeryActiveMinutes', 'FairlyActiveMinutes', 'LightlyActiveMinutes', 'SedentaryMinutes', 'Calories'.
* Took a look at the sleep_day data.
* Identified all the columns in the sleep_day data: 'Id', 'SleepDay', 'TotalSleepRecords', 'TotalMinutesAsleep', 'TotalTimeInBed'.
* Counted the unique participants in each data frame:
	* Daily Activity: 33 participants.
	* Sleep Data: 24 participants.
* The daily activity dataset covers more participants than the sleep dataset.
* Counted the observations in each data frame:
	* Daily Activity: 940 observations.
	* Sleep Data: 413 observations.
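
The counts above do not show whether either table contains repeated rows or missing values. Below is a minimal sketch of such a check in base R. It runs on `sleep_toy`, a small made-up data frame that only borrows the column names of `sleep_day`; the values and the names `sleep_toy`, `n_duplicates`, `n_missing`, and `sleep_clean` are illustrative assumptions, not results from the dataset.

```r
# Toy stand-in for sleep_day; the values are invented for illustration.
sleep_toy <- data.frame(
    Id = c(1, 1, 2, 2, 2),
    SleepDay = c("4/12/2016 12:00:00 AM", "4/13/2016 12:00:00 AM",
                 "4/12/2016 12:00:00 AM", "4/12/2016 12:00:00 AM",
                 "4/13/2016 12:00:00 AM"),
    TotalMinutesAsleep = c(400, 380, 420, 420, NA),
    TotalTimeInBed = c(430, 410, 450, 450, 470)
)

# Count fully duplicated rows and missing values per column.
n_duplicates <- sum(duplicated(sleep_toy))
n_missing <- colSums(is.na(sleep_toy))

# Keep only the first copy of each repeated row.
sleep_clean <- sleep_toy[!duplicated(sleep_toy), ]

print(n_duplicates)
print(n_missing)
print(nrow(sleep_clean))
```

The same calls could be pointed at `daily_activity` and `sleep_day` before computing any summary statistics.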

What are some quick summary statistics we'd want to know about each data frame?

In [None]:
# For the daily activity dataframe:
daily_activity %>% 
    select(
        TotalSteps, 
        TotalDistance, 
        SedentaryMinutes
    ) %>% 
    summary()
cat("--------------------------------------------------\n")
# For the sleep dataframe:
sleep_day %>% 
    select(
        TotalSleepRecords, 
        TotalMinutesAsleep, 
        TotalTimeInBed
    ) %>% 
    summary()

What's the relationship between steps taken in a day and sedentary minutes?

In [None]:
plot(daily_activity$TotalSteps, daily_activity$SedentaryMinutes,
     main = "Total Steps vs Sedentary Minutes",
     xlab = "Total Steps",
     ylab = "Sedentary Minutes",
     pch = 19,      # solid circle
     col = "blue")  # point color

What's the relationship between minutes asleep and time in bed?

In [None]:
plot(sleep_day$TotalMinutesAsleep, sleep_day$TotalTimeInBed,
     main = "Total Minutes Asleep vs Total Time in Bed",
     xlab = "Total Minutes Asleep",
     ylab = "Total Time in Bed",
     pch = 19,      # solid circles
     col = "darkgreen")  # optional color

## Analysis Summary

**Daily Activity Data:**

* **Total Steps:** The average is 7,638 steps per day, with a median of 7,406. This suggests a moderately active population; a few very high step counts (up to 36,019) pull the mean above the median.
* **Total Distance:** Participants cover an average of 5.49 km daily, consistent with moderate activity levels.
* **Sedentary Minutes:** On average, individuals are sedentary for 991 minutes (about 16.5 hours) per day, with a median of 1,057.5 minutes (about 17.6 hours), highlighting a key opportunity for health improvement.

**Sleep Data:**

* **Total Sleep Records:** Most users have one sleep record per day (with Q1, Q2, and Q3 all at 1), though some log up to 3, likely capturing naps or fragmented sleep.
* **Total Minutes Asleep:** Users sleep an average of 419.5 minutes (just under 7 hours), which sits at the very bottom of the recommended 7–9 hours for adults.
* **Total Time in Bed:** The average time in bed is 458.6 minutes, about 39 minutes more than the average time asleep. The difference reflects time spent falling asleep, waking during the night, or relaxing.


**Overall Implications for Bellabeat:**

* **Sedentary Behavior is a Concern:** High sedentary time highlights an opportunity for Bellabeat to promote more movement through features or targeted campaigns.
* **Sleep is Adequate, but Improvable:** Average sleep duration is acceptable but could be better. Bellabeat could support users with tools like sleep tracking, personalized tips, and relaxation features.
* **Activity Levels Vary Widely:** Given the range in activity, personalized insights and recommendations can help users stay engaged and improve their habits.
* **Napping as a Feature:** Multiple daily sleep records suggest interest in napping. Bellabeat could highlight nap benefits or enhance nap detection.

**Relationship Between Steps and Sedentary Minutes:**

* **Negative Correlation:** As step count increases, sedentary minutes generally decrease—an expected pattern showing more active users sit less.
* **Stronger Impact at Lower Step Counts:** The correlation is clearer below 10,000 steps, where increasing activity more significantly reduces sedentary time.
* **Weaker Link at Higher Step Counts:** Above 10,000 steps, the trend weakens—some highly active users still log high sedentary minutes.
* **Possible Plateau Effect:** Sedentary time rarely seems to drop below roughly 500 minutes, regardless of step count, suggesting a limit to how much additional movement reduces sitting time.
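
The observations above come from reading the scatter plot. One way to put numbers on them is to compute the Pearson correlation overall and separately on each side of the 10,000-step mark. The sketch below uses `activity_toy`, a made-up data frame with the same column names as `daily_activity`; its values are invented to mimic the described shape and are not taken from the dataset.

```r
# Toy stand-in for daily_activity; the values are invented for illustration.
activity_toy <- data.frame(
    TotalSteps = c(1000, 3000, 5000, 7000, 9000, 11000, 13000, 15000),
    SedentaryMinutes = c(1300, 1200, 1050, 900, 800, 780, 820, 760)
)

# Pearson correlation across all rows.
cor_all <- cor(activity_toy$TotalSteps, activity_toy$SedentaryMinutes)

# Same statistic within each step range; 10,000 is the cut-off used in the text.
low <- activity_toy[activity_toy$TotalSteps < 10000, ]
high <- activity_toy[activity_toy$TotalSteps >= 10000, ]
cor_low <- cor(low$TotalSteps, low$SedentaryMinutes)
cor_high <- cor(high$TotalSteps, high$SedentaryMinutes)

print(c(all = cor_all, below_10k = cor_low, above_10k = cor_high))
```

Running the same three `cor()` calls on `daily_activity` would show whether the real data supports a stronger link below 10,000 steps than above it.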

**Relationship Between Minutes Asleep and Time in Bed:**

* **Strong Positive Correlation:** More time in bed generally goes with more sleep, as expected, showing a clear link between 'TotalTimeInBed' and 'TotalMinutesAsleep'.
* **Linear Relationship:** The trend is fairly linear, suggesting consistent gains in sleep with increased time in bed.
* **Gap Between Bedtime and Sleep:** Time in bed typically exceeds time asleep, reflecting time spent falling asleep, waking up, or simply resting.
* **Sleep Efficiency Varies:** The spread in data shows differences in sleep efficiency—some users sleep efficiently, while others need more time in bed, possibly due to habits or sleep issues.
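
Sleep efficiency can be made explicit as the share of time in bed that is spent asleep. The sketch below computes it per record and then averages it per participant. The data frame `sleep_toy` and the column names `SleepEfficiency` and `MinutesAwakeInBed` are hypothetical; only `Id`, `TotalMinutesAsleep`, and `TotalTimeInBed` come from `sleep_day`, and the values are invented.

```r
# Toy stand-in for sleep_day; the values are invented for illustration.
sleep_toy <- data.frame(
    Id = c(1, 1, 2, 2),
    TotalMinutesAsleep = c(420, 400, 300, 330),
    TotalTimeInBed = c(450, 440, 400, 420)
)

# Share of time in bed spent asleep, and minutes spent awake in bed.
sleep_toy$SleepEfficiency <- sleep_toy$TotalMinutesAsleep / sleep_toy$TotalTimeInBed
sleep_toy$MinutesAwakeInBed <- sleep_toy$TotalTimeInBed - sleep_toy$TotalMinutesAsleep

# Average efficiency per participant.
efficiency_by_user <- aggregate(SleepEfficiency ~ Id, data = sleep_toy, FUN = mean)
print(efficiency_by_user)
```

A per-participant average like this is one possible way to separate efficient sleepers from those who need more time in bed.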

## Key Findings

* **Personalized Marketing is Essential:** Avoid generic messaging—tailor marketing, product suggestions, and content to individual user data like activity levels and sleep patterns.
* **Target Key Pain Points:** Address the unique needs of different user segments:
    * **Highly Sedentary:** "Feeling sluggish? A little movement goes a long way."
    * **Inefficient Sleepers:** "Struggling to fall asleep? Try personalized sleep solutions."
    * **Short Sleepers:** "Make every minute of sleep count."
* **Highlight Relevant Features:** Showcase features based on user needs:
    * **Sedentary Users:** Step tracking, movement reminders, gamified challenges.
    * **Inefficient Sleepers:** Sleep insights, guided meditations, ambient sound detection.
    * **All Users:** Holistic health tracking through data integration.
* **Showcase Data Value:** Emphasize how Bellabeat empowers users with meaningful health insights—visuals like step vs. sedentary time or time in bed vs. sleep can reinforce this.
* **Use Social Proof:** Promote testimonials from users who’ve improved their activity or sleep, reinforcing product credibility and effectiveness.
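
The user segments named above could be assigned with a few simple rules. The following is only a sketch: the data frame `users_toy`, its `Avg...` columns, the `Segment` labels, and all three cut-offs are placeholders chosen for illustration, not values derived from the Fitbit data.

```r
# Toy per-user averages; the values are invented for illustration.
users_toy <- data.frame(
    Id = c(1, 2, 3, 4),
    AvgSedentaryMinutes = c(1100, 700, 800, 750),
    AvgMinutesAsleep = c(450, 440, 380, 460),
    AvgTimeInBed = c(480, 560, 410, 490)
)

# Placeholder cut-offs (assumptions, not derived from the dataset).
sedentary_cutoff <- 1000    # sedentary minutes per day
efficiency_cutoff <- 0.85   # minutes asleep divided by minutes in bed
short_sleep_cutoff <- 420   # minutes asleep, i.e. 7 hours

efficiency <- users_toy$AvgMinutesAsleep / users_toy$AvgTimeInBed

# Assign the first matching segment; the order of the tests sets the priority.
users_toy$Segment <- ifelse(
    users_toy$AvgSedentaryMinutes > sedentary_cutoff, "Highly Sedentary",
    ifelse(efficiency < efficiency_cutoff, "Inefficient Sleeper",
    ifelse(users_toy$AvgMinutesAsleep < short_sleep_cutoff, "Short Sleeper",
           "General"))
)

print(users_toy[, c("Id", "Segment")])
```

Because each user receives only the first matching label, a user who is both sedentary and a short sleeper lands in the sedentary group. A real campaign might prefer overlapping segments.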

## Recommendations

1. **Bellabeat "Micro-Movement Motivator":** To combat high sedentary time, this feature encourages small, frequent movements throughout the day.
    * **Key Features:** Timed movement reminders, mini challenges (e.g., “Walk to the kitchen” or “Do 5 squats”), and optional smart desk integration.
    * **Marketing Focus:** Target highly sedentary users by emphasizing ease and accessibility.
    * **Why It Works:** Breaks inactivity into manageable actions, making it easier for users to stay active.
2. **Bellabeat "Sleep Efficiency Enhancer":** Designed to reduce the gap between time in bed and actual sleep, this tool helps users improve sleep quality.
    * **Key Features:** Ambient noise tracking, guided relaxation, personalized schedules, and smart home integration.
    * **Marketing Focus:** Appeal to inefficient sleepers by promoting better rest through environment and habit optimization.
    * **Why It Works:** Addresses a specific sleep issue with personalized, data-driven support.
3. **Bellabeat "Nap Navigator":** With users logging multiple sleep sessions, this feature detects naps and offers personalized insights.
    * **Key Features:** Automatic nap detection, smart nap timing recommendations, and nap quality analytics.
    * **Marketing Focus:** Target short or inefficient sleepers by highlighting the benefits of smart napping.
    * **Why It Works:** Enhances an existing behavior while supporting better overall rest and recovery.

## Next Steps

* **Investigate Sleep Inefficiency:** Use surveys or focus groups to identify lifestyle factors (e.g., stress, screen time) that contribute to poor sleep.
* **Analyze Nap Impact:** Study how napping affects sedentary behavior, sleep quality, and overall activity to inform feature development.
* **Explore Sleep–Activity Links:** Examine correlations between sleep and physical activity to enable more personalized health insights.
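
A first step toward exploring sleep and activity together is to join the two tables on participant and date. The sketch below assumes the date strings look like `4/12/2016` in 'ActivityDate' and `4/12/2016 12:00:00 AM` in 'SleepDay'; that format, the toy data frames, and the names `Date` and `combined` are assumptions for illustration.

```r
# Toy stand-ins for daily_activity and sleep_day; the values are invented.
activity_toy <- data.frame(
    Id = c(1, 1, 2),
    ActivityDate = c("4/12/2016", "4/13/2016", "4/12/2016"),
    TotalSteps = c(8000, 4000, 12000)
)
sleep_toy <- data.frame(
    Id = c(1, 2),
    SleepDay = c("4/12/2016 12:00:00 AM", "4/12/2016 12:00:00 AM"),
    TotalMinutesAsleep = c(410, 450)
)

# Parse both date columns into Date objects so they can be matched.
# Characters after the date part of SleepDay are ignored by the format.
activity_toy$Date <- as.Date(activity_toy$ActivityDate, format = "%m/%d/%Y")
sleep_toy$Date <- as.Date(sleep_toy$SleepDay, format = "%m/%d/%Y")

# Inner join: keeps only participant-days present in both tables.
combined <- merge(activity_toy, sleep_toy, by = c("Id", "Date"))

print(combined[, c("Id", "Date", "TotalSteps", "TotalMinutesAsleep")])
```

Passing `all = TRUE` to `merge()` would instead keep every participant-day from either table, which matters here because fewer participants appear in the sleep data than in the activity data.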