NOTEBOOK 1: Exploratory Data Analysis

# Sleep Data EDA (Fitbit + Apple Health)

This notebook performs initial exploratory data analysis on the cleaned sleep datasets.
We analyze:

- Sleep duration trends
- Sleep score distributions
- Bedtime/wakeup behavior
- Apple deep/light sleep structure
- Heart-rate patterns

In [None]:
# CELL 1: import libraries and load datasets
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

fitbit = pd.read_csv("../data/clean/fitbit_clean.csv", parse_dates=["DATE"])
apple = pd.read_csv("../data/clean/apple_sleep_nightly_summary.csv", parse_dates=["DATE"])

In [None]:
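# CELL 2: show summary statistics
# A notebook cell renders only its last expression, so display each table explicitly.
from IPython.display import display

display(fitbit.describe())
display(apple.describe())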

In [None]:
# CELL 3: display fitbit trends
plt.plot(fitbit["DATE"], fitbit["HOURS_OF_SLEEP_HOURS"])
plt.title("Fitbit Sleep Duration Over Time")
plt.xlabel("Date")
plt.ylabel("Hours of sleep")
plt.show()

In [None]:
# CELL 4: display apple deep/light sleep trends
apple[["pct_deep","pct_light"]].hist(figsize=(10,4))
plt.show()

In [None]:
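# CELL 5: display correlations
# numeric_only=True skips non-numeric columns (e.g. DATE, BEDTIME), which raise an error in pandas >= 2.0.
sns.heatmap(fitbit.corr(numeric_only=True), annot=True)
plt.show()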

In [None]:
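# CELL 6: sleep routine distributions
# Extract the clock hour (0-23) of each bedtime.
fitbit["bedtime_hour"] = pd.to_datetime(fitbit["BEDTIME"]).dt.hour
# discrete=True gives one bar per integer hour.
sns.histplot(fitbit["bedtime_hour"], discrete=True)
plt.show()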
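
A caveat on the bedtime histogram: raw clock hours put 23:00 and 00:00 at opposite ends of the axis even though they are one hour apart, which inflates any spread computed from them. A minimal sketch of one possible workaround, assuming bedtimes fall between noon and the following noon, is to measure hours elapsed since noon. The helper name `hours_since_noon` and the sample timestamps are hypothetical and not taken from the dataset.

```python
import pandas as pd

def hours_since_noon(timestamps: pd.Series) -> pd.Series:
    """Hours elapsed since the most recent noon (23:30 -> 11.5, 00:30 -> 12.5)."""
    ts = pd.to_datetime(timestamps)
    clock = ts.dt.hour + ts.dt.minute / 60  # fractional clock hour, 0 <= clock < 24
    return (clock - 12) % 24                # wrap so midnight sits mid-scale

# Hypothetical bedtimes on either side of midnight (illustrative values only).
sample = pd.Series(pd.to_datetime([
    "2024-01-01 23:30",
    "2024-01-03 00:30",
    "2024-01-03 22:45",
]))

shifted = hours_since_noon(sample)
raw_hours = sample.dt.hour + sample.dt.minute / 60

print("Spread on raw clock hours:  ", raw_hours.std())
print("Spread on hours since noon: ", shifted.std())
```

On the real data, the same function could be applied to `fitbit["BEDTIME"]` before plotting or computing a standard deviation, provided the noon-to-noon assumption holds for the logged bedtimes.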


This notebook explores several views of the cleaned sleep logs as a first pass of exploratory data analysis (EDA).

It addresses the following questions:

- How long do you sleep?
- How much does your bedtime vary?
- How much deep sleep do you get?
- How do Fitbit and Apple compare?

Answering these questions helps frame how future sleep quality might be predicted.
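
The cells above do not yet compare the two devices directly. One way that comparison might be sketched is to join the tables on `DATE` and look at the per-night difference in sleep duration. The toy frames below stand in for the real files, and the Apple column name `total_sleep_hours` is an assumption; the actual column in `apple_sleep_nightly_summary.csv` may be named differently.

```python
import pandas as pd

# Toy stand-ins for the two cleaned tables; values are illustrative only.
fitbit_demo = pd.DataFrame({
    "DATE": pd.to_datetime(["2024-01-01", "2024-01-02", "2024-01-03"]),
    "HOURS_OF_SLEEP_HOURS": [7.0, 6.5, 8.0],
})
apple_demo = pd.DataFrame({
    "DATE": pd.to_datetime(["2024-01-02", "2024-01-03", "2024-01-04"]),
    "total_sleep_hours": [6.0, 8.5, 7.0],  # hypothetical column name
})

# An inner join keeps only the nights recorded by both devices.
both = fitbit_demo.merge(apple_demo, on="DATE", how="inner")

# Positive values mean Fitbit reported more sleep than Apple for that night.
both["diff_hours"] = both["HOURS_OF_SLEEP_HOURS"] - both["total_sleep_hours"]

print(both)
print("Mean Fitbit minus Apple (hours):", both["diff_hours"].mean())
```

An inner join is used here so that nights logged by only one device do not produce missing differences; an outer join with `indicator=True` would instead show which nights each device missed.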