# An Exploratory Data Analysis on Habits

### Introduction
### Problem Statement
### Project Description
James Clear, author of the critically acclaimed productivity book Atomic Habits, declares that habits are the compound interest of self-improvement. This exploratory data analysis (EDA) focuses on Val's habits during the second semester of his sophomore year. The goal is to extract data-driven insights that will be useful for the first semester of his junior year. Specifically, the following datasets are used:
- `habits`: Sourced from a Google Form Val made that records daily habits such as the number of times he brushed his teeth, minutes meditated, and overall mood.
- `exercise`: Sourced from Val's Strava account, exported with a third-party tool rather than by calling Strava's API directly. Strava is an app that lets users record their exercise data.
- `steps`: Sourced from Val's Samsung Galaxy Watch 6 through the Samsung Health app.
- `sleep`: Sourced from Val's Samsung Galaxy Watch 6 through the Samsung Health app.
- `events`: A table containing significant academic or extracurricular events Val participated in.

## Step 0: Imports and Reading Data

In [3]:
# Packages
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import plotly
import requests

In [4]:
# Styling and preferences
plt.style.use('ggplot')

In [12]:
# For manually created CSV data
habits = pd.read_csv('data/habits.csv')
events = pd.read_csv('data/events.csv')

In [17]:
# For automated CSV data
steps = pd.read_csv('data/steps-converted.csv')
sleep = pd.read_csv('data/sleep-converted.csv')
exercise = pd.read_csv('data/exercise.csv')

- I downloaded my health data through the Samsung Health app. The export consists of JSON files and many CSV files. I used `com.samsung.shealth.tracker.pedometer_day_summary.[date of download].csv` for steps and `com.samsung.shealth.sleep.[date of download].csv` for sleep.
- Calling `pd.read_csv` on these files raised a `ParserError`, so I first imported them into Google Sheets, removed the first row, and downloaded them again as CSV files. Those re-exported files are the `-converted` files read above.
- I downloaded my Strava exercise data through https://entorb.net/strava/. The tool saved me the trouble of accessing Strava's API myself.
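
As a possible alternative to the Google Sheets round trip, `pd.read_csv` can skip the first row itself through its `skiprows` parameter. The sketch below assumes the `ParserError` came from a leading metadata line with fewer fields than the rest of the file. The sample text, its column names, and the helper `read_samsung_csv` are hypothetical and only illustrate the idea; they are not taken from the actual export.

```python
import io

import pandas as pd

# Hypothetical stand-in for a Samsung Health export: the first line is
# metadata rather than a header, so it has fewer fields than the rows below.
raw = io.StringIO(
    "com.samsung.shealth.sleep,123,4\n"
    "start_time,end_time,efficiency,sleep_score\n"
    "2024-01-15 23:10:00,2024-01-16 06:40:00,91,80\n"
    "2024-01-16 23:45:00,2024-01-17 07:05:00,88,76\n"
)


def read_samsung_csv(source):
    """Read a Samsung Health-style CSV, skipping the leading metadata line."""
    # skiprows=1 drops the first physical line, so the second line
    # is used as the header row.
    return pd.read_csv(source, skiprows=1)


sleep_demo = read_samsung_csv(raw)
print(sleep_demo.columns.tolist())
```

With the real files, the same call would take the path to the original export instead of the in-memory sample. If the data rows also end in a trailing comma, passing `index_col=False` to `pd.read_csv` is the option pandas documents for that case, though I have not verified it against these particular files.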