---
title: "Google Capstone Project"
author: "Chuah Wei Jie"
date: "6/6/2021"
output:
  word_document: default
  html_document: default
  pdf_document: default
---

```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```

## Install packages and load libraries
Note: This step installs and loads the packages required for the analysis

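```{r}
# Install only the packages that are missing, so knitting does not reinstall them on every run
pkgs <- c("tidyverse", "skimr", "janitor")
missing_pkgs <- setdiff(pkgs, rownames(installed.packages()))
if (length(missing_pkgs) > 0) install.packages(missing_pkgs)

library(tidyverse)  # attaches ggplot2, dplyr and readr
library(lubridate)  # installed as a tidyverse dependency
library(skimr)
library(janitor)
```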

## Load the relevant csv files
Note: The CSV files were cleaned before being loaded into R for analysis

```{r}
dailyActivity <- read_csv("dailyActivity_merged.csv")
sleepDay <- read_csv("sleepDay_merged.csv")
weightLogInfo <- read_csv("weightLogInfo_merged.csv")
timeIntensities <- read_csv("timeIntensities_merged.csv")
timeCalories <- read_csv("timeCalories_merged.csv")
timeSteps <- read_csv("timeSteps_merged.csv")
```

## Join tables
Note:  

* **time_intensities_calories_steps** contains average intensity, calories and steps at different times of the day  
* **weight_activity_sleep** contains daily records (activity, weight and sleep) for the 30 individuals

```{r}
# Daily tables: match each person's activity date to the weight-log and sleep-log dates
weight_activity <- full_join(dailyActivity, weightLogInfo,
                             by = c("Id" = "Id", "ActivityDate" = "Date"))
weight_activity_sleep <- full_join(weight_activity, sleepDay,
                                   by = c("Id" = "Id", "ActivityDate" = "SleepDay"))
# Time-of-day tables: all three share the Time column
time_intensities_calories <- full_join(timeIntensities, timeCalories, by = "Time")
time_intensities_calories_steps <- full_join(time_intensities_calories, timeSteps, by = "Time")
```
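
The daily joins match key columns that have different names on each side (for example `ActivityDate` and `Date`). The following is a minimal sketch of how `full_join` behaves in that case, using made-up toy tibbles; the names `activity` and `weight` and all values are illustrative assumptions, not taken from the dataset. Rows without a match on either side are kept and filled with `NA`.

```{r}
library(dplyr)

# Toy data invented for illustration only
activity <- tibble(
  Id = c(1, 1, 2),
  ActivityDate = as.Date(c("2016-04-12", "2016-04-13", "2016-04-12")),
  TotalSteps = c(5000, 7000, 3000)
)
weight <- tibble(
  Id = c(1, 3),
  Date = as.Date(c("2016-04-12", "2016-04-12")),
  WeightKg = c(70, 82)
)

# The left-hand key name (ActivityDate) is the one kept in the result
joined <- full_join(activity, weight, by = c("Id" = "Id", "ActivityDate" = "Date"))
joined

# Count the gaps the join introduced in each measurement column
colSums(is.na(joined[, c("TotalSteps", "WeightKg")]))
```

Checking the `NA` counts after each join is a cheap way to see how much of the combined table is actually populated before computing statistics on it.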

## Preview tables

```{r}
head(weight_activity_sleep)
head(time_intensities_calories_steps)
```

## Calories vs TotalSteps
Note: Relationship between steps taken in a day and calories burnt

* Correlation of 0.5929493 (moderate positive)
* The number of steps taken can serve as a rough gauge of the calories an individual burns in a day

```{r}
ggplot(data = weight_activity_sleep, mapping = aes(x = TotalSteps, y = Calories)) +
  geom_point()
cor(weight_activity_sleep$TotalSteps, weight_activity_sleep$Calories)
```
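
One way to turn the steps-to-calories relationship into an actual estimate is a simple linear regression. The sketch below uses a small made-up data frame (`toy`, with invented values) rather than the real data, so the fitted numbers mean nothing in themselves; with the real table, `data = weight_activity_sleep` would be passed instead.

```{r}
# Toy data invented for illustration only
toy <- data.frame(
  TotalSteps = c(2000, 4000, 6000, 8000, 10000, 12000),
  Calories   = c(1700, 1900, 2050, 2300, 2400, 2700)
)

# Fit Calories as a linear function of TotalSteps
fit <- lm(Calories ~ TotalSteps, data = toy)
coef(fit)  # intercept and calories per additional step

# Estimated calories for a hypothetical 9000-step day
estimate <- predict(fit, newdata = data.frame(TotalSteps = 9000))
estimate
```

A straight-line fit is only one possible choice here; with a correlation near 0.59, individual days can still deviate considerably from the fitted line.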

## Calories vs Sedentary Minutes
Note: Relationship between sedentary minutes and calories burnt

* Correlation of -0.1117858 (weak negative)
* More sedentary time is associated with slightly fewer calories burnt, but the relationship is weak

```{r}
ggplot(data = weight_activity_sleep, mapping = aes(x = SedentaryMinutes, y = Calories)) +
  geom_point()
cor(weight_activity_sleep$SedentaryMinutes, weight_activity_sleep$Calories)
```
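
To put the two correlations reported so far on a common scale, it can help to square them: for a simple linear fit with one predictor, the squared correlation is the share of variance in calories that the predictor accounts for. The short calculation below uses only the two coefficients stated in this report; the variable names are illustrative.

```{r}
# Correlation coefficients reported in the sections above
r_steps <- 0.5929493
r_sedentary <- -0.1117858

# Squared correlation = share of variance explained by a one-predictor linear fit
r_squared <- c(steps = r_steps^2, sedentary = r_sedentary^2)
round(r_squared, 4)
```

On this reading, total steps account for roughly a third of the variation in daily calories, while sedentary minutes account for about one percent.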

## Weight vs Total sleep time
Note: Relationship between amount of sleep and weight of individual  

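* By default `cor()` returns NA here, because the full join leaves missing values in `WeightKg` and `TotalMinutesAsleep`; the chunk therefore uses `use = "complete.obs"` to compute the correlation over rows where both values are recorded
* An NA result does not show that weight and sleep are uncorrelated; rows with both values may be few, so any conclusion drawn from them is tentative

```{r}
ggplot(data = weight_activity_sleep, mapping = aes(x = TotalMinutesAsleep, y = WeightKg)) +
  geom_point()
# Drop rows where either value is missing before computing the correlation
cor(weight_activity_sleep$TotalMinutesAsleep, weight_activity_sleep$WeightKg,
    use = "complete.obs")
```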
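
`cor()` returns `NA` whenever either input contains a missing value, which is common after a full join. The following sketch, using two made-up vectors (`sleep` and `weight` are illustrative names and values), shows the default behaviour next to `use = "complete.obs"`, which restricts the calculation to pairs where both values are present.

```{r}
# Toy vectors invented for illustration; each has one missing value
sleep  <- c(1, 2, 3, 4, NA)
weight <- c(2, 4, 6, NA, 10)

# Default: any missing value makes the result NA
r_default <- cor(sleep, weight)
r_default

# Only the three complete pairs (1,2), (2,4), (3,6) are used
r_complete <- cor(sleep, weight, use = "complete.obs")
r_complete

# How many pairs actually contributed
n_complete <- sum(complete.cases(sleep, weight))
n_complete
```

Reporting the number of complete pairs alongside the coefficient makes it clear how much data the estimate rests on.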

## Level of Intensity of activities over the course of a day
Note: Shows when people are more likely to engage in more intense activities

* People are more likely to engage in intense activities between 5 and 7 pm (after work)

```{r}
ggplot(data = time_intensities_calories_steps, mapping = aes(x = Time, y = AverageTotalIntensity)) +
  geom_line()
```
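
The peak period can also be read off the table directly rather than from the line chart. The sketch below assumes a hypothetical table `hourly` with one row per hour; the `Hour` column and all values are invented for illustration, and only the column name `AverageTotalIntensity` comes from this report. It picks the three hours with the highest average intensity.

```{r}
library(dplyr)

# Toy hourly averages invented for illustration only
hourly <- tibble(
  Hour = 0:23,
  AverageTotalIntensity = c(2, 1, 1, 0.5, 1.5, 5, 8, 11, 15, 15.5, 17, 17.5,
                            20, 19, 18.5, 16, 17.2, 21, 22, 20.5, 14, 12, 9, 4)
)

# Keep the three highest-intensity hours, then list them in clock order
peak <- hourly %>%
  slice_max(AverageTotalIntensity, n = 3) %>%
  arrange(Hour)
peak
```

With the real data the same call would be applied to `time_intensities_calories_steps`, using its `Time` column in place of the hypothetical `Hour`.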

## Amount of calories burnt over the course of a day
Note: Shows when people are more likely to be active or exercising

* People are generally more likely to exercise between 5 and 7 pm (after work)

```{r}
ggplot(data = time_intensities_calories_steps, mapping = aes(x = Time, y = AverageCalories)) +
  geom_line()
```