The data set used for this analysis is available on Kaggle
Analyze FitBit fitness tracker data to gain insights into how consumers are using the FitBit app and discover trends for marketing strategy.
- Data is publicly available on Kaggle: FitBit Fitness Tracker Data and stored in 18 csv files.
- Generated by respondents from a survey via Amazon Mechanical Turk between 12 March 2016 to 12 May 2016.
- 30 FitBit users consented to the submission of personal tracker data.
- Data collected includes physical activity recorded in minutes, heart rate, sleep monitoring, daily activity, steps and calories.
A good dataset must ROCCC or in other words must be Reliable, Original, Comprehensive, Current, Cited.
- Reliable: LOW - Not reliable as it only has 30 respondents
- Original: LOW - Generated by Third Party (Amazon Mechanical Turk)
- Comprehensive: MED - Data minute-level output for physical activity, heart rate, and sleep monitoring. While the data tracks many factors in the user activity and sleep, but the sample size is small and most data is recorded during certain days of the week.
- Current: LOW - Dataset is almost 7 years old and might not be relevant
- Cited: LOW - Data collected from third party, hence unknown
Hence this dataset is considered bad quality and must not be used of making any business decisions.
Python is used for data cleaning, transformation and visulization
- dailyActivity_merged.csv
- sleepDay_merged.csv
- hourlyIntensities_merged.csv
There are 33 unique Ids avaialable in the csv file. NO NULL values
- On average, users logged 7,637 steps or 5.4km which is not adequate. As recommended by CDC, an adult female has to aim at least 10,000 steps or 8km per day to benefit from general health, weight loss and fitness improvement.
- Average sedentary time is 991 minutes or 16 hours which is a lot!
- Average calories burned is 2,303 calories equivalent to 0.6 pound. Details cannot be adeuqately interpreted as calories burned depend on several factors such as the age, weight, daily tasks, exercise, hormones and daily calorie intake.
- From the Histogram above is is discovered that users prefer or remember to track their activity on the app during midweek from Tuesday to Friday.
- The frequency dropped on Friday and continue on weekends and Monday.
- There is a positive correlation seen in the calories vs steps taken visulization above.
- The intensity of calories burnt is more between 0 to 15,000 steps taken as compared to higher value of steps
Calories burnt every hours logged weak positive relationship, this maybe be due to higher sedentary hours. According to the pie chart sedentary times accounts for 81% of the total time logged. This could highly be because the users might be utilizing the fitness band for daily commute, inactive movements (moving from one spot to another) or running errands and lesser for tracking their fitness.
Sedentary time also shows the negative linear correlation with total minutes asleep. This shows that higher sedentary minutes accounts for lesser sleep.
After visualizing Total Intensity hourly, it is found out that people are more active between 5 am and 10pm.
As per the analysis, the markteing team of any other fitness brands which have the similar devices can benefit from these findings:
- The marketing team can encourage users by enlightening and empowering them with knowledge about the advantages of fitness, suggesting various forms of exercise (for example, a straightforward 10 minute exercise on weekdays and a more intense exercise on weekends), and providing information on calorie intake and burn rate on the thier respective app.
- The company's app can also send out notifications on the weekends to motivate users to work out.
- If the users wants to improve their sleep they can consider reducing the sedenatary hours and increase in time on bed hours. The company app can send notifications to go to sleep earlier.
- Since most activity happens between 5 pm and 7 pm it is safe to assume that people go to a gym or for a walk after finishing work. This time frame can be utilized by the companies to remind and motivate users to go for a run or walk.