![gym](gym.png)


You are a product manager for a fitness studio and are interested in understanding the current demand for digital fitness classes. You plan to conduct a market analysis in Python to gauge demand and identify potential areas for growth of digital products and services.

### The Data

You are provided with several CSV files in the "Files/data" folder. They contain worldwide and country-level Google Trends data for keyword searches about fitness and related products.

### workout.csv

| Column     | Description              |
|------------|--------------------------|
| `'month'` | Month when the data was measured. |
| `'workout_worldwide'` | Index representing the popularity of the keyword 'workout', on a scale of 0 to 100. |

### three_keywords.csv

| Column     | Description              |
|------------|--------------------------|
| `'month'` | Month when the data was measured. |
| `'home_workout_worldwide'` | Index representing the popularity of the keyword 'home workout', on a scale of 0 to 100. |
| `'gym_workout_worldwide'` | Index representing the popularity of the keyword 'gym workout', on a scale of 0 to 100. |
| `'home_gym_worldwide'` | Index representing the popularity of the keyword 'home gym', on a scale of 0 to 100. |

### workout_geo.csv

| Column     | Description              |
|------------|--------------------------|
| `'country'` | Country where the data was measured. |
| `'workout_2018_2023'` | Index representing the popularity of the keyword 'workout' during the 5-year period. |

### three_keywords_geo.csv

| Column     | Description              |
|------------|--------------------------|
| `'Country'` | Country where the data was measured. |
| `'home_workout_2018_2023'` | Index representing the popularity of the keyword 'home workout' during the 5-year period. |
| `'gym_workout_2018_2023'` | Index representing the popularity of the keyword 'gym workout' during the 5-year period. |
| `'home_gym_2018_2023'` | Index representing the popularity of the keyword 'home gym' during the 5-year period. |

In [67]:
# Import the necessary libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

In [68]:
# Load the worldwide monthly index for the keyword 'workout'
workout = pd.read_csv('./data/workout.csv')
workout.head(3)

,month,workout_worldwide
0,2018-03,59
1,2018-04,61
2,2018-05,57


In [69]:
workout_geo = pd.read_csv('./data/workout_geo.csv')
workout_geo.head(3)

,country,workout_2018_2023
0,Guam,
1,Falkland Islands (Islas Malvinas),
2,Cook Islands,


In [70]:
three_keywords = pd.read_csv('./data/three_keywords.csv')
three_keywords.head(3)

,month,home_workout_worldwide,gym_workout_worldwide,home_gym_worldwide
0,2018-03,12,16,10
1,2018-04,12,18,10
2,2018-05,13,16,9


In [71]:
three_keywords_geo = pd.read_csv('./data/three_keywords_geo.csv')
three_keywords_geo.head(3)

,Country,home_workout_2018_2023,gym_workout_2018_2023,home_gym_2018_2023
0,Gibraltar,,,
1,Lesotho,,,
2,Guam,,,
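
The previews above show countries with no values at all, and a `Country` column whose capitalization differs from the `country` column in `workout_geo`. One way to tidy this is sketched below. The `tidy_geo` helper and the sample frame are assumptions introduced for illustration (the values are invented, and "Exampleland" and "Testonia" are not real entries in the data):

```python
import numpy as np
import pandas as pd

def tidy_geo(df: pd.DataFrame) -> pd.DataFrame:
    """Lower-case column names and drop countries with no data at all."""
    out = df.rename(columns=str.lower)
    value_cols = [c for c in out.columns if c != "country"]
    # Keep a row if at least one keyword column has a value.
    return out.dropna(subset=value_cols, how="all").reset_index(drop=True)

# Hypothetical sample; the values are invented for illustration only.
sample = pd.DataFrame({
    "Country": ["Gibraltar", "Exampleland", "Testonia"],
    "home_workout_2018_2023": [np.nan, 40.0, np.nan],
    "gym_workout_2018_2023": [np.nan, 35.0, 20.0],
})
clean = tidy_geo(sample)
print(clean)
```

Dropping only the rows where every keyword column is missing keeps partially populated countries available for single-keyword comparisons.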


Year in which global searches for 'workout' peaked

In [72]:
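# Row(s) where the worldwide 'workout' index reaches its maximum
max_month = workout[workout['workout_worldwide'] == workout['workout_worldwide'].max()]
# Select the month by column name; positional Series indexing ([0]) is deprecated in recent pandas
year = max_month['month'].iloc[0]
# 'month' is a 'YYYY-MM' string, so the first four characters are the year
year_str = year[:4]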

print(type(year_str))
print(year_str)

<class 'str'>
2020
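
The peak month is a single data point, so it can be worth cross-checking it against yearly averages. The sketch below assumes `month` strings in `YYYY-MM` form, as in the preview of `workout.csv`. The sample values are invented, and `yearly_mean` is a name introduced here rather than part of the original analysis:

```python
import pandas as pd

# Hypothetical sample in the same shape as workout.csv (values invented).
sample = pd.DataFrame({
    "month": ["2019-11", "2019-12", "2020-03", "2020-04", "2021-01"],
    "workout_worldwide": [50, 48, 80, 100, 70],
})

# The first four characters of a 'YYYY-MM' string are the year.
yearly_mean = (
    sample.assign(year=sample["month"].str[:4])
          .groupby("year")["workout_worldwide"]
          .mean()
)
print(yearly_mean)
# Label (year) of the highest yearly average
print(yearly_mean.idxmax())
```

If the year with the highest average differs from the year of the single peak month, the peak was probably a short spike rather than a sustained shift.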


Most popular keyword during the pandemic (peak_covid) and now (current)

In [73]:
# Sum each keyword's index over 2020; 'YYYY-MM' strings compare chronologically
then = three_keywords[(three_keywords['month'] > '2019-12') & (three_keywords['month'] < '2021-01')].drop(columns='month').sum()
peak_covid = then.idxmax()

# Same calculation for 2023 onward
now = three_keywords[three_keywords['month'] > '2022-12'].drop(columns='month').sum()
current = now.idxmax()

print(peak_covid)
print(current)

home_workout_worldwide
gym_workout_worldwide
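
The two period comparisons follow the same pattern, so they could be wrapped in a small helper. This is a sketch under stated assumptions: `top_keyword` is a hypothetical function name, the sample values are invented, and the period bounds are inclusive, unlike the strict comparisons used in the cell above:

```python
import pandas as pd

def top_keyword(df: pd.DataFrame, start: str, end: str) -> str:
    """Return the keyword column with the largest summed index in [start, end]."""
    # 'YYYY-MM' strings sort chronologically, so string comparison is safe here.
    in_period = df[df["month"].between(start, end)]
    totals = in_period.drop(columns="month").sum()
    return totals.idxmax()

# Hypothetical sample shaped like three_keywords.csv (values invented).
sample = pd.DataFrame({
    "month": ["2020-04", "2020-05", "2023-01", "2023-02"],
    "home_workout_worldwide": [50, 40, 10, 11],
    "gym_workout_worldwide": [8, 9, 20, 21],
    "home_gym_worldwide": [25, 24, 12, 12],
})
print(top_keyword(sample, "2020-01", "2020-12"))
print(top_keyword(sample, "2023-01", "2023-12"))
```

Passing the bounds as arguments makes it easy to test how sensitive the result is to where the "pandemic" window starts and ends.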


Country with the highest interest in workouts among the United States, Australia, and Japan

In [74]:
# Restrict to the three countries of interest
c = workout_geo[workout_geo['country'].isin(['United States', 'Australia', 'Japan'])]
# Look up the country name in the row with the highest index
top_country = c.loc[c['workout_2018_2023'].idxmax(), 'country']
top_country

'United States'

Country with the highest interest in home workouts between the Philippines and Malaysia

In [75]:
# Restrict to the two countries of interest (this file uses a capitalized 'Country' column)
two_countries = three_keywords_geo[three_keywords_geo['Country'].isin(['Philippines', 'Malaysia'])]
# Look up the country name in the row with the highest 'home workout' index
home_workout_geo = two_countries.loc[two_countries['home_workout_2018_2023'].idxmax(), 'Country']

home_workout_geo

'Philippines'
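
Both country comparisons share one pattern: filter to a shortlist, then take the row with the highest value. A possible generalization is sketched below. The helper name `highest_interest_country`, its parameters, and the sample frame are assumptions made for illustration (the country names and values are invented):

```python
from typing import Sequence

import numpy as np
import pandas as pd

def highest_interest_country(df: pd.DataFrame, countries: Sequence[str],
                             value_col: str, country_col: str = "country") -> str:
    """Return the country in `countries` with the highest `value_col`."""
    # Ignore shortlisted countries that have no value for this keyword.
    subset = df[df[country_col].isin(countries)].dropna(subset=[value_col])
    if subset.empty:
        raise ValueError("none of the requested countries has data")
    # idxmax gives the row label of the maximum; use it to look up the name.
    return subset.loc[subset[value_col].idxmax(), country_col]

# Hypothetical sample; names and values are invented for illustration only.
sample = pd.DataFrame({
    "Country": ["Aland", "Bland", "Cland"],
    "home_workout_2018_2023": [30.0, np.nan, 45.0],
})
print(highest_interest_country(
    sample, ["Aland", "Bland"], "home_workout_2018_2023", country_col="Country"
))
```

The `country_col` parameter accommodates the differing column names (`country` in `workout_geo`, `Country` in `three_keywords_geo`), and the explicit error avoids silently returning a result when every shortlisted country is missing data.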