## Exploratory Data Analysis of Fitness & Workout Dataset
---

### Author: Caitlin Driscoll
### Date: October 2025
### Source: Kaggle
---

In [16]:
# Fitness Dataset Exploratory Data Analysis and Recommendation System

## 1. Import Libraries and Load Data

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.preprocessing import LabelEncoder, StandardScaler
from sklearn.metrics.pairwise import cosine_similarity
from sklearn.feature_extraction.text import TfidfVectorizer
import warnings
warnings.filterwarnings('ignore')

# Set display options
pd.set_option('display.max_columns', None)
pd.set_option('display.max_colwidth', 50)

# Load the dataset
df = pd.read_csv('../Data/fitness_and_workout_dataset.csv')

print("Dataset shape:", df.shape)
print("\nFirst few rows:", df.head(10))

## 2. Data Overview and Cleaning

# Check data types and missing values
# df.info() prints its summary directly and returns None, so call it on its own line
print("Data Info:")
df.info()

print("\nMissing values:", df.isnull().sum())

print("\nBasic statistics:", df.describe())

# Convert date columns to datetime
df['created'] = pd.to_datetime(df['created'])
df['last_edit'] = pd.to_datetime(df['last_edit'])

## 3. Exploratory Data Analysis

# Unique values in categorical columns
print("Unique Levels:", df['level'].unique())
print("\nUnique Goals:", df['goal'].unique())
print("\nTop 10 equipment types by count:", df['equipment'].value_counts().head(10))
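
The date conversion in the cleaning step raises an error if any value in `created` or `last_edit` cannot be parsed. A minimal sketch of a more defensive variant follows, assuming it is acceptable to turn unparseable values into `NaT` and count them. The helper name `to_datetime_report` and the toy values are hypothetical and are not part of the original notebook; only the two column names come from it.

```python
import pandas as pd

# Toy stand-in for the real data. The column names match the notebook,
# but the values are made up for illustration.
toy = pd.DataFrame({
    "created": ["2023-01-15 10:30:00", "not a date", None],
    "last_edit": ["2023-02-01 08:00:00", "2023-03-10 12:45:00", "2023-13-45"],
})


def to_datetime_report(frame, columns):
    """Convert the given columns to datetime, coercing bad values to NaT.

    Returns a converted copy of the frame and a dict mapping each column
    to the number of non-null values that could not be parsed.
    (Hypothetical helper, not from the original notebook.)
    """
    out = frame.copy()
    failed = {}
    for col in columns:
        parsed = pd.to_datetime(out[col], errors="coerce")
        # A value counts as a parse failure only if it was present before
        # conversion and is missing afterwards.
        failed[col] = int((parsed.isna() & out[col].notna()).sum())
        out[col] = parsed
    return out, failed


converted, failed = to_datetime_report(toy, ["created", "last_edit"])
print(converted.dtypes)
print("Unparseable values per column:", failed)
```

Counting failures separately from values that were already missing helps distinguish malformed timestamps from genuinely empty fields before deciding whether to drop or repair those rows.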



Dataset shape: (2598, 10)

First few rows:                                                title  \
0  (MASS MONSTER) High Intensity 4 Day Upper Lowe...   
1                         (NOT MY PROGRAM)SHJ Jotaro   
2  1 PowerLift Per Day Powerbuilding 5 Day Bro Split   
3                      10 Week Mass Building Program   
4                             10 week deadlift focus   
5                            100 Push Ups in 6 Weeks   
6                                      1000 lbs Club   
7                               100x Push Up’s / Day   
8                                       109273941442   
9                                 10x3 Powerbuilding   

                                         description  \
0  Build tones of muscular with this high intensi...   
1                            Build strength and size   
2  Based off of Andy Baker's KCS (Kingwood Streng...   
3  This workout is designed to increase your musc...   
4                                  Increase deadlift   
5  3