# Exploratory Data Analysis of User Behavior Dataset


This notebook explores the **User Behavior Dataset**, focusing on device usage, battery drain, data usage,
and user demographics. It applies basic exploratory data analysis (EDA) to surface trends and relationships
in the data.

The notebook covers the following steps:
1. Data Overview
2. Descriptive Statistics
3. Data Visualization
4. Insights and Conclusion


## 1. Data Overview

In [None]:

# Import necessary libraries
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Load the dataset
data = pd.read_csv('/mnt/data/user_behavior_dataset.csv')

# Display the first few rows
data.head()


## 2. Descriptive Statistics

In [None]:

# Show column data types and non-null counts (a count below the row total indicates missing values)
data.info()

# Generate descriptive statistics for numerical columns
data.describe()
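
`data.info()` reports non-null counts, so missing values have to be inferred by comparing each count with the row total. The sketch below makes that check explicit. The helper name `missing_value_report` is hypothetical and not part of the original notebook; it assumes only that the input is a pandas DataFrame.

```python
import pandas as pd


def missing_value_report(df: pd.DataFrame) -> pd.DataFrame:
    """Return per-column missing counts, percentages, and dtypes.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    counts = df.isna().sum()
    report = pd.DataFrame({
        "missing_count": counts,
        "missing_pct": (counts / len(df) * 100).round(2),
        "dtype": df.dtypes.astype(str),
    })
    # Columns with the most missing values come first
    return report.sort_values("missing_count", ascending=False)
```

In this notebook it could be called as `missing_value_report(data)` right after loading the CSV.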


## 3. Data Visualization

In [None]:

# Distribution of App Usage Time
plt.figure(figsize=(10, 6))
sns.histplot(data['App Usage Time (min/day)'], kde=True, bins=30)
plt.title('Distribution of App Usage Time (min/day)')
plt.xlabel('App Usage Time (min/day)')
plt.ylabel('Count')
plt.show()
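
A histogram shows the shape of a distribution but not its exact location or skew. As a rough numeric companion to the plot above, the sketch below summarizes a single column. The helper name `distribution_summary` and the choice of the 5th and 95th percentiles are assumptions made for illustration, not something taken from the original notebook.

```python
import pandas as pd


def distribution_summary(values: pd.Series) -> pd.Series:
    """Summarize the center, spread, and skew of a numeric column.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    clean = values.dropna()
    return pd.Series({
        "mean": clean.mean(),
        "median": clean.median(),
        "q05": clean.quantile(0.05),
        "q95": clean.quantile(0.95),
        # Positive skew means a longer right tail, negative a longer left tail
        "skew": clean.skew(),
    })
```

For the column plotted above, the call would be `distribution_summary(data['App Usage Time (min/day)'])`.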


In [None]:

# Screen On Time vs Battery Drain
plt.figure(figsize=(10, 6))
sns.scatterplot(x='Screen On Time (hours/day)', y='Battery Drain (mAh/day)', hue='Operating System', data=data)
plt.title('Screen On Time vs Battery Drain')
plt.xlabel('Screen On Time (hours/day)')
plt.ylabel('Battery Drain (mAh/day)')
plt.legend(title='Operating System')
plt.show()
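
The scatter plot suggests visually whether screen-on time and battery drain move together, but it does not quantify the relationship. One way to do that is a Pearson correlation, computed overall and within each operating system. This is a minimal sketch: the helper name `correlation_by_group` is hypothetical, and Pearson correlation is one reasonable choice here, not the notebook's stated method.

```python
import pandas as pd


def correlation_by_group(df: pd.DataFrame, x: str, y: str, group: str):
    """Pearson correlation between x and y, overall and within each group.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    overall = df[x].corr(df[y])
    per_group = pd.Series(
        {name: g[x].corr(g[y]) for name, g in df.groupby(group)},
        name="pearson_r",
    )
    return overall, per_group
```

With the notebook's columns, the call would be `correlation_by_group(data, 'Screen On Time (hours/day)', 'Battery Drain (mAh/day)', 'Operating System')`.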


In [None]:

# Age Distribution by User Behavior Class
plt.figure(figsize=(10, 6))
sns.boxplot(x='User Behavior Class', y='Age', data=data)
plt.title('Age Distribution by User Behavior Class')
plt.xlabel('User Behavior Class')
plt.ylabel('Age')
plt.show()


In [None]:

# Data Usage by Gender
plt.figure(figsize=(10, 6))
sns.boxplot(x='Gender', y='Data Usage (MB/day)', data=data)
plt.title('Data Usage by Gender')
plt.xlabel('Gender')
plt.ylabel('Data Usage (MB/day)')
plt.show()
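
A box plot encodes the median and quartiles of each group, and reading those values off the axis is imprecise. The sketch below computes the same statistics as a table. The helper name `group_summary` is hypothetical and not part of the original notebook; it works for either box plot above by swapping the column names.

```python
import pandas as pd


def group_summary(df: pd.DataFrame, group: str, value: str) -> pd.DataFrame:
    """Count, quartiles, and interquartile range of `value` per `group`.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    grouped = df.groupby(group)[value]
    # One row per group, one column per requested quantile
    summary = grouped.quantile([0.25, 0.5, 0.75]).unstack()
    summary.columns = ["q1", "median", "q3"]
    summary["iqr"] = summary["q3"] - summary["q1"]
    summary.insert(0, "count", grouped.count())
    return summary
```

For example, `group_summary(data, 'Gender', 'Data Usage (MB/day)')` or `group_summary(data, 'User Behavior Class', 'Age')`.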


## 4. Insights and Conclusion


This exploratory data analysis examined four views of the dataset:
- The distribution of app usage time among users,
- The relationship between screen-on time and battery drain across operating systems,
- Age distributions for different user behavior classes,
- Data usage patterns by gender.

Together, these views provide a baseline understanding of user behavior that can inform further analysis
or the development of machine learning models.
