# Exploratory Data Analysis on Student Habits and Performance

In this notebook, we perform exploratory data analysis (EDA) on the student habits and performance dataset. We load the data, review summary statistics and missing values, and then visualize the distribution of the performance metric and the correlations between variables to understand how student habits relate to performance.

In [None]:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

# Load the dataset
df = pd.read_csv('../data/student_habits_performance.csv')

# Display the first few rows of the dataset
df.head()

In [None]:
# Summary statistics of the dataset
df.describe()
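
By default, `describe()` summarizes only the numeric columns, so any categorical habit columns are left out of the table above. The sketch below shows one way to summarize the non-numeric columns as well. It runs on a small made-up frame named `demo`; the column names `study_hours` and `part_time_job` are illustrative assumptions, not columns known to exist in the dataset.

```python
import pandas as pd

# Hypothetical stand-in for df; column names and values are made up for illustration
demo = pd.DataFrame({
    "study_hours": [2.0, 3.5, 1.0, 4.0],
    "part_time_job": ["No", "Yes", "No", "No"],
})

# Summarize only the non-numeric columns: count, unique values, most common value, its frequency
categorical_summary = demo.describe(exclude="number")
print(categorical_summary)
```

On the real data, the same call would be `df.describe(exclude="number")`, assuming the dataset contains at least one non-numeric column.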

In [None]:
# Check for missing values
missing_values = df.isnull().sum()
missing_values[missing_values > 0]
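
Raw counts of missing values are easier to judge next to the share of rows they represent. The following is a minimal sketch of such a summary, run on a small made-up frame named `demo`; the columns `study_hours` and `sleep_hours` are hypothetical and stand in for whatever columns the dataset actually has.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for df; column names and values are made up for illustration
demo = pd.DataFrame({
    "study_hours": [2.0, np.nan, 4.5, 3.0],
    "sleep_hours": [7.0, 6.5, np.nan, np.nan],
    "performance_metric": [70.0, 65.0, 88.0, 75.0],
})

# Count and share of missing values per column
missing_summary = pd.DataFrame({
    "missing_count": demo.isnull().sum(),
    "missing_share": demo.isnull().mean(),
})

# Keep only the affected columns, most affected first
missing_summary = missing_summary[missing_summary["missing_count"] > 0]
missing_summary = missing_summary.sort_values("missing_count", ascending=False)
print(missing_summary)
```

Replacing `demo` with `df` gives the same table for the real dataset. Whether to drop or impute the affected rows depends on how large these shares turn out to be.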

## Visualizations

Let's create some visualizations to understand the data better.

In [None]:
# Distribution of student performance
plt.figure(figsize=(10, 6))
sns.histplot(df['performance_metric'], bins=30, kde=True)
plt.title('Distribution of Student Performance')
plt.xlabel('Performance Metric')
plt.ylabel('Frequency')
plt.show()

In [None]:
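# Correlation heatmap
# numeric_only=True (pandas >= 1.5) skips non-numeric columns, which would
# otherwise make df.corr() raise an error in pandas 2.0 and later
plt.figure(figsize=(12, 8))
sns.heatmap(df.corr(numeric_only=True), annot=True, fmt='.2f', cmap='coolwarm', square=True)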
plt.title('Correlation Heatmap')
plt.show()
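
A full heatmap can be hard to read when the question is simply which variables move most closely with performance. One option is to rank the numeric columns by the absolute value of their correlation with the performance column. The sketch below does this on a small made-up frame named `demo`; apart from `performance_metric`, which appears in the histogram cell, the column names are hypothetical.

```python
import pandas as pd

# Hypothetical stand-in for df; column names and values are made up for illustration
demo = pd.DataFrame({
    "study_hours": [1.0, 2.0, 3.0, 4.0, 5.0],
    "screen_time": [5.0, 3.0, 4.0, 2.0, 1.0],
    "gender": ["F", "M", "F", "M", "F"],
    "performance_metric": [52.0, 61.0, 70.0, 79.0, 88.0],
})

# Pearson correlation is only defined for numeric columns
numeric = demo.select_dtypes(include="number")

# Correlation of each numeric column with the target, strongest first
target_corr = (
    numeric.corr()["performance_metric"]
    .drop("performance_metric")
    .sort_values(key=lambda s: s.abs(), ascending=False)
)
print(target_corr)
```

Sorting by absolute value keeps strong negative relationships near the top of the list instead of at the bottom. These are linear correlations only, so they describe association and do not establish that a habit causes a change in performance.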

## Conclusion

In this notebook, we performed exploratory data analysis on the student habits and performance dataset. We reviewed summary statistics, checked for missing values, plotted the distribution of the performance metric, and examined the correlations between variables. These results can guide further analysis and modeling.