# 📊 Student Performance Analysis

This notebook contains an exploratory data analysis (EDA) of student performance data, focusing on the influence of socio-economic and parental factors on educational outcomes.

In [None]:
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
import numpy as np

%matplotlib inline

## 🔍 Load Dataset

In [None]:
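# Note: the original UCI copy of student-mat.csv is semicolon-delimited;
# if that is the file in use, pass sep=';' to read_csv.
df = pd.read_csv('data/student-mat.csv')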
df.head()

## 📈 Dataset Overview

In [None]:
df.info()
df.describe()

## 🔗 Correlation Matrix

In [None]:
plt.figure(figsize=(12, 8))
sns.heatmap(df.corr(numeric_only=True), annot=True, cmap='coolwarm', fmt='.2f')
plt.title('Correlation Matrix')
plt.show()
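
A full heatmap can be hard to scan when there are many numeric columns. As a minimal sketch, the correlations can also be ranked against a single target column. The helper name `rank_correlations`, the choice of `G3` as the target, and the `studytime` column in the synthetic demo frame are assumptions made for illustration, not part of the original analysis.

```python
import pandas as pd


def rank_correlations(frame: pd.DataFrame, target: str = "G3") -> pd.Series:
    """Return Pearson correlations with `target`, strongest (by absolute value) first.

    Hypothetical helper: non-numeric columns are ignored via numeric_only=True.
    """
    corr = frame.corr(numeric_only=True)[target].drop(target)
    order = corr.abs().sort_values(ascending=False).index
    return corr.reindex(order)


# Small synthetic frame (not real student data) to show the call.
demo = pd.DataFrame({
    "G3": [10, 12, 14, 16, 18],
    "studytime": [1, 2, 3, 4, 5],
    "absences": [9, 2, 7, 1, 3],
    "school": ["GP", "GP", "MS", "MS", "GP"],
})

ranked = rank_correlations(demo)
print(ranked)
```

On the real data, the call would presumably be `rank_correlations(df)`, provided `df` contains the target column.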

## 🧠 Average Grades by Parental Education

In [None]:
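# Columns needed to build both derived features.
required = ['Medu', 'Fedu', 'G1', 'G3']
missing = [col for col in required if col not in df.columns]

if missing:
    # Without this guard the plot would fail with an unclear error.
    print(f'Skipping plot; missing columns: {missing}')
else:
    # Mean of mother's and father's education level.
    df['avg_parent_edu'] = (df['Medu'] + df['Fedu']) / 2
    # Mean of the G1 and G3 grades.
    df['avg_grade'] = (df['G1'] + df['G3']) / 2

    sns.boxplot(x='avg_parent_edu', y='avg_grade', data=df)
    plt.title('Average Grade by Parental Education Level')
    plt.xlabel('Average Parental Education (0-4)')
    plt.ylabel('Average Grade')
    plt.show()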
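
A boxplot does not show how many students fall into each group, and boxes built from only a few rows can be misleading. The sketch below tabulates count, median, and mean per group. It assumes the `avg_parent_edu` and `avg_grade` columns created above; the helper name `summarize_by_group` and the demo values are hypothetical.

```python
import pandas as pd


def summarize_by_group(frame: pd.DataFrame,
                       group_col: str = "avg_parent_edu",
                       value_col: str = "avg_grade") -> pd.DataFrame:
    """Return count, median, and mean of `value_col` for each `group_col` value."""
    return (
        frame.groupby(group_col)[value_col]
        .agg(["count", "median", "mean"])
        .sort_index()
    )


# Synthetic values (not real student data) to show the output shape.
demo = pd.DataFrame({
    "avg_parent_edu": [1.0, 1.0, 2.5, 2.5, 4.0],
    "avg_grade": [8.0, 10.0, 11.0, 13.0, 15.0],
})

summary = summarize_by_group(demo)
print(summary)
```

Groups with a very small `count` are worth reading cautiously in the plot.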

## 🚀 Future Work
- Apply classification models to predict student performance.
- Analyze differences between schools.
- Visualize the impact of absences or alcohol consumption.
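
As a starting point for the absences item, one possible approach is to bin absences into bands and compare the mean grade per band before plotting anything. This is only a sketch: the band edges and labels, the helper name `grade_by_absence_band`, and the use of `G3` as the grade column are assumptions, and the demo frame is synthetic.

```python
import pandas as pd


def grade_by_absence_band(frame: pd.DataFrame,
                          grade_col: str = "G3",
                          bins=(-1, 0, 5, 15, float("inf")),
                          labels=("0", "1-5", "6-15", "16+")) -> pd.Series:
    """Return the mean of `grade_col` per absence band (band edges are illustrative)."""
    # pd.cut uses right-closed intervals, so (-1, 0] captures exactly zero absences.
    bands = pd.cut(frame["absences"], bins=list(bins), labels=list(labels))
    return frame.groupby(bands, observed=True)[grade_col].mean()


# Synthetic values (not real student data).
demo = pd.DataFrame({
    "absences": [0, 0, 3, 10, 20],
    "G3": [15, 13, 12, 10, 6],
})

band_means = grade_by_absence_band(demo)
print(band_means)
```

The resulting series could then be passed to a bar plot, or the same banding could be reused for the school comparison.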

---

*Author: Anthiha*