This project analyzes student data from an Excel file (Python Data.xlsx
) using Python, pandas, seaborn, and matplotlib. The analysis is performed in a Jupyter Notebook (23102B2002_internship.ipynb).
23102B2002_internship.ipynb
: Main notebook containing all analysis code and visualizations.Python Data.xlsx
: Source data file with student information.
-
Install dependencies
Make sure you have Python 3.x installed.
Install required packages:pip install pandas seaborn matplotlib numpy
-
Run the notebook
Open23102B2002_internship.ipynb
in Jupyter Notebook or VS Code and run the cells.
The notebook covers the following analyses:
- Unique Students: Counts the number of unique students.
- Average CGPA: Calculates the average CGPA.
- Graduation Year Distribution: Visualizes student distribution across graduation years.
- Python Experience: Shows the distribution of students' experience with Python.
- Average Family Income: Computes average family income (mapped to numeric values).
- CGPA by College: Visualizes top 5 colleges by average CGPA.
- Outlier Detection: Detects outliers in attendee status and course quantity.
- CGPA by City: Plots average CGPA for each city.
- Family Income vs CGPA: Examines relationship between family income and CGPA.
- Expected Salary Factors: Visualizes how CGPA, family income, and Python experience affect expected salary.
- Popular Events by Field: Identifies which events attract students from specific fields.
- Leadership Analysis: Compares CGPA and expected salary for students with/without leadership skills.
- Leadership Correlation: Calculates correlation between leadership skills and expected salary.
- Graduating Students: Counts students graduating by end of 2024.
- Promotion Channel Effectiveness: Shows which channels bring more student participation.
- Data Science Event Attendance: Counts students attending Data Science events.
- High CGPA & Experience Salary Expectation: Analyzes salary expectations for students with high CGPA and Python experience.
- Event Awareness by College: Finds students who learned about the event from their college and lists top 5 colleges.
- Update the path to
Python Data.xlsx
in the notebook if needed. - Run each cell to see the analysis and visualizations.
- Modify or extend the notebook for additional insights.
This project is for educational and internship purposes.