GitHub - sanjayaben/student-performance-prediction

student-performance-prediction

Installation

No libraries beyond the Anaconda distribution of Python should be needed to run the code here. The code should run with no issues on Python 3.x.

Project Motivation

The objective of the project was to analyze how various factors affect student performance in a specific course. Throughout the analysis we tried to answer the following questions.

How effective are the traditional academic indicators?
How do socioeconomic factors impact the final grade?
Does attendance contribute to the final grade?
Which behaviors are vital for a good grade?

In analysing the numeric attributes, we used the correlation matrix to identify the type of influence these attributes have. The categorical attributes were analysed by simply averaging the final grade grouped by the attributes of interest.
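The two analysis steps described above can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: the small data frame is made up, and the column names (`G1`, `G2`, `G3`, `absences`, `internet`) are assumed to match those in student-mat.csv.

```python
import pandas as pd

# Made-up sample that mimics a few columns of student-mat.csv.
# The values are illustrative only and are not taken from the real data set.
df = pd.DataFrame({
    "G1": [10, 12, 8, 15, 14, 6],
    "G2": [11, 12, 7, 16, 13, 5],
    "G3": [11, 13, 6, 16, 14, 5],
    "absences": [2, 0, 10, 4, 6, 8],
    "internet": ["yes", "yes", "no", "yes", "no", "no"],
})

# Numeric attributes: correlation of each numeric column with the final grade G3.
numeric = df.select_dtypes("number")
corr_with_g3 = numeric.corr()["G3"].drop("G3").sort_values(ascending=False)

# Categorical attributes: mean final grade per category value.
mean_g3_by_internet = df.groupby("internet")["G3"].mean()

print(corr_with_g3)
print(mean_g3_by_internet)
```

A positive correlation suggests the attribute rises together with the final grade, a negative one suggests the opposite. Neither the correlation nor the group averages establish causation.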

File Descriptions

There are a couple of Jupyter notebooks available here:

Data Understanding - contains the data exploration and analysis steps
Modelling - contains the modelling steps, including data preparation, model training and evaluation

The data set is available in the student-mat.csv file.

Results

The findings are elaborated in the following article.

Key findings

Continuous/interim assessments stand out as strong indicators of the final grade achieved by students.
Educated guardians seem to have more influence on students, and living with parents seems to help students perform better. An urban setting with facilities such as home internet also seems to have a positive impact on performance.
Absences do not show the expected negative correlation with the final grade. It could be that secondary school students cope with absences better than primary school students.
Behavioral attributes are complex to analyze and may need to be looked at in the context of the culture that exists within the sample, i.e. the conclusions may not hold in all cultural contexts.

Future work

Alternatively, we could try a classification model that predicts whether the final grade is going to be "good" or "bad". For this we first need to transform G3 into a categorical label, which we can then use to train the model. This would be a better use of the data set in its current form.
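As a rough sketch of that transformation, the snippet below turns G3 into a two-class label. The threshold of 10 is an assumption made for illustration (G3 in the UCI data is on a 0 to 20 scale); the project does not define what counts as "good". The sample grades are made up.

```python
import pandas as pd

# Assumed cut-off: grades of 10 or above count as "good". This value is a
# hypothetical choice for illustration, not something the project specifies.
PASS_THRESHOLD = 10

# Made-up final grades standing in for the G3 column of student-mat.csv.
g3 = pd.Series([0, 5, 9, 10, 14, 20], name="G3")

# Map each grade to a categorical label that a classifier can be trained on.
grade_label = (g3 >= PASS_THRESHOLD).map({True: "good", False: "bad"})
grade_label = grade_label.rename("grade_label")

print(grade_label.value_counts())
```

When choosing the cut-off, it helps to check how balanced the resulting classes are, since a heavily skewed label makes accuracy a misleading metric.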

Licensing, Authors, Acknowledgements

Credit goes to UCI for the data. You can find the licensing for the data and other descriptive information at the UCI link available here. Otherwise, feel free to use the code here as you would like!

