Welcome to Learning Analytics repository

About

This is a mini-project for SC1015 (Data Science and Artificial Intelligence) which focuses on Learning Analytics from the Student Performance Dataset. For detailed walkthrough please view in the following format:

Video Presentation available here

Contributions

Alan, Keith, Mavis, all done together with even work distribution

Problem Definition

Are we able to determine if a student will fail based on his lifestyle?
Can we predict which "band" a student will fall into based on previous records?
Following these bands we can remedy their scores and provide appropriate support.

Models Used

Elbow Plot for Cluster Selection
KMeans & KPrototype clustering
Hierarchical Clustering (Dendrogram)
Decision Tree

Conclusion

Parent's education place a part in one's school performance.
Mother staying at home could be detrimental to one's school performance.
Studying more equates to better grades, but given it is based on qualitative grounds, this might be reliant on one's confidence.
Aiming for higher education does improve one's school performance (driven).
Internet access plays a part in performance.
Being absent for classes does not necessarily equate to bad performance, but there is a general trend of performing badly.
Being self-aware and accepting of high alcohol consumption on weekly and daily basis would negatively affect performance.

What did we learn from this project?

Learning Analytics
Clustering
Decision Tree
Collaborating using GitHub
Concepts about Distance, Accuracy, Noise and Data Handling

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
Correlation Table.ipynb		Correlation Table.ipynb
DTallvar.ipynb		DTallvar.ipynb
DTnum.ipynb		DTnum.ipynb
DTusefulcluster-FeduIncluded (xgboosted).ipynb		DTusefulcluster-FeduIncluded (xgboosted).ipynb
DTusefulcluster-FeduIncluded.ipynb		DTusefulcluster-FeduIncluded.ipynb
DTusefulcluster.ipynb		DTusefulcluster.ipynb
Data Exploration.ipynb		Data Exploration.ipynb
Hierarchical Clustering.ipynb		Hierarchical Clustering.ipynb
KMeans Clustering.ipynb		KMeans Clustering.ipynb
KPrototypes Clustering.ipynb		KPrototypes Clustering.ipynb
README.md		README.md
SC1015 Presentation.pptx		SC1015 Presentation.pptx
SC1015 UMAP.ipynb		SC1015 UMAP.ipynb
SC1015 XGboost_allvar.ipynb		SC1015 XGboost_allvar.ipynb
SC1015_XGB2.ipynb		SC1015_XGB2.ipynb
Unsupervised UMAP.ipynb		Unsupervised UMAP.ipynb
student-por.csv		student-por.csv

alanwalker23/SC1015-Overbyte

Folders and files

Latest commit

History

Repository files navigation

Welcome to Learning Analytics repository

About

Contributions

Problem Definition

Models Used

Conclusion

What did we learn from this project?

References

About

Resources

Stars

Watchers

Forks

Languages