Web-based Clustering Application Using a Shiny Framework for Determining Student Engagement Levels in Virtual Learning Environments
https://ds-analytics.shinyapps.io/Student-Segmentation/
- Explored two clustering algorithms - Gaussian Mixture Model and K-means
- Compared and evaluated the clutering algorithms using the following metrics:
Clustering Time:
How long it takes the algorithm to fit the dataSilhoutte Coefficient (SC):
Calculates the mean distance between the data points to find the better defined clusters - SC scikit-learnCalinski-Harabasz Index (CHI):
Calculates the ratio of the sum of within and between cluster dispersion for all clusters used to find well-defined clusters - CHI scikit-learnDavies-Bouldin Index (DBI):
Calculates the average similarity between each cluster and its most similar one, used to indicate well-partitioned clusters - DBI scikit-learn
- More information on clustering metrics: https://scikit-learn.org/stable/modules/clustering.html
- Built a R Shiny dashboard with three main functionalities:
GMM cluster Analysis
,User Guide
,Descriptive Analysis
Knowledge Discovery in Database (KDD) Methodology
- Higher Educational Institutions are greatly relying on online learning platforms, such as Moodle, Blackboard, Canvas and etc to assess, communicate and share resources with students.
- These platforms capture and store each user's online interaction data in the system (both students and educators) by generating large and varied data sets providing the opportunity to discover valuable information
- Opportunity - using clustering algorithms:
- To determine low engaged students online
- To understand the different students engaging with online learning platforms
- To perform student segmentation
- To provide personalised support to students engaging with the platforms differently
RStudio Version: 2022.07.1 Build 554
Libraries: tidyverse
, mclust
, highcharter
, shiny
, bs4Dash
, waiter
, shinycssloaders
, DT
, callr
, data.table
, rintrojs
, readxl
, lubridate
, oulad
Open University Learning Analytics Dataset (OULAD): https://analyse.kmi.open.ac.uk/open_dataset