This repository contains our group submission for the CS3751 Data Visualization class project.
This project analyzes student quiz-attempt data and validates key hypotheses through effective visualizations.
The quiz data includes the following attributes:
- Student code
- Attempt start time
- Attempt end time
- Time taken
- Marks out of 10
- Marks for individual questions
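As a rough illustration of how one attempt record with these attributes might be represented in Python, the sketch below uses a hypothetical `Attempt` dataclass. The field names, the timestamp format, and the idea that the total is the sum of the per-question marks are all assumptions for illustration, not the actual layout of the dataset.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Assumed timestamp format; the real export may use a different one.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


@dataclass
class Attempt:
    """One quiz attempt (hypothetical field names)."""
    student_code: str
    started: datetime
    ended: datetime
    question_marks: list[float]

    @property
    def time_taken(self) -> timedelta:
        # Derived from the start and end times of the attempt.
        return self.ended - self.started

    @property
    def total_marks(self) -> float:
        # Assumption: the overall mark is the sum of the per-question marks.
        return sum(self.question_marks)


def parse_attempt(student_code, started, ended, question_marks):
    """Build an Attempt from raw string timestamps."""
    return Attempt(
        student_code=student_code,
        started=datetime.strptime(started, TIME_FORMAT),
        ended=datetime.strptime(ended, TIME_FORMAT),
        question_marks=list(question_marks),
    )
```

If the dataset already stores the time taken and the total mark, those stored values should take precedence over the derived ones shown here.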
Use suitable visualizations to verify the following hypotheses:
- Students who take longer to complete the quiz tend to score higher.
- Some questions are consistently harder than others.
- High-performing students consistently improve scores over multiple attempts, whereas low performers show erratic progress.
- More difficult questions take longer to answer, but higher-performing students answer them faster than lower-performing students.
- There is an optimal time range for achieving higher scores, and students who finish too quickly or take too long tend to score lower.
- Students who attempt quizzes at night tend to score higher than those who attempt them during the day.
- Score improvements decrease after several attempts (learning plateaus over time).
- Students tend to exhibit consistent performance across all quizzes.
- Students who fail the first question (Q1) are statistically more likely to have a lower success rate on subsequent questions (Q2-Q5) compared to those who get Q1 correct.
- Students who score on hard questions achieve disproportionately higher overall grades than students who only score on easy questions.
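A numeric check can complement the visualizations for some of these hypotheses. As a minimal sketch for the first one (time taken versus score), the helper below computes a Pearson correlation coefficient using only the standard library. The function name `pearson_r` and its inputs are illustrative assumptions and are not taken from the project notebooks.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of co-deviations and squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)
```

Here `xs` would hold the time taken per attempt (for example in seconds) and `ys` the marks out of 10. A positive value would be consistent with the first hypothesis, but a linear coefficient cannot capture the "optimal time range" hypothesis, which describes a non-linear relationship and is better judged from a scatter plot or binned averages.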
The repository is organized as follows:
- Task 1/: notebooks and data for the Task 1 analysis. Python notebooks are numbered according to the hypothesis number.
- Task 2/: notebooks and analysis for the five additional hypotheses. Python notebooks are numbered according to the Task 2 order above.