🐍 Python Learning Activity Dataset & Analysis

📌 Overview

This repository contains a synthetic dataset of 3,000 students learning Python, designed for data analysis, machine learning, and educational research. The dataset captures demographics, learning behaviors, engagement metrics, and final exam outcomes, enabling insights into student performance and actionable recommendations for course design.

📂 Dataset Description

The dataset simulates student engagement in a Python course and its relationship with performance outcomes.

Key Characteristics:

Number of Students: 3,000
Age Range: 16–55 years
Course Duration: Up to 15 weeks
Target Variables: final_exam_score, passed_exam

🧾 Column Definitions

Click to expand column definitions

Column Name	Type	Description
`student_id`	int	Unique student identifier
`age`	int	Age of learner (16–55)
`country`	object	Student country (e.g., India, Bangladesh, USA, UK)
`prior_programming_experience`	category	Programming experience level before Python
`weeks_in_course`	int	Number of weeks enrolled (1–15)
`hours_spent_learning_per_week`	float	Average weekly learning hours
`practice_problems_solved`	int	Number of coding challenges solved
`projects_completed`	int	Number of Python projects completed
`tutorial_videos_watched`	int	Number of tutorial videos watched
`uses_kaggle`	binary	Kaggle usage (1 = Yes, 0 = No)
`participates_in_discussion_forums`	binary	Forum participation (1 = Yes, 0 = No)
`debugging_sessions_per_week`	int	Average debugging sessions per week
`self_reported_confidence_python`	int	Self-reported Python confidence (1–10)
`final_exam_score`	float	Final exam score (0–100)
`passed_exam`	binary	Exam result (1 = Passed, 0 = Failed)

🛠️ Tools

Python – Data processing, cleaning, and exploratory data analysis (EDA)
Power BI – Interactive data visualizations for insights and reporting

📊 Project Overview

This analysis explores 3,000 students’ final exam performance to identify factors influencing pass and fail outcomes. By comparing learning behaviors, engagement metrics, and demographic variables, the project highlights actionable insights to improve pass rates.

🖼️ Power BI Dashboard

The dashboard provides an interactive view of student performance, including pass rates, score distributions, and learning behavior comparisons between passed and failed students.

🔗 Dashboard Link: Power BI Dashboard

⚡ Key Drivers of Performance (Causes)

Insufficient Study Time
- Students who studied three hours or less per week performed poorly.
- Highlights insufficient study habits and the need for engagement strategies.
Project Completion
- More projects completed correlated with higher exam scores.
- Project completion is a strong predictor of success, emphasizing active engagement.
Practice Problem Solving
- The more practice problems students solved, the higher their exam scores.
- Reinforces understanding, strengthens analytical skills, and moves beyond rote memorization.

🔍 Key Insights

1. Severe Performance Gap vs. Target

Only 18% of students passed, significantly below the 80% target passing rate.
Indicates a systemic learning and assessment issue, not isolated underperformance.
Average final exam score: 43.32, reinforcing the urgent need for intervention.

2. Hands-on Effort Is the Strongest Differentiator

Students who passed showed higher engagement in active learning:
- Weekly study time: 8.6 vs. 6.7 hours
- Projects completed: 2.8 vs. 1.8
- Practice problems solved: 2.8 vs. 1.8
These factors strongly align with higher exam scores (69.0 vs. 37.8).

3. Passive Learning Has Minimal Impact

Tutorial video views, forum participation, Kaggle usage, and debugging sessions were nearly identical between pass and fail groups.
Passive or unstructured engagement alone is insufficient for meaningful outcomes.

4. Confidence Is Correlated but Not Causal

Self-reported confidence correlates with higher exam scores but does not independently predict performance.
Confidence reflects mastery from hands-on practice rather than driving success.

5. Age Is Not a Performance Driver

Age has no meaningful relationship with exam scores.
Performance gaps are behavioral and instructional, not demographic.

✅ Recommendations

1. Shift Curriculum Toward Mandatory Hands-On Practice

Increase required coding projects and practice problems.
Tie activities to graded milestones.
Introduce minimum weekly practice thresholds.
📌 Rationale: Active engagement shows the strongest link to passing.

2. Early Risk Detection and Intervention

Identify students with low study time, few projects, or minimal practice.
Trigger early support via tutoring, guided labs, or structured study plans.
📌 Rationale: Most failures likely stem from insufficient early engagement.

3. Redesign Learning Materials to Be Practice-Centered

Convert tutorial videos into interactive exercises and code-along labs.
Reduce reliance on purely observational formats.
📌 Rationale: Passive content shows no meaningful performance impact.

4. Align Confidence With Demonstrated Skill

Replace self-assessed confidence with skill-based diagnostics and weekly coding challenges.
Use confidence metrics only as supporting indicators.
📌 Rationale: Confidence without practice may create false readiness.

5. Reassess Exam Difficulty and Curriculum Alignment

Ensure exam questions align with taught material and activities.
Ensure difficulty progression is gradual, not abrupt.
Pilot midterm diagnostics to measure readiness.
📌 Rationale: An 82% failure rate suggests misalignment between teaching and assessment.

📌 Conclusion

Consistent hands-on practice is the primary driver of student success. Improving pass rates requires:

Curriculum redesign for active learning
Assessment alignment
Early interventions guided by data-driven monitoring

📜 License

This dataset is provided for educational and research purposes only. You are free to use, modify, and share it with attribution.

Author: Data Analytics Project | Tools: Python, Power BI

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
Student_Performance_Analysis		Student_Performance_Analysis
images		images
notebook		notebook
powerpoint		powerpoint
python_learning_exam_performance.csv		python_learning_exam_performance.csv
python_learning_exam_performance.pbix		python_learning_exam_performance.pbix
python_learning_exam_performance_clean.csv		python_learning_exam_performance_clean.csv
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🐍 Python Learning Activity Dataset & Analysis

📌 Overview

📖 Table of Contents

📂 Dataset Description

🧾 Column Definitions

🛠️ Tools

📊 Project Overview

🖼️ Power BI Dashboard

⚡ Key Drivers of Performance (Causes)

🔍 Key Insights

1. Severe Performance Gap vs. Target

2. Hands-on Effort Is the Strongest Differentiator

3. Passive Learning Has Minimal Impact

4. Confidence Is Correlated but Not Causal

5. Age Is Not a Performance Driver

✅ Recommendations

1. Shift Curriculum Toward Mandatory Hands-On Practice

2. Early Risk Detection and Intervention

3. Redesign Learning Materials to Be Practice-Centered

4. Align Confidence With Demonstrated Skill

5. Reassess Exam Difficulty and Curriculum Alignment

📌 Conclusion

📜 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🐍 Python Learning Activity Dataset & Analysis

📌 Overview

📖 Table of Contents

📂 Dataset Description

🧾 Column Definitions

🛠️ Tools

📊 Project Overview

🖼️ Power BI Dashboard

⚡ Key Drivers of Performance (Causes)

🔍 Key Insights

1. Severe Performance Gap vs. Target

2. Hands-on Effort Is the Strongest Differentiator

3. Passive Learning Has Minimal Impact

4. Confidence Is Correlated but Not Causal

5. Age Is Not a Performance Driver

✅ Recommendations

1. Shift Curriculum Toward Mandatory Hands-On Practice

2. Early Risk Detection and Intervention

3. Redesign Learning Materials to Be Practice-Centered

4. Align Confidence With Demonstrated Skill

5. Reassess Exam Difficulty and Curriculum Alignment

📌 Conclusion

📜 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages