Data Transformation and Machine Learning for Student Performance Prediction

This project predicts student performance in math, reading, and writing from demographic and educational factors: gender, race/ethnicity, parental level of education, lunch type, and completion of a test preparation course. It uses Python, data processing pipelines, and machine learning algorithms to build the predictive models.

Overview

The main goal is to transform raw student performance data and build machine learning models that predict a student's score in math, reading, or writing from their demographic and educational background. The project combines feature engineering, data preprocessing, and regression algorithms such as linear regression and random forests.
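
As a rough illustration of the prediction task described above, the sketch below fits a linear regression and a random forest to a small synthetic table and compares their R^2 scores on held-out rows. The column names, the generated data, the choice of math score as the target, and the 80/20 split are all assumptions made for this example; none of them are taken from the project's code or dataset.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the student data; column names are assumptions.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "gender": rng.choice(["female", "male"], size=n),
    "lunch": rng.choice(["standard", "free/reduced"], size=n),
    "test_preparation_course": rng.choice(["none", "completed"], size=n),
    "reading_score": rng.integers(40, 100, size=n),
})
# Make the target depend on the features so the models have something to learn.
df["math_score"] = (
    0.8 * df["reading_score"]
    + 5.0 * (df["test_preparation_course"] == "completed")
    + rng.normal(0, 3, size=n)
)

# One-hot encode the categorical columns, then hold out 20% of rows for testing.
X = pd.get_dummies(df.drop(columns="math_score"), drop_first=True, dtype=float)
y = df["math_score"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

models = {
    "linear_regression": LinearRegression(),
    "random_forest": RandomForestRegressor(n_estimators=100, random_state=0),
}
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = r2_score(y_test, model.predict(X_test))
    print(f"{name}: R^2 = {scores[name]:.3f}")
```

On real data the scores depend on how much signal the features carry, so the values printed for this synthetic table say nothing about the project's actual results.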

Features

- Data Transformation Pipelines: Includes preprocessing steps such as missing value imputation, one-hot encoding for categorical variables, and scaling for numerical features.
- Machine Learning Models: Regression models like Linear Regression, Random Forest, and Gradient Boosting to predict student scores.
- Exploratory Data Analysis (EDA): Visualization and analysis of relationships between features and target variables (student scores).
- Automated Pipelines: Uses Scikit-Learn's Pipeline and ColumnTransformer to streamline the preprocessing and modeling process.
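
The preprocessing steps listed above can be combined roughly as follows. This is a minimal sketch, not the project's actual pipeline: the column names, the imputation strategies (median and most frequent), the tiny made-up table, and the use of Scikit-Learn's GradientBoostingRegressor as the final estimator are assumptions chosen for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.impute import SimpleImputer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical column names; adjust them to match the real dataset.
categorical_cols = ["gender", "parental_level_of_education", "lunch"]
numeric_cols = ["reading_score", "writing_score"]

# Numerical features: fill missing values with the median, then scale.
numeric_pipeline = Pipeline([
    ("imputer", SimpleImputer(strategy="median")),
    ("scaler", StandardScaler()),
])
# Categorical features: fill missing values with the mode, then one-hot encode.
categorical_pipeline = Pipeline([
    ("imputer", SimpleImputer(strategy="most_frequent")),
    ("encoder", OneHotEncoder(handle_unknown="ignore")),
])
# Route each group of columns to its own preprocessing pipeline.
preprocessor = ColumnTransformer([
    ("num", numeric_pipeline, numeric_cols),
    ("cat", categorical_pipeline, categorical_cols),
])
# Chain preprocessing and the regressor so both are fitted together.
model = Pipeline([
    ("preprocess", preprocessor),
    ("regressor", GradientBoostingRegressor(random_state=0)),
])

# Tiny made-up training table with a few missing values.
train = pd.DataFrame({
    "gender": ["female", "male", "female", "male", np.nan, "female"],
    "parental_level_of_education": [
        "high school", "bachelor's degree", np.nan,
        "some college", "high school", "master's degree",
    ],
    "lunch": ["standard", "standard", "free/reduced",
              "free/reduced", "standard", "standard"],
    "reading_score": [72, 90, 95, np.nan, 57, 78],
    "writing_score": [74, 88, 93, 44, np.nan, 75],
    "math_score": [72, 69, 90, 47, 76, 71],
})
feature_cols = numeric_cols + categorical_cols
model.fit(train[feature_cols], train["math_score"])

# New rows may contain missing values or categories not seen during fitting.
new_students = pd.DataFrame({
    "gender": ["male", "female"],
    "parental_level_of_education": ["associate's degree", "high school"],
    "lunch": ["standard", np.nan],
    "reading_score": [80, np.nan],
    "writing_score": [78, 60],
})
preds = model.predict(new_students[feature_cols])
print(preds)
```

Wrapping the preprocessor and the estimator in a single Pipeline means the imputation statistics, scaling parameters, and category vocabulary are learned from the training rows only and then reused unchanged at prediction time.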

Tech Stack

Programming Language: Python
Libraries:
- Data Manipulation: Pandas, NumPy
- Data Visualization: Matplotlib, Seaborn
- Machine Learning: Scikit-Learn, XGBoost
- Others (Python standard library): os, logging, dataclasses
Tools:
- Git, GitHub
- Jupyter Notebooks (for EDA and development)
- Visual Studio Code


ttlarson7/test_score_predictor
