This project builds and deploys a machine learning model to predict student performance in math, reading, and writing using demographic and educational factors. It showcases an end-to-end machine learning pipeline, including data preprocessing, model training, and deployment on both AWS and Azure cloud platforms.
- Develop predictive models for student scores.
- Explore feature relationships and identify factors influencing performance.
- Deploy trained models as scalable web services on AWS and Azure.
- Demonstrate a complete ML lifecycle from data to deployment.
The dataset contains student information, including:
- Gender
- Race/Ethnicity
- Parental Level of Education
- Lunch Type
- Test Preparation Course
- Math Score
- Reading Score
- Writing Score
- Clone the repository:
https://github.com/PiusSunday/student-performance-analysis - Navigate to the project directory:
cd student-performance-analysis - Create a virtual environment (recommended):
conda create -n student-performance-analysis-venv -y - Activate the virtual environment:
- macOS/Linux:
conda activate student-performance-analysis-venv
- macOS/Linux:
- Install dependencies:
pip install -r requirements.txt - Explore the data and train the model using the Jupyter notebooks in the
notebooksdirectory. - Follow the instructions in the
aws/andazure/directories to deploy the model on your chosen cloud platform.
- pandas
- numpy
- scikit-learn
- matplotlib
- seaborn
- Flask/FastAPI (for API deployment)
-
AWS: coming soon...
-
Azure: coming soon...
Contributions are welcome! Please feel free to submit pull requests or open issues.