Skip to content

Feature Engineering and Machine Learning model to predict efficient use of time on NAEP Assessment from student log data.

License

Notifications You must be signed in to change notification settings

levinnat/NAEPDataMining

Repository files navigation

NAEPDataMining

This repository provides the code for implementing the predictive model built for the 2019 NAEP Data Mining Competition. https://sites.google.com/view/dataminingcompetition2019/home?authuser=0

Step 1: Obtain the dataset from - https://sites.google.com/view/dataminingcompetition2019/dataset?authuser=0 Copy all data files into the TrainingData folder

Step 2: Run DataScraping to generate all features

Step 3: Use ModelBuildingAndPrediction to train the model, generate predictions, and evaluate them against the hidden evaluation data in the EvaluationData folder

Disclaimer: This code was built over a series of months for academic research. There may be small bugs, there are probably inefficiencies, and there are most certainly some places where more comments are needed. I was simultaneously teaching myself data science and working on this project. Any and all feedback/constructive criticism is welcome...but please be kind.

DOI

About

Feature Engineering and Machine Learning model to predict efficient use of time on NAEP Assessment from student log data.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published