OKT: Open-ended Knowledge Tracing

OKT provides the first exploration into open-ended knowledge tracing by studying the new task of predicting students’ exact open-ended responses to questions. This repository contains the code for the paper "Open-Ended Knowledge Tracing for Computer Science Education" by Naiming Liu, Zichao Wang, Richard G. Baraniuk, and Andrew Lan, to be presented at EMNLP 2022.

A block diagram of OKT is provided in OKT-code.png.

Dependencies

python 3.8.12
torch 1.10.0
transformers 4.6.1
scikit-learn 0.24.2
numpy 1.22.1
munch 2.5.0
nltk 3.7
neptune-client 0.14.2
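
The pinned versions above can be compared against the current environment before running anything. The following is a minimal sketch using only the standard library; the helper names `installed_version` and `find_mismatches` are hypothetical and not part of this repository, and the check assumes that a local build suffix such as `+cu113` on an installed version should be ignored.

```python
from importlib import metadata

# Pinned versions copied from the dependency list above.
PINNED = {
    "torch": "1.10.0",
    "transformers": "4.6.1",
    "scikit-learn": "0.24.2",
    "numpy": "1.22.1",
    "munch": "2.5.0",
    "nltk": "3.7",
    "neptune-client": "0.14.2",
}


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Map each package that deviates from its pin to (wanted, found)."""
    mismatches = {}
    for package, wanted in pinned.items():
        found = lookup(package)
        # Assumption: drop a local build suffix such as "+cu113" before comparing.
        base = found.split("+")[0] if found else None
        if base != wanted:
            mismatches[package] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for package, (wanted, found) in find_mismatches(PINNED).items():
        print(f"{package}: expected {wanted}, found {found or 'not installed'}")
```

Running the script prints one line per package that is missing or installed at a different version, and prints nothing when the environment matches the list.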

Data

We use the CSEDM dataset and preprocess the data by (1) removing all code submissions that cannot be parsed into an abstract syntax tree and (2) converting student code into vector representations using ASTNN. You can download the preprocessed data with the commands below.

Download preprocessed data

cd scripts
bash data.sh
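
The first preprocessing step described above, dropping submissions that cannot be parsed into an abstract syntax tree, can be illustrated as follows. This is a sketch of the idea and not the repository's preprocessing code: it uses Python's built-in `ast` module as a stand-in parser, whereas the dataset's submissions may be written in another language that needs its own parser. The function names `parses_as_python` and `keep_parsable` are hypothetical.

```python
import ast


def parses_as_python(source):
    """Return True if `source` parses into a Python AST (stand-in parser)."""
    try:
        ast.parse(source)
        return True
    except (SyntaxError, ValueError):
        return False


def keep_parsable(submissions, can_parse=parses_as_python):
    """Keep only the submissions that the given parser accepts."""
    return [code for code in submissions if can_parse(code)]


if __name__ == "__main__":
    samples = [
        "def f(x):\n    return x + 1\n",  # well-formed
        "def f(x:\n    return x +",       # truncated, does not parse
    ]
    print(f"kept {len(keep_parsable(samples))} of {len(samples)} submissions")
```

Passing a different `can_parse` callable swaps in a parser for another language without changing the filtering logic.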

Fine-tuned/Pre-trained models

Download fine-tuned GPT models

We provide two fine-tuned GPT-2 models for testing the pre-trained response generation model. The first is fine-tuned on the funcom dataset; the second is further fine-tuned on CSEDM, starting from the first. The models can be downloaded with the following commands.

cd scripts
bash pretrained_lm.sh

Training LSTM and classifier

To pre-train the knowledge estimation model (LSTM) and the classifier, run python main_student_model.py on the command line. All parameters can be changed in the configs_student_model.yaml file.

Training OKT

To train the OKT model, run python main_okt.py on the command line. All parameters can be changed in the configs_okt.yaml file. We use Neptune.ai to track our experiment results. To use Neptune.ai as well, set neptune_project and neptune_api in the parameter list to your own Neptune credentials.
Note: To use a knowledge tracing (KT) model other than the LSTM for knowledge estimation in OKT, you need a pre-trained KT model. Two KT models (AKT and DKVMN) are integrated in our code but commented out, so uncomment them first. To use them, follow the AKT and DKVMN repositories to pre-train the corresponding KT models.

Results and Evaluation

We use two metrics, CodeBLEU and Dist-N, and integrate their code into this repo. To learn more about the evaluation metrics, please refer to their corresponding websites. Trained models and generation results are saved in a newly created directory checkpoints\$TIME, where $TIME is the current time in date-time format. The directory contains two models (lstm for knowledge tracing and model for the generative model) and an eval_log.pkl file, which records the CodeBLEU score, Dist-1, and the generated student answers alongside the ground-truth answers for comparison. A set of trained results can be downloaded with the commands below.

cd scripts
bash results.sh
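
For readers unfamiliar with Dist-N, the metric is commonly defined as the number of distinct n-grams divided by the total number of n-grams in the generated outputs, so higher values indicate more diverse generations. The sketch below follows that common definition and assumes pre-tokenized input; the evaluator integrated in this repo may tokenize or aggregate differently, and the function name `distinct_n` is hypothetical.

```python
def distinct_n(token_sequences, n=1):
    """Ratio of unique n-grams to total n-grams, pooled over all sequences."""
    total = 0
    unique = set()
    for tokens in token_sequences:
        # Slide a window of length n over the token list.
        for i in range(len(tokens) - n + 1):
            unique.add(tuple(tokens[i:i + n]))
            total += 1
    # Assumption: return 0.0 when no n-grams exist, to avoid division by zero.
    return len(unique) / total if total else 0.0


if __name__ == "__main__":
    # Assumption: whitespace tokenization of two toy generated answers.
    generated = ["int x = 0 ;".split(), "int y = 0 ;".split()]
    print("Dist-1:", distinct_n(generated, n=1))
    print("Dist-2:", distinct_n(generated, n=2))
```

With n=1 this corresponds to the Dist-1 value mentioned above, under the stated tokenization assumption.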

Code for creating the plots in the paper (visualizations of knowledge states in the latent space and their trajectories) can be found in the notebooks directory.

Citations

Please cite our paper if you find it helpful to your work!

@article{liu2022open,
  title={Open-Ended Knowledge Tracing},
  author={Liu, Naiming and Wang, Zichao and Baraniuk, Richard G and Lan, Andrew},
  journal={arXiv preprint arXiv:2203.03716},
  year={2022}
}
