ExRec – Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing (NeurIPS 2025)

TL;DR. ExRec turns a calibrated Knowledge‑Tracing (KT) model into a fully fledged RL environment and learns exercise‑recommendation policies that directly optimise students’ knowledge gains.

This is the repository of ExRec: Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing. [link to our paper]

Note: If you find our work valuable, we kindly ask you to consider citing our work.

@article{ozyurt2025personalized,
  title={Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing},
  author={Ozyurt, Yilmazcan and Almaci, Tunaberk and Feuerriegel, Stefan and Sachan, Mrinmaya},
  journal={NeurIPS},
  year={2025}
}

What is ExRec?

ExRec is a general‑purpose framework for personalising learning materials with minimal supervision.
It (i) annotates questions with Knowledge Concepts (KCs) & solution steps, (ii) learns semantic embeddings, (iii) trains & calibrates a KT model to output KC‑level mastery, and (iv) plugs this calibrated model into a variety of RL algorithms—including our Model‑Based Value Estimation (MVE)—to recommend the next best exercise.

Key features

Open‑corpus ready – no need for manual KC labels.
Compact student state & KC‑aware rewards – designed for efficient RL.
Algorithm‑agnostic – works with discrete or continuous action spaces, on‑ or off‑policy learners.
Interpretable – every action corresponds to a real exercise and maps back to target KCs.

Installation

Datasets
- XES3G5M – download the original corpus from https://github.com/ai4ed/XES3G5M and copy to data/XES3G5M/.
  – We ship English question texts, KC annotations & clusters in data/XES3G5M/metadata/ (no additional cost).
Python environments
We recommend 3 separate python virtual environments for separate modules.
- Modules 1 and 2: KC Annotation and Representation Learning
```
# Install required libraries
pip install -r requirement_env_rep_learning_.txt
```
- Module 3: KT Training with KC Calibration
```
# Install required libraries
pip install -r requirements_env_pykt.txt

# Install our customized PyKT to be loaded as a module
cd pykt-toolkit && pip install -e .
```
- Module 4: RL Framework for Exercise Recommendation
```
# Install required libraries
pip install -r requirements_env_rl.txt

# Install our customized PyKT to be loaded as a module
cd pykt-toolkit && pip install -e .

# Install our customized exercise_recommender to be loaded as a module
cd ../exercise_recommender && pip install -e .
```
Note: The default version of Tianshou library does not provide a an efficient implementation of vectorized environment for environments (such as our calibrated KT model) on GPU. For this, we wrote multiple wrappers (see exercise_recommender/wrappers) and process multiple in environments in batches in GPU. As a result, we support a particular version of Tianshou library as our wrappers may not work with the future updates of the library.

Module Details

Module 1 – KC Annotation

This part shows the example use case of our fully automated KC annotation pipeline. First, locate the kc_annotation folder. Then you can follow the steps below.

# a) generate solution steps
python get_step_by_step_solutions.py --original_question_file ../data/XES3G5M/metadata/questions_translated.json --annotated_question_file ../data/XES3G5M/metadata/questions_translated_sol_annotated.json

# b) annotate KCs and map steps ↔ KCs (two steps merged for efficiency)
python get_kc_annotations_and_mapping.py --original_question_file ../data/XES3G5M/metadata/questions_translated_kc_annotated.json --annotated_question_file ../data/XES3G5M/metadata/questions_translated_kc_sol_annotated_mapped.json

Skip above step if you reuse our pre‑computed file
data/XES3G5M/metadata/questions_translated_kc_sol_annotated_mapped.json.

Module 2 – Representation Learning

For this part, first locate the representation_learning folder. Then you can train the language model as below.

python train.py --json_file_dataset ../data/XES3G5M/metadata/questions_translated_kc_sol_annotated_mapped.json --json_file_cluster_kc ../data/XES3G5M/metadata/kc_clusters.json --json_file_kc_questions data/XES3G5M/metadata/kc_questions_map.json --wandb_project_name <your_wandb_project_name>

Note that the above command requires you to setup your wandb account first.

After training, you can save the embeddings by following save_embeddings.ipynb.

Skip above step if you would like to reuse our pre‑computed embeddings:

Due to GitHub’s file size limits, we split the original qid2content_sol_avg_emb.json into two smaller files: data/XES3G5M_embeddings/part1_qid2content_sol_avg_emb.json and data/XES3G5M_embeddings/part2_qid2content_sol_avg_emb.json .

To reconstruct the full embeddings file, simply run:

python data/XES3G5M_embeddings/reconstruct_embeddings.py

This will create qid2content_sol_avg_emb.json in the same directory. You may also reuse data/XES3G5M_embeddings/kc_emb.json directly, as it is below GitHub’s size threshold.

Module 3 – KT Training with KC Calibration

As explained in the paper, we first train the KT model with the performance prediction objective.

cd pykt-toolkit/train_test
python queemb_dkt_train.py --emb_path=data/XES3G5M_embeddings/qid2content_sol_avg_emb.json --flag_load_emb --use_wandb --wandb_project_name=<your_wandb_project_name>

Then we calibrate the KT model for running the inference directly on KC embeddings to predict the knowledge states.

cd pykt-toolkit/train_test
python calibration_train.py --flag_use_cluster --flag_joint_train --flag_lstm_frozen --emb_path=data/XES3G5M_embeddings/qid2content_sol_avg_emb.json  --clusters_to_qids_path=data/XES3G5M/metadata/cluster_to_que_ids_map.json --kc_emb_path=data/XES3G5M_embeddings/kc_emb.json --pretrained_model_path=</path/to/pretrained_model> --kc_to_questions_path=data/XES3G5M/metadata/kc_questions_map.json

Skip this step if you reuse our pre‑trained KT model
data/pretrained_kt_model.ckpt .

Module 4 – RL for Exercise Recommendation

We support multiple back‑ends (via Tianshou library) and both continuous & discrete action agents.

Example: Soft Actor‑Critic (SAC) + MVE

python train_test/current_kc_train_sac.py --critic_model=critic_dkt --tau=0.01 --gamma=0.99 --alpha=0.2 --deterministic_eval --use_wandb --wandb_project_name=current_kc_sac

Benchmark Tasks

Task ID	Description	Reward
`global`	Improve overall mastery across all KCs	Δ mean mastery
`practiced`	Focus on the KC most recently practised	Δ mastery of the practiced KC
`upcoming`	Anticipate the empirically next KC	Δ mastery of upcoming KC
`weakest`	Always target the student’s weakest KC	Δ mastery of weakest KC

How to run each experiment

Below we provide the example scripts for how to run each experiment. Although we provide the examples for one RL algorithm, it can be easily extended to other algorithms by choosing their respective training scripts.

Task 1: Global Knowledge Improvement

python train_test/all_kc_train_sac.py --critic_model=critic_dkt --tau=0.01 --gamma=0.99 --alpha=0.2 --deterministic_eval --use_wandb --wandb_project_name=all_kc_sac

Task 2: Knowledge Improvement in Practiced KC

python train_test/current_kc_train_sac.py --critic_model=critic_dkt --tau=0.01 --gamma=0.99 --alpha=0.2 --deterministic_eval --use_wandb --wandb_project_name=current_kc_sac

Task 3: Task 3: Knowledge Improvement in Upcoming KC

python train_test/upcoming_kc_train_sac.py --tau=0.01 --gamma=0.99 --alpha=0.2 --deterministic_eval --use_wandb --wandb_project_name=upcoming_kc_sac

Task 4: Task 4: Knowledge Improvement in Weakest KC

python train_test/worst_kc_train_sac.py --tau=0.01 --gamma=0.99 --alpha=0.2 --deterministic_eval --use_wandb --wandb_project_name=worst_kc_sac

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
exercise_recommender		exercise_recommender
kc_annotation		kc_annotation
pykt-toolkit		pykt-toolkit
representation_learning		representation_learning
train_test		train_test
.DS_Store		.DS_Store
README.md		README.md
__init__.py		__init__.py
overview.png		overview.png
requirement_env_rl_.txt		requirement_env_rl_.txt
requirements_env_pykt.txt		requirements_env_pykt.txt
requirements_env_rep_learning.txt		requirements_env_rep_learning.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ExRec – Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing (NeurIPS 2025)

Table of Contents

What is ExRec?

Installation

Module Details

Module 1 – KC Annotation

Module 2 – Representation Learning

Module 3 – KT Training with KC Calibration

Module 4 – RL for Exercise Recommendation

Example: Soft Actor‑Critic (SAC) + MVE

Benchmark Tasks

How to run each experiment

Task 1: Global Knowledge Improvement

Task 2: Knowledge Improvement in Practiced KC

Task 3: Task 3: Knowledge Improvement in Upcoming KC

Task 4: Task 4: Knowledge Improvement in Weakest KC

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ExRec – Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing (NeurIPS 2025)

Table of Contents

What is ExRec?

Installation

Module Details

Module 1 – KC Annotation

Module 2 – Representation Learning

Module 3 – KT Training with KC Calibration

Module 4 – RL for Exercise Recommendation

Example: Soft Actor‑Critic (SAC) + MVE

Benchmark Tasks

How to run each experiment

Task 1: Global Knowledge Improvement

Task 2: Knowledge Improvement in Practiced KC

Task 3: Task 3: Knowledge Improvement in Upcoming KC

Task 4: Task 4: Knowledge Improvement in Weakest KC

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages