Code for reproducing the experiments in "What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?"
conda create -n reasoning_generalization python=3.9
conda activate reasoning_generalization
cd reasoning_generalization
pip install -r requirements.txt
Fill in your Hugging Face token in huggingface_params.py.
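If you would rather not hardcode the token, one option is to read it from an environment variable. The snippet below is a minimal sketch, not the contents of huggingface_params.py: the function name `load_hf_token` is hypothetical, and the variable names that huggingface_params.py actually expects are not shown in this README. `HF_TOKEN` is the environment variable that the `huggingface_hub` library reads by default.

```python
import os


def load_hf_token(env_var: str = "HF_TOKEN") -> str:
    """Return the Hugging Face token stored in the given environment variable.

    Hypothetical helper for illustration only; adapt the result to whatever
    variable huggingface_params.py defines.
    """
    token = os.environ.get(env_var, "").strip()
    if not token:
        # Fail early with a clear message instead of a later authentication error.
        raise RuntimeError(
            f"Environment variable {env_var} is not set; "
            "export your Hugging Face token before running the scripts."
        )
    return token
```

With this in place, you could run `export HF_TOKEN=<your token>` in the shell and assign the result of `load_hf_token()` to the token variable in huggingface_params.py.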
See gsm8k_run.sh or math_run.sh for examples of training and evaluation scripts.
See gsm8k_analyze.ipynb for analysis code.
Our codebase borrows code from stanford_alpaca.

