A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation of large language models
nlp machine-learning awesome reinforcement-learning rl awesome-list knowledge-distillation model-compression post-training opd distillation self-distillation llm rlhf gkd llm-training speculative-decoding on-policy-distillation minillm llm-distillation
-
Updated
May 11, 2026