VeriThinker: Learning to Verify Makes Reasoning Model Efficient
efficiency fine-tuning large-language-models reasoning-models deepseek-r1-distill-llama deepseek-r1-distill-qwen
-
Updated
Jul 11, 2025 - Python