grpotrainer

Here are 5 public repositories matching this topic...

An implementation of GRPO for Unsloth's VLMs training

reinforcement-learning vlm huggingface trl unsloth grpo grpotrainer

Your efficient and accurate answer verification system for RL training.

rl academic-project llm grpo grpotrainer

simpleR1: A Simple Framework for Training R1-like Models

Recreating the minimal training methods of DeepSeek-R1 for small langauge models.

reasoning r1 grpo grpotrainer

interface mcp model-context-protocol mcp-client grpo grpotrainer

Add a description, image, and links to the grpotrainer topic page so that developers can more easily learn about it.

To associate your repository with the grpotrainer topic, visit your repo's landing page and select "manage topics."