veri-examples

Runnable example scripts for the Veri RL post-training platform.

quickstart.py — End-to-end GRPO training: connect a HuggingFace dataset, upload a reward function, submit a job, wait, download the checkpoint.
math_reward_trl.py — TRL-format reward function for math reasoning (def reward(completions, answer, **kwargs) -> list[float]).
math_reward.py — Miles-format async reward function (async def reward(args, sample, **kwargs) -> float).
gsm8k_prompts.jsonl — 20-row sample dataset in Veri's chat format, useful for smoke tests without pulling the full GSM8K from HuggingFace.

Run

pip install veri-sdk
veri login --key vk_your_key
python quickstart.py

Full documentation: docs.veri.studio.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
gsm8k_prompts.jsonl		gsm8k_prompts.jsonl
math_reward.py		math_reward.py
math_reward_trl.py		math_reward_trl.py
quickstart.py		quickstart.py