Skip to content

fix: fix math verify error#27

Merged
lkevinzc merged 1 commit intomainfrom
math-verify
Jun 19, 2025
Merged

fix: fix math verify error#27
lkevinzc merged 1 commit intomainfrom
math-verify

Conversation

@lkevinzc
Copy link
Contributor

No description provided.

@lkevinzc
Copy link
Contributor Author

image

https://wandb.ai/axon-rl/goat/runs/0e10cch2?nw=nwuserlkevinzc

python train.py \
    --env_id math:Math12K \
    --gamma 0.9 \
    --gpus 8 \
    --gradient-checkpointing \
    --num_samples 1 \
    --rollout_batch_size 128 \
    --num_envs 16 \
    --rollout_batch_size_per_device 16 \
    --pi_buffer_maxlen_per_device 16 \
    --pretrain Qwen/Qwen3-0.6B-Base \
    --prompt_template qwen3_general \
    --enable_prefix_caching \
    --collocate \
    --vllm_sleep \
    --vllm_gpu_ratio 0.45 \
    --rnd-seed \
    --learning_rate 0.000001 \
    --lr_scheduler constant \
    --lr_warmup_ratio 0 \
    --num_ppo_epochs 2 \
    --train_batch_size 128 \
    --train_batch_size_per_device 1 \
    --beta 0 \
    --max_model_len 12800 \
    --generate_max_length 4096 \
    --temperature 1.0 \
    --top_p 1 \
    --eval_steps -1 \
    --save_steps -1 \
    --eval_temperature 0.6 \
    --eval_top_p 0.95 \
    --eval_generate_max_length 4096 \
    --max_train 65000 \
    --max_save_num 30 \
    --use-wb \
    --wb-org axon-rl \
    --wb-run-name zichen-qwen3-0.6b-base-$1-gamma0.9 \
    --wb_project goat \
    --debug

@lkevinzc lkevinzc merged commit 41ba2a9 into main Jun 19, 2025
@lkevinzc lkevinzc deleted the math-verify branch June 19, 2025 11:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant