Insights: huggingface/open-r1
Overview
6 Pull requests merged by 3 people
- Async code reward fixes (#546, merged Mar 28, 2025)
- fix dataset parsing error (#540, merged Mar 28, 2025)
- Restore single-node instructions to run GRPO (#549, merged Mar 27, 2025)
- [WIP] RL goes brrr (#533, merged Mar 24, 2025)
- Fixes missing exception in run_script (#532, merged Mar 24, 2025)
- fix get_reward_funcs bug (#535, merged Mar 22, 2025)
1 Pull request opened by 1 person
- Configurable reward functions (#552, opened Mar 27, 2025)
7 Issues closed by 5 people
- log info (#554, closed Mar 28, 2025)
- Stuck in evaluation after maximum concurrency info (#551, closed Mar 28, 2025)
- How to train GRPO on 2 nodes (16 GPUs) (#370, closed Mar 26, 2025)
- Error with latest setup.py: trl.extras.vllm_client - Server is not up yet (#543, closed Mar 25, 2025)
- Stuck at lighteval "COMPUTING METRICS" (#531, closed Mar 25, 2025)
- `trl` version mismatch (#541, closed Mar 24, 2025)
- NCCL problem occurred when multiple GPU cards are saving model.safetensors (#160, closed Mar 24, 2025)
12 Issues opened by 12 people
- accuracy_reward: difference in ordering of arguments in verify? (#557, opened Mar 28, 2025)
- Bug: sft.py Doesn't Work for Non-Qwen Models & Has Issues with Generation (#556, opened Mar 28, 2025)
- The responses are always "!!!!!!!!!!!!!!!!!!!!!!!!!" during GRPO training (#555, opened Mar 28, 2025)
- Cannot Resume Training From a Trained Checkpoint. Is this a bug? (#553, opened Mar 28, 2025)
- Please fix GRPO default config bug (#550, opened Mar 27, 2025)
- Memory usage of different lengths (#548, opened Mar 26, 2025)
- Proposal: extensible reward functions (#547, opened Mar 26, 2025)
- Saving checkpoints error during GRPO (#544, opened Mar 25, 2025)
- SFT on Qwen2.5-1.5B-Instruct fails (#539, opened Mar 24, 2025)
- Error when setting packing=false (#536, opened Mar 22, 2025)
25 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- 🦜 Enhance repetition penalty reward for languages that cannot be split by whitespace (#516, commented on Mar 25, 2025 • 2 new comments)
- SFT: learn to generate EOS token (#494, commented on Mar 25, 2025 • 0 new comments)
- Extend max_model_length to prevent context truncation (#463, commented on Mar 27, 2025 • 0 new comments)
- Resolve double BOS token issue (#462, commented on Mar 27, 2025 • 0 new comments)
- [DO NOT MERGE] SFT configs for Qwen coder models (#438, commented on Mar 23, 2025 • 0 new comments)
- New GRPO dataset and tasks: formally-verified program correctness (#379, commented on Mar 23, 2025 • 0 new comments)
- Crazy VRAM usage with longer prompts (#47, commented on Mar 27, 2025 • 0 new comments)
- lighteval script failed (#468, commented on Mar 27, 2025 • 0 new comments)
- The KL divergence collapses but the format reward becomes larger (#373, commented on Mar 27, 2025 • 0 new comments)
- Is it normal for a 1.5B model on an H100 80G to require several hundred hours for LiveCodeBench? (#466, commented on Mar 26, 2025 • 0 new comments)
- Prefix Caching should be turned off for GRPO (#491, commented on Mar 26, 2025 • 0 new comments)
- Instead of rising steadily, the reward fluctuates wildly (#403, commented on Mar 26, 2025 • 0 new comments)
- Does anyone have a working SFT training script for 1xH100? - OOM Error (#332, commented on Mar 26, 2025 • 0 new comments)
- GRPO with multiple GPUs got stuck (#478, commented on Mar 26, 2025 • 0 new comments)
- How to increase the context window from 4k to 32k on Qwen models? (#444, commented on Mar 26, 2025 • 0 new comments)
- [Installation] Failed to Build vllm on ARM Architecture with uv pip install - Unknown Runtime Environment (#510, commented on Mar 25, 2025 • 0 new comments)
- GRPO OOM (#475, commented on Mar 25, 2025 • 0 new comments)
- Evaluate GRPO vs. other RL algorithms (#11, commented on Mar 25, 2025 • 0 new comments)
- Can I use two GPUs for vLLM? (#471, commented on Mar 25, 2025 • 0 new comments)
- failed (exitcode: -8) local_rank: 6 (pid: 58423) of binary: /opt/miniconda/bin/python when running GRPO (#254, commented on Mar 25, 2025 • 0 new comments)
- SFT model makes repetitions during the inference phase (#492, commented on Mar 24, 2025 • 0 new comments)
- Fail to parse gold solution (#503, commented on Mar 24, 2025 • 0 new comments)
- Different max_position_embeddings and rope_theta in OpenR1-Qwen-7B-SFT and its base Qwen2.5-Math-7B-Instruct? (#469, commented on Mar 23, 2025 • 0 new comments)
- OOM: SFT of Qwen2.5-1.5B-Instruct on OpenR1-Math-220k (#506, commented on Mar 22, 2025 • 0 new comments)
- After completing Step-1 training using the given example of Qwen2.5-1.5B-Instruct, the performance has decreased. Is this normal? (#355, commented on Mar 22, 2025 • 0 new comments)