Insights: huggingface/open-r1
Overview
- 3 Merged pull requests
- 2 Open pull requests
- 1 Closed issue
- 0 New issues
3 Pull requests merged by 2 people
- Fix style again :) (#636 merged May 8, 2025)
- Code Execution using Morph Cloud (#614 merged May 8, 2025)
- Fix style (#631 merged May 5, 2025)
2 Pull requests opened by 2 people
- Use pass@1 for all evals (#633 opened May 5, 2025)
- Add dataset filtering script (#637 opened May 9, 2025)
1 Issue closed by 1 person
- Release 32B math-220k supervised fine-tuned weights (#634 closed May 8, 2025)
12 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- Requiring the recipe for training the GRPO model of OlympicCoder (#623 commented on May 2, 2025 • 0 new comments)
- data-parallel evaluation cause an error of no GPU available (#245 commented on May 4, 2025 • 0 new comments)
- How to contribute (#23 commented on May 6, 2025 • 0 new comments)
- Training with vllm not supports Qwen3 seriers (#630 commented on May 7, 2025 • 0 new comments)
- When I run the GRPO demo, I find that format_reward is always 0!!! (#235 commented on May 7, 2025 • 0 new comments)
- Sequence length problem (#579 commented on May 8, 2025 • 0 new comments)
- OpenR1-Qwen-7B achieves 47.40 on AIME24, better than reported! (#622 commented on May 9, 2025 • 0 new comments)
- Reproducing GRPO based on Qwen2.5-1.5B-Instruct and using Math-220K dataset Yields Unexpected Results (#538 commented on May 9, 2025 • 0 new comments)
- The kl divergence collapses but the format reward becomes larger (#373 commented on May 9, 2025 • 0 new comments)
- [WIP] R1-Zero-like experiments (#569 commented on May 8, 2025 • 0 new comments)
- When the chat_template is not set in the YAML configuration file, crashes (#621 commented on May 7, 2025 • 0 new comments)
- GRPO with codeforces problems (#627 commented on May 2, 2025 • 0 new comments)