Weight_decay seems to have no effect in GRPO training? #3931
Comments
I encountered the same problem, but the 'eval_limit' setting was not taking effect in GRPO.
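I did not explicitly set Weight_decay in the training command-line arguments, but according to both the documentation and the parameters actually in use, its default value is 0.1. However, I do not see any decay in the learning rate. Is this a display issue, or is Weight_decay simply not taking effect? Note that warm_up does take effect.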
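For context on the question above, weight decay and learning-rate decay are usually separate mechanisms: weight decay shrinks the parameters at each optimizer step, while the shape of the learning-rate curve comes from the scheduler. The following is a minimal pure-Python sketch of that distinction, not the project's actual training code. The function names, the constant-after-warmup schedule, and the values `base_lr = 1e-3` and `warmup_steps = 5` are assumptions chosen for illustration; only the 0.1 weight decay comes from the issue.

```python
def decayed_sgd_step(weight, grad, lr, weight_decay):
    """One SGD step with decoupled (AdamW-style) weight decay.

    Hypothetical helper for illustration only.
    """
    # Weight decay multiplies the parameter by a factor slightly below 1.
    weight = weight * (1.0 - lr * weight_decay)
    # The gradient update is applied separately.
    return weight - lr * grad


def warmup_then_constant_lr(step, base_lr, warmup_steps):
    """Assumed schedule: linear warmup, then a constant learning rate."""
    if step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    return base_lr


base_lr = 1e-3       # assumed value
weight_decay = 0.1   # default value reported in the issue
warmup_steps = 5     # assumed value

weight = 1.0
lrs = []
for step in range(20):
    lr = warmup_then_constant_lr(step, base_lr, warmup_steps)
    lrs.append(lr)
    # Zero gradient isolates the effect of weight decay on the parameter.
    weight = decayed_sgd_step(weight, grad=0.0, lr=lr, weight_decay=weight_decay)

print("learning rates:", lrs)
print("final weight:", weight)
```

In this sketch the logged learning rate rises during warmup and then stays flat, even though weight decay is active and the weight itself shrinks. Whether the learning rate falls after warmup would depend on the scheduler setting rather than on weight decay.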