We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Describe the feature 目前GRPO的异步训练只提供了单机示例脚本,但多机情况下有如下问题希望得到解答:
感谢。
The text was updated successfully, but these errors were encountered:
同样的问题,目前在训练32B的多模态模型,GRPO这部分使用deepspeed分布式,想单独用一个节点部署模型,其他节点进行训练,能否提供下实例,谢谢 @
Sorry, something went wrong.
+1
No branches or pull requests
Describe the feature
目前GRPO的异步训练只提供了单机示例脚本,但多机情况下有如下问题希望得到解答:
感谢。
The text was updated successfully, but these errors were encountered: