We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我已经完成了一些基础sft的任务,现在我想让我的模型在新任务上表现的更好,同时不损失之前的知识。 我注意到ms-swift中提供了rft的脚本,并作出了相关说明,但是文档描述中好像grpo也是强化微调的一部分? 那我是使用强化微调还是直接使用grpo呢?
The text was updated successfully, but these errors were encountered:
No branches or pull requests
我已经完成了一些基础sft的任务,现在我想让我的模型在新任务上表现的更好,同时不损失之前的知识。
我注意到ms-swift中提供了rft的脚本,并作出了相关说明,但是文档描述中好像grpo也是强化微调的一部分?
那我是使用强化微调还是直接使用grpo呢?
The text was updated successfully, but these errors were encountered: