We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Describe the bug RewardTrainer中是支持设置coeffi参数的 但是如何通过swift rlhf --rlhf_type rm ...命令控制这个参数的传入呢? 如果不设置的话,从args.json文件可以看到 json { ..., "training_args": "RewardConfig(...,center_rewards_coefficient=None,...) }
json { ..., "training_args": "RewardConfig(...,center_rewards_coefficient=None,...) }
The text was updated successfully, but these errors were encountered:
#3917 You can now pass it using --center_rewards_coefficient.
Sorry, something went wrong.
还是不行的。
ps,我用的版本是
不好意思,没注意到你已经fix过了,我更新版本试试 @hjh0119
No branches or pull requests
Describe the bug

RewardTrainer中是支持设置coeffi参数的
但是如何通过swift rlhf --rlhf_type rm ...命令控制这个参数的传入呢?
如果不设置的话,从args.json文件可以看到
json { ..., "training_args": "RewardConfig(...,center_rewards_coefficient=None,...) }
The text was updated successfully, but these errors were encountered: