Skip to content

ms-swift vs r1v in grpo of Qwen-2.5vl #3901

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
Aurorana opened this issue Apr 16, 2025 · 0 comments
Open

ms-swift vs r1v in grpo of Qwen-2.5vl #3901

Aurorana opened this issue Apr 16, 2025 · 0 comments

Comments

@Aurorana
Copy link

请问在qwen-2.5vl的grpo训练上,ms-swift相较于r1v在速度和显存使用情况上有无优势?r1v可以生成同时生成num_generation个结果,而ms-swift在生成时只生成一个?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant