Issues: modelscope/ms-swift

Issues list

fix _tp_plan
#4167 by Jintao-Huang was merged May 12, 2025
Error when training kimivl: assert not self.training
#4166 by zss205 was closed May 12, 2025
support internvl3 pretrain instruct
#4164 by Jintao-Huang was merged May 11, 2025
[megatron]Support packing & CP
#4163 by Jintao-Huang was merged May 11, 2025
Support ulysses streaming
#4160 by tastelikefeet was merged May 10, 2025
1 of 4 tasks
update readme
#4157 by Jintao-Huang was merged May 9, 2025
Add more evaluation args
#4155 by Yunnglin was merged May 9, 2025
1 of 4 tasks
Add sp script
#4154 by tastelikefeet was merged May 9, 2025
1 of 4 tasks
[grpo] support gen rm
#4151 by hjh0119 was merged May 11, 2025
1 of 4 tasks
Fix bugs
#4150 by Jintao-Huang was merged May 9, 2025
fix ulysses dpo
#4149 by tastelikefeet was merged May 9, 2025
1 of 4 tasks
fix init parameters
#4148 by lincq2000 was merged May 9, 2025
1 of 4 tasks
Megatron SFT raises a CUDA error when context_parallel_size>1 [bug]
#4144 by Emperorizzis was closed May 11, 2025
Feature freezing/activating parameters via regex
#4143 by lincq2000 was merged May 9, 2025
2 of 4 tasks
Support init parameters
#4141 by lincq2000 was merged May 9, 2025
2 of 4 tasks
grpo code reward by judge0
#4140 by kevssim was merged May 9, 2025
2 of 4 tasks
[grpo] fix labels pop and peftmodel parameter check
#4136 by hjh0119 was merged May 8, 2025
1 of 4 tasks
support more vision dataset
#4132 by hjh0119 was closed May 12, 2025 Draft
1 of 4 tasks
[megatron] support max_epochs
#4125 by Jintao-Huang was merged May 9, 2025
[grpo] fix multi modal doc
#4124 by hjh0119 was merged May 11, 2025
1 of 4 tasks
update qwen3 more models
#4123 by Jintao-Huang was merged May 8, 2025
fix sequence_parallel
#4122 by Jintao-Huang was merged May 7, 2025
raise IndexError(f"Index {index} out of range for dataset of size {size}.") [duplicate]
#4120 by Sendren was closed May 8, 2025