-
Notifications
You must be signed in to change notification settings - Fork 718
Pull requests: modelscope/ms-swift
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[model] support Tencent-Hunyuan/Hunyuan-A13B-Instruct
#4745
by Jintao-Huang
was merged Jun 27, 2025
Loading…
[grpo] fix max_step for dataloader when applying sequence parallel
#4731
by 0russwest0
was merged Jun 26, 2025
Loading…
support Kimi-VL-A3B-Thinking-2506 & Kimi-Dev-72B
#4719
by Jintao-Huang
was merged Jun 25, 2025
Loading…
[doc] simplify environment variables & update best practices documentation
#4715
by 0russwest0
was merged Jun 25, 2025
Loading…
[megatron] support rednote-hilab/dots.llm1.inst
#4707
by Jintao-Huang
was merged Jun 25, 2025
Loading…
[grpo]Tool rl: add reward func for ToolRL
#4694
by tpx818
was merged Jun 27, 2025
Loading…
1 of 4 tasks
docs: correct typo "resonse" to "response"
#4672
by kv-chiu
was merged Jun 23, 2025
Loading…
1 of 4 tasks
[feat] support fine-tuning of reranker models
#4671
by 0russwest0
was merged Jun 24, 2025
Loading…
1 of 4 tasks
[channel loss]support packing & padding free
#4666
by kevssim
was merged Jun 23, 2025
Loading…
1 of 4 tasks
[megatron] support DeepseekV2ForCausalLM and DeepseekV3ForCausalLM
#4659
by Jintao-Huang
was merged Jun 25, 2025
Loading…
[gkd] support use_logits_to_keep/padding_free/packing & update gkd shell
#4658
by Jintao-Huang
was merged Jun 21, 2025
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.