Fix grpo eval when gas > 1 #4057

hjh0119 · 2025-05-01T07:50:41Z

* main:
  fix enable_cache (modelscope#4091)
  Support ulysses for llm/mllm,dpo/sft (modelscope#4085)
  update docs (modelscope#4078)
  feat: support megatron wandb (modelscope#4074)
  feat: add run name support (modelscope#4072)
  fix padding_side left (modelscope#4069)
  bump version
  support MiMo-7B (modelscope#4067)
  fix packing eval streaming (modelscope#4066)
  Support empty think loss scale (modelscope#4065)
  support qwen3-moe awq (modelscope#4059)
  Fix grpo eval when gas > 1 (modelscope#4057)
  fix rollout(modelscope#4055)
  updates GRPOTrainer compatible with trl 0.17 (modelscope#3969)
  support Qwen2.5-Omni-3B (modelscope#4052)
  update wechat (modelscope#4047)

# Conflicts:
#	swift/llm/train/tuner.py

hjh0119 added 2 commits May 1, 2025 12:21

fix

55407d9

fix

c90cd03

Jintao-Huang approved these changes May 1, 2025

hjh0119 merged commit 49394e1 into modelscope:main May 1, 2025
2 checks passed

hjh0119 deleted the fix-eval branch May 1, 2025 08:25