Skip to content

[grpo] fix labels pop and peftmodel parameter check #4136

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
May 8, 2025

Conversation

hjh0119
Copy link
Collaborator

@hjh0119 hjh0119 commented May 8, 2025

PR type

  • Bug Fix
  • New Feature
  • Document Updates
  • More Models or Datasets Support

PR information

  1. in some case, model forward with labels will cause logits None, we pop labels here

  2. for peft model, inspect inner model to save memory with logits_to_keep parameter(super)

Experiment results

Paste your experiment result here(if needed).

@hjh0119 hjh0119 merged commit 7cee5b6 into modelscope:main May 8, 2025
1 of 2 checks passed
@hjh0119 hjh0119 deleted the fix-label branch May 8, 2025 12:40
tastelikefeet added a commit to tastelikefeet/swift that referenced this pull request May 9, 2025
* main:
  Support init parameters (modelscope#4141)
  Feature freezing/activating parameters via regex (modelscope#4143)
  grpo code reward by judge0 (modelscope#4140)
  [megatron] support max_epochs (modelscope#4125)
  [grpo] fix labels pop and peftmodel parameter check  (modelscope#4136)

# Conflicts:
#	docs/source/Instruction/命令行参数.md
#	docs/source_en/Instruction/Command-line-parameters.md
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants