🐛 fix: fix reward model train seq_cls #3921

gaohongkui · 2025-04-17T12:27:18Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

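Fixes a problem that occurs when a reward model is used as the base model for multi-class classification training. When a reward model is used for a sequence classification (seq_cls) task with num_labels > 1, differences in model structure can cause a size mismatch. This fix resolves the problem by automatically setting ignore_mismatched_sizes=True.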

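Main changes:

In the get_model_tokenizer_from_local function, when the model is detected to be a reward model used for a multi-class task (num_labels > 1), ignore_mismatched_sizes=True is set automatically (the classification head is re-added)
Added a corresponding warning log to tell users that this configuration is in effect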
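
The decision described above can be sketched as a small helper. This is only an illustration and not the code from this PR: the function name `resolve_seq_cls_kwargs`, the `is_reward_model` flag, and the warning text are hypothetical. Only `ignore_mismatched_sizes` and `num_labels` come from the description.

```python
import logging
from typing import Any, Dict

logger = logging.getLogger(__name__)


def resolve_seq_cls_kwargs(is_reward_model: bool, num_labels: int,
                           model_kwargs: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of model_kwargs, adjusted for reward-model seq_cls training.

    Hypothetical helper: the name and signature are assumptions made for
    illustration, not the actual code in get_model_tokenizer_from_local.
    """
    kwargs = dict(model_kwargs)  # copy so the caller's dict is not mutated
    if is_reward_model and num_labels > 1:
        # The checkpoint's existing head does not match the requested number
        # of labels, so let the loader skip the mismatched weights and
        # re-initialize the classification head.
        logger.warning(
            'Reward model used for seq_cls with num_labels=%d; '
            'setting ignore_mismatched_sizes=True.', num_labels)
        kwargs['ignore_mismatched_sizes'] = True
    return kwargs
```

The returned dictionary would then be passed on to the model loading call. When the model is not a reward model, or when num_labels is 1, the arguments are left unchanged.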

Experiment results

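No experiment results are needed. This is a functional fix that ensures the following scenario works correctly:

Using a reward model as the base model
Running a multi-class task (num_labels > 1)
Size mismatches are handled automatically when the model is loaded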

Jintao-Huang · 2025-04-26T08:24:51Z

I understand this fix now, thanks.

Jintao-Huang · 2025-04-26T08:30:35Z

Please merge the main branch and run the following commands to tidy up the code:

pip install pre-commit
pre-commit run --all-files

gaohongkui · 2025-04-26T11:21:28Z

@Jintao-Huang The code has been formatted.

gaohongkui mentioned this pull request Apr 18, 2025

Error when training a seq_cls task with AutoModelForSequenceClassification #3927

Open

Jintao-Huang approved these changes Apr 26, 2025

🐛 fix: fix reward model train seq_cls

2ef6c96

gaohongkui force-pushed the main branch from b2474b3 to 2ef6c96 Compare April 26, 2025 11:18

Merge branch 'modelscope:main' into main

29166ed

Jintao-Huang merged commit a2a858d into modelscope:main Apr 26, 2025
2 checks passed
