[Bug]: resize_token_embedding仍然存在bug，bart,pegasus无效，其他生成模型最后一层lm_head不会被resize #6008

xiaotingyun · 2023-05-23T08:44:50Z

软件环境

- paddlepaddle:2.4.0
- paddlepaddle-gpu: 2.4.0
- paddlenlp: 2.5.2

重复问题

I have searched the existing issues

错误描述

resize_token_embedding方法仍然存在问题，在bart和pegasus中无效，只更新share，但实际使用的encoder和decoder的embedding层不会更新，最后一层lm_head也不会更新，可能是因为bart和pegasus的encoder和decoder没有set_input_embeddings方法的原因,在t5和gpt中不会更新最后一层lm_head。

稳定复现步骤 & 代码

复现代码如下
https://aistudio.baidu.com/aistudio/projectdetail/6232319?contributionType=1&sUid=581675&shared=1&ts=1684831403394
版本号：resize_token bug

w5688414 · 2024-05-08T08:34:36Z

请升级一下paddle和paddlenlp的版本试试，现在已经支持了。

PaddleNLP/paddlenlp/transformers/model_utils.py

Line 1320 in 829e7f0

    
           def resize_token_embeddings(self, new_num_tokens: Optional[int] = None) -> nn.Embedding:

xiaotingyun added the bug Something isn't working label May 23, 2023

github-actions bot added the triage label May 23, 2023

sijunhe self-assigned this May 29, 2023

paddle-bot bot assigned wawltor Mar 8, 2024

paddle-bot bot closed this as completed May 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: resize_token_embedding仍然存在bug，bart,pegasus无效，其他生成模型最后一层lm_head不会被resize #6008

[Bug]: resize_token_embedding仍然存在bug，bart,pegasus无效，其他生成模型最后一层lm_head不会被resize #6008

xiaotingyun commented May 23, 2023

w5688414 commented May 8, 2024

[Bug]: resize_token_embedding仍然存在bug，bart,pegasus无效，其他生成模型最后一层lm_head不会被resize #6008

[Bug]: resize_token_embedding仍然存在bug，bart,pegasus无效，其他生成模型最后一层lm_head不会被resize #6008

Comments

xiaotingyun commented May 23, 2023

软件环境

重复问题

错误描述

稳定复现步骤 & 代码

w5688414 commented May 8, 2024