Skip to content

关于多标签长文本分类 #6652

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
luckyhfzd opened this issue Aug 8, 2023 · 2 comments
Closed

关于多标签长文本分类 #6652

luckyhfzd opened this issue Aug 8, 2023 · 2 comments
Assignees
Labels
others unknown issue type triage

Comments

@luckyhfzd
Copy link

luckyhfzd commented Aug 8, 2023

问题描述

小白一枚,此前有尝试使用ernie_health做医疗行业多标签文本分类,但因模型有512长度限制,微调效果不佳。
需求:
标签数据大约有5W+,文本长度约2000至3000.
想了解下paddle是否有相关解决方案或示例,请大佬指点。感激不尽~!

@luckyhfzd luckyhfzd added the others unknown issue type label Aug 8, 2023
@github-actions github-actions bot added the triage label Aug 8, 2023
@liuzhipengchd
Copy link

可以用pipline方式,把文本切分,训练多个ernie_health,然后把每个模型的输出结果concat,后面再接一个多分类的模型

@w5688414
Copy link
Contributor

w5688414 commented May 6, 2024

现在推荐使用大模型来做分类

@paddle-bot paddle-bot bot closed this as completed May 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
others unknown issue type triage
Projects
None yet
Development

No branches or pull requests

4 participants