Skip to content

希望增加qwen2.5-omni推理保存音频文件 #3898

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
gouqi666 opened this issue Apr 16, 2025 · 3 comments
Open

希望增加qwen2.5-omni推理保存音频文件 #3898

gouqi666 opened this issue Apr 16, 2025 · 3 comments
Labels
enhancement New feature or request

Comments

@gouqi666
Copy link

Describe the feature
Please describe the feature requested here(请在这里描述需求)
使用qwen2.5-omni推理时,目前智能保存文本输出,无法保存音频输出,这对于一个多模态输出的模型来说,效果大打折扣,看到huggingface都已经实现这个过程,希望swift能接入一下。
Paste any useful information
Paste any useful information, including papers, github links, etc.(请在这里描述其他有用的信息,比如相关的论文地址,github链接等)
https://huggingface.co/Qwen/Qwen2.5-Omni-7B
Additional context
Add any other context or information here(其他信息可以写在这里)

Image
@hjjjk
Copy link

hjjjk commented Apr 16, 2025

哥你好,请教一下你通过swift部署qwen2.5-omni模型用的是NPU还是GPU

@gouqi666
Copy link
Author

gouqi666 commented Apr 16, 2025 via email

@hjjjk
Copy link

hjjjk commented Apr 16, 2025

好的哥,感谢回复

@Jintao-Huang Jintao-Huang added the enhancement New feature or request label Apr 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants