DashScope Realtime

🚀 Async Python SDK for DashScope Realtime ASR (Speech Recognition) & TTS (Speech Synthesis)

简介

DashScope Realtime 是一个支持异步 WebSocket 的 Python SDK，适配阿里 DashScope 的实时流式语音识别（ASR）和流式语音合成（TTS）能力。

为什么开发这个项目？

阿里云官方提供的DashScope Python SDK 是同步 WebSocket 实现，存在以下问题：

不支持 async / await
回调不在同一事件循环，无法直接使用 async 上下文
与 OpenAI API 生态的开源项目（如 FastAPI、Chainlit）不兼容

为了解决这些问题，本项目基于 DashScope WebSocket API，重新实现了异步版本的 ASR（语音识别）与 TTS（语音合成）SDK，具备：

纯异步 API 设计
支持流式音频输入输出
支持上下文无感知切换
更易接入 OpenAI API 风格的开源项目

安装

pip install dashscope-realtime

快速上手

实时语音识别（ASR）

from dashscope_realtime import DashScopeRealtimeASR

async with DashScopeRealtimeASR(api_key="your-api-key") as asr:
    await asr.send_audio(b"...")  # 发送音频片段

实时语音合成（TTS）

from dashscope_realtime import DashScopeRealtimeTTS

async with DashScopeRealtimeTTS(api_key="your-api-key") as tts:
    await tts.say("Hello, DashScope!")  # 发送文本
    await tts.finish()  # 完成任务

特性

✅ 全异步设计（async / await）
✅ ASR 支持流式音频输入
✅ TTS 支持流式音频输出
✅ 自动重连 & 错误处理
✅ 接口风格对齐 OpenAI Realtime
✅ 方便集成任意异步 Python 项目

License

MIT License — see LICENSE for details.

Made with ❤️ by mikuh

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
examples		examples
src/dashscope_realtime		src/dashscope_realtime
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
makefile		makefile
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DashScope Realtime

简介

为什么开发这个项目？

安装

快速上手

实时语音识别（ASR）

实时语音合成（TTS）

特性

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

mikuh/dashscope-realtime

Folders and files

Latest commit

History

Repository files navigation

DashScope Realtime

简介

为什么开发这个项目？

安装

快速上手

实时语音识别（ASR）

实时语音合成（TTS）

特性

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages