Skip to content
View SHINE-MU-DEV's full-sized avatar

Block or report SHINE-MU-DEV

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

对于IndexTTS2的复现

Python 8 4 Updated Oct 24, 2025

ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models

Python 29 3 Updated Nov 18, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,179 179 Updated Nov 19, 2025
Python 313 19 Updated Aug 28, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,506 765 Updated May 27, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,335 1,904 Updated Oct 21, 2025
Python 262 46 Updated Jun 21, 2025

汉字拼音数据

Python 1,398 227 Updated Jun 14, 2025

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Python 222 21 Updated Jan 14, 2025

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 1,174 163 Updated Nov 17, 2025

使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。

349 156 Updated Dec 27, 2018
Python 14 5 Updated Aug 1, 2025

[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models

Python 68 4 Updated Mar 31, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,659 294 Updated Aug 14, 2025

In this repository, you will learn how code works in VITS(Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) in Jupyter Notebooks, including normalizing da…

Jupyter Notebook 156 21 Updated Jun 5, 2023

模型压缩的小白入门教程,PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases

337 37 Updated Jun 14, 2025

大模型基础: 一文了解大模型基础知识

6,203 523 Updated Feb 24, 2025

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Jupyter Notebook 178 36 Updated Mar 18, 2024