Skip to content
View SHINE-MU-DEV's full-sized avatar

Block or report SHINE-MU-DEV

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
12 stars written in Python
Clear filter

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,342 1,905 Updated Oct 21, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,506 765 Updated May 27, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,661 294 Updated Aug 14, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,189 179 Updated Nov 19, 2025

汉字拼音数据

Python 1,398 227 Updated Jun 14, 2025
Python 313 19 Updated Aug 28, 2025
Python 263 46 Updated Jun 21, 2025

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Python 222 21 Updated Jan 14, 2025

[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models

Python 68 4 Updated Mar 31, 2024

ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models

Python 29 3 Updated Nov 18, 2025
Python 14 5 Updated Aug 1, 2025

对于IndexTTS2的复现

Python 8 4 Updated Oct 24, 2025