Highlights
- Pro
Stars
6
stars
written in Python
Clear filter
FinOps and cloud cost optimization tool. Supports AWS, Azure, GCP, Alibaba Cloud and Kubernetes.
MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…
spring-media / ForwardTacotron
Forked from fatchord/WaveRNN⏩ Generating speech in a single forward pass without any attention!
An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.
Implementation of "SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection" paper
lokkelvin2 / dc_tts_GUI
Forked from Kyubyong/dc_ttsGUI Wrapper for 'A TensorFlow Implementation of DC-TTS: yet another text-to-speech model'