@@ -32,12 +32,13 @@ pip install -r requirements-mac.txt
3232## Usage🛠️
3333We have released 4 models for different purposes:
3434
35- | Version | Name | Purpose | Sampling Rate | Content Encoder | Vocoder | Hidden Dim | N Layers | Params | Remarks |
36- | ---------| ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| --------------------------------| ---------------| -------------------------| ---------| ------------| ----------| --------| --------------------------------------------------------|
37- | v1.0 | seed-uvit-tat-xlsr-tiny ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_uvit_tat_xlsr_ema.pth ) [ 📄] ( configs/presets/config_dit_mel_seed_uvit_xlsr_tiny.yml ) ) | Voice Conversion (VC) | 22050 | XLSR-large | HIFT | 384 | 9 | 25M | suitable for real-time voice conversion |
38- | v1.0 | seed-uvit-whisper-small-wavenet ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_small_wavenet_bigvgan_pruned.pth ) [ 📄] ( configs/presets/config_dit_mel_seed_uvit_whisper_small_wavenet.yml ) ) | Voice Conversion (VC) | 22050 | Whisper-small | BigVGAN | 512 | 13 | 98M | suitable for offline voice conversion |
39- | v1.0 | seed-uvit-whisper-base ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_base_f0_44k_bigvgan_pruned_ft_ema.pth ) [ 📄] ( configs/presets/config_dit_mel_seed_uvit_whisper_base_f0_44k.yml ) ) | Singing Voice Conversion (SVC) | 44100 | Whisper-small | BigVGAN | 768 | 17 | 200M | strong zero-shot performance, singing voice conversion |
40- | v2.0 | hubert-bsqvae-small ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/v2 ) [ 📄] ( configs/v2/vc_wrapper.yaml ) ) | Voice & Accent Conversion (VC) | 22050 | [ ASTRAL-Quantization] ( https://github.com/Plachtaa/ASTRAL-quantization ) | BigVGAN | 512 | 13 | 67M | Best in suppressing source speaker traits |
35+ | Version | Name | Purpose | Sampling Rate | Content Encoder | Vocoder | Hidden Dim | N Layers | Params | Remarks |
36+ | ---------| ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| --------------------------------| ---------------| ------------------------------------------------------------------------| ---------| ------------| ----------| --------| --------------------------------------------------------|
37+ | v1.0 | seed-uvit-tat-xlsr-tiny ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_uvit_tat_xlsr_ema.pth ) [ 📄] ( configs/presets/config_dit_mel_seed_uvit_xlsr_tiny.yml ) ) | Voice Conversion (VC) | 22050 | XLSR-large | HIFT | 384 | 9 | 25M | suitable for real-time voice conversion |
38+ | v1.0 | seed-uvit-whisper-small-wavenet ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_small_wavenet_bigvgan_pruned.pth ) [ 📄] ( configs/presets/config_dit_mel_seed_uvit_whisper_small_wavenet.yml ) ) | Voice Conversion (VC) | 22050 | Whisper-small | BigVGAN | 512 | 13 | 98M | suitable for offline voice conversion |
39+ | v1.0 | seed-uvit-whisper-base ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_base_f0_44k_bigvgan_pruned_ft_ema.pth ) [ 📄] ( configs/presets/config_dit_mel_seed_uvit_whisper_base_f0_44k.yml ) ) | Singing Voice Conversion (SVC) | 44100 | Whisper-small | BigVGAN | 768 | 17 | 200M | strong zero-shot performance, singing voice conversion |
40+ | v2.0 | hubert-bsqvae-small ([ 🤗] ( https://huggingface.co/Plachta/Seed-VC/blob/main/v2 ) [ 📄] ( configs/v2/vc_wrapper.yaml ) ) | Voice & Accent Conversion (VC) | 22050 | [ ASTRAL-Quantization] ( https://github.com/Plachtaa/ASTRAL-quantization ) | BigVGAN | 512 | 13 | 67M | Best in suppressing source speaker traits |
41+
4142Checkpoints of the latest model release will be downloaded automatically when first run inference.
4243If you are unable to access huggingface for network reason, try using mirror by adding ` HF_ENDPOINT=https://hf-mirror.com ` before every command.
4344
0 commit comments