 - [x] Hugging Face Space demo: <https://huggingface.co/spaces/Plachta/Seed-VC>
+- [x] HTML demo page (may include comparisons with other VC models): [Demo](https://plachtaa.github.io/seed-vc/)
README.md: 11 additions & 8 deletions
@@ -1,19 +1,21 @@
-# Seed-VC
+# Seed-VC
+*English | [简体中文](README-CN.md)*
+
 A new zero-shot voice conversion scheme inspired by SEED-TTS.
 
-Currently released model supports *zero-shot voice conversion* and *zero-shot singing voice conversion*. Without any training, it is able to clone a voice given a reference speech of 1~30 seconds.
+Currently released model supports *zero-shot voice conversion*🔊 and *zero-shot singing voice conversion* 🎙. Without any training, it is able to clone a voice given a reference speech of 1~30 seconds.
 
-To find a list of demos and comparisons with previous voice conversion models, please visit our [demo page](https://plachtaa.github.io/seed-vc/)
+To find a list of demos and comparisons with previous voice conversion models, please visit our [demo page](https://plachtaa.github.io/seed-vc/)🌐
 
 We are keeping on improving the model quality and adding more features.
 
-## Installation
+## Installation📥
 Suggested python 3.10 on Windows or Linux.
 ```bash
 pip install -r requirements.txt
 ```
 
-## Usage
+## Usage🛠️
 Checkpoints of the latest model release will be downloaded automatically when first run inference.
 
 Command line inference:
@@ -39,13 +41,14 @@ where:
 - `n-quantizers` is the number of quantizers from FAcodec to use, default is 3, the less quantizer used, the less prosody of source audio is preserved
 - `f0-condition` is the flag to condition the pitch of the output to the pitch of the source audio, default is False, set to True for singing voice conversion
 - `auto-f0-condition` is the flag to auto adjust source pitch to target pitch level, default is False, normally not used in singing voice conversion
-- `semi-tone-shift` is the pitch shift in semitones for singing voice conversion, default is 0
+- `semi-tone-shift` is the pitch shift in semitones for singing voice conversion, default is 0
+
 Gradio web interface:
 ```bash
 python app.py
 ```
 Then open the browser and go to `http://localhost:7860/` to use the web interface.
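
For readers skimming the option list in this hunk, the documented defaults can be summarised as a small `argparse` sketch. This is not the project's inference script: only the option names and defaults (`n-quantizers` = 3, `f0-condition` = False, `auto-f0-condition` = False, `semi-tone-shift` = 0) come from the README text. The `--` prefixes, the `str2bool` helper, the `build_parser` function, and the integer type for the semitone shift are assumptions made for illustration.

```python
import argparse


def str2bool(value: str) -> bool:
    """Hypothetical helper: turn 'True'/'False'-style strings into booleans."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: names and defaults mirror the README option list,
    # everything else (types, '--' prefixes) is assumed for this sketch.
    parser = argparse.ArgumentParser(description="Sketch of the documented options")
    parser.add_argument("--n-quantizers", type=int, default=3)
    parser.add_argument("--f0-condition", type=str2bool, default=False)
    parser.add_argument("--auto-f0-condition", type=str2bool, default=False)
    parser.add_argument("--semi-tone-shift", type=int, default=0)
    return parser


# Defaults as documented: plain (non-singing) voice conversion.
defaults = build_parser().parse_args([])

# Singing voice conversion as the README describes it: pitch conditioning on,
# with an example shift of two semitones (the value 2 is arbitrary).
singing = build_parser().parse_args(["--f0-condition", "True", "--semi-tone-shift", "2"])

print(defaults)
print(singing)
```

Passing booleans as explicit `True`/`False` values follows the README wording ("set to True for singing voice conversion"); a `store_true` flag would be an equally plausible design, and the real script may differ.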