Skip to content

Commit 06e69bf

Browse files
authored
Update README.md
1 parent b8fd64f commit 06e69bf

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ Command line inference:
1515
python inference.py --source <source-wav> \
1616
--target <referene-wav>
1717
--output <output-dir>
18-
--diffusion-steps 10
18+
--diffusion-steps 25
1919
--length-adjust 1.0
2020
--inference-cfg-rate 0.7
2121
--n-quantizers 3
@@ -24,7 +24,7 @@ where:
2424
- `source` is the path to the speech file to convert to reference voice
2525
- `target` is the path to the speech file as voice reference
2626
- `output` is the path to the output directory
27-
- `diffusion-steps` is the number of diffusion steps to use, default is 10, use 50~100 for best quality
27+
- `diffusion-steps` is the number of diffusion steps to use, default is 25, use 50~100 for best quality, use 4~10 for fastest inference
2828
- `length-adjust` is the length adjustment factor, default is 1.0, set <1.0 for speed-up speech, >1.0 for slow-down speech
2929
- `inference-cfg-rate` has subtle difference in the output, default is 0.7
3030
- `n-quantizers` is the number of quantizers from FAcodec to use, default is 3, the less quantizer used, the less prosody of source audio is preserved
@@ -52,4 +52,4 @@ Then open the browser and go to `http://localhost:7860/` to use the web interfac
5252
- 2024-09-14:
5353
- Updated v0.2 pretrained model, with smaller size and less diffusion steps to achieve same quality, and additional ability to control prosody preservation
5454
- Added command line inference script
55-
- Added installation and usage instructions
55+
- Added installation and usage instructions

0 commit comments

Comments
 (0)