Some Piper voices. #430
Replies: 11 comments 9 replies
-
Awesome work, thank you! I will get these uploaded to the piper-voices repo 🙂 |
Beta Was this translation helpful? Give feedback.
-
Forgot I had a medium quality settings version of the Cori voice. That is up as well now. |
Beta Was this translation helpful? Give feedback.
-
that's awesome! Would you mind to share some screenshots of how the gen and disc graphs from tensorboard look like from these trainings? I'm having difficulty understanding what a good graph looks like. thanks! |
Beta Was this translation helpful? Give feedback.
-
fair enough. Would you by any chance have used a 3090? I borrow my son's 4090 to do some training but I'm considering buying a second hand 3090 or a 4080. I don't want to spend too much on a 4090 unless it's really that faster than a 4080 or 3090. Do you think I would benefit from training a large dataset (100+ hours of audio) until it's good enough, then training a smaller dataset ~10h using a checkpoint from the 100+ hours training? Would it get as good as the one trained with 100+ hours? thanks! |
Beta Was this translation helpful? Give feedback.
-
Thank you for your work. |
Beta Was this translation helpful? Give feedback.
-
Hi. Also, it has the same issue as @StoryHack described. The quality of the generated audio varries from sentence to sentence. i guess there aren't any tools you could use to equalise all those recordings without some serious technical audio fiddling? |
Beta Was this translation helpful? Give feedback.
-
Good luck @StoryHack |
Beta Was this translation helpful? Give feedback.
-
I just put 3 additional voices on the page, all public domain. One is my voice (Bryce), which I was experimenting with to see the minimum samples needed. I recorded using a Vivitar USB mic and the piper recording studio. I used the Harvard balanced sentences as most of the corpus, along with some longer and shorter sentences that I made up. Some day I will record more and redo this voice, but it sounds reasonably close to real me now. The other 2 are both US Male voices built from datasets made from Librivox recorings. |
Beta Was this translation helpful? Give feedback.
-
Grabbing them now, thank you for training these! |
Beta Was this translation helpful? Give feedback.
-
Do you happen to have screenshots of what your tensorboard graphs looked like after say 1000 epochs when training from scratch? I have about 14 hours of source audio that I'm training from scratch on a 4080, batch size of 24, and after about 300 epochs my |
Beta Was this translation helpful? Give feedback.
-
Okay, so this is long since a finished convo, but I figured I'd ask anyway since it's the best place I've found to ask.... I am just starting my foray into finetuning and training voices, but since I'm swedish my options are limited. I would like to start from a baseline using the medium Lisa model, but I can't seem to find a checkpoint for it. The "sv" is missing from the huggingface repo, when checking the checkpoints... Anyone ideas on what to do? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
So, I have been playing with training voices for a while now. I really wanted to have several good sounding voices available that have friendly licenses. So I'm posting 6 voices (well, 5, with a high and medium quality version of one) that I have trained and think sound pretty good. I include ckpt files for several, in case you want to fine tune with them.
Updated with 3 additional voices on 5/10/2024
https://brycebeattie.com/files/tts
If somebody wants to upload these to huggingface or similar, you have my blessing.
Beta Was this translation helpful? Give feedback.
All reactions