Siwis good training on bad prompts #19
Comments
Is there enough of a pattern that we could automate some prompt corrections and re-train? |
I have to compare the prompts I use with the originals. How many prompts do you think you need? |
In parl, there are 4 occurrences of "rerai":
text/part1/neut_parl_s02_0531.txt:
text/part1/neut_parl_s02_0589.txt:
text/part1/neut_parl_s03_0372.txt:
the only correct is |
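A search like the one for "rerai" above could be scripted roughly as follows. This is only a sketch: the function name `find_occurrences` is made up for illustration, and it assumes the prompts are UTF-8 `.txt` files under a local copy of the `text/` directory.

```python
from pathlib import Path


def find_occurrences(root, needle="rerai"):
    """Return (relative path, line) pairs for prompt lines containing needle.

    Assumes UTF-8 .txt prompt files somewhere below root (an assumption
    about the local layout, not something the dataset guarantees).
    """
    root = Path(root)
    hits = []
    for path in sorted(root.rglob("*.txt")):
        for line in path.read_text(encoding="utf-8").splitlines():
            # Case-insensitive substring match on each prompt line.
            if needle in line.lower():
                hits.append((path.relative_to(root).as_posix(), line.strip()))
    return hits
```

Calling `find_occurrences("text")` would list every prompt containing the substring, which still has to be checked by ear against the recording.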
I use Siwis as the "base" model for French, since it's one where I had the most data available. So any corrections to the transcripts will improve it and all of the downstream models when I re-train. Should I create a repo to share the corrected transcripts, or would you like to do that? Also, thanks for your effort :) |
I have noticed quite a lot of "reading" problems. For my voice I just changed the prompts, and yes, it improved my voice, particularly where the defects are repeated, of course.
text/part1/neut_parl_s01_0633.txt: gagnerions vs gagnerons
text/part1/neut_parl_s03_0462.txt: oserions vs oserons
text/part1/neut_parl_s03_0622.txt:
text/part1/neut_parl_s04_0310.txt:
text/part1/neut_parl_s04_0378.txt:
text/part1/neut_parl_s06_0096.txt:
text/part1/neut_parl_s06_0666.txt: y is read as e (SAMPA)
text/part2/neut_book_s06_0092.txt:
text/part3/emph_parl_s01_0633.txt: gagnerions vs gagnerons
A repo is a good idea. Right now I am putting a lot of effort into chasing all these imperfections. |
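One way the word-level mismatches listed above could be applied to the prompt files is sketched below. It is not the method anyone in this thread actually used. The direction of each pair (first word as written in the prompt, second word as heard in the recording) is an assumption, since the list only says "X vs Y", and the names `CORRECTIONS`, `correct_prompt` and `apply_corrections` are hypothetical.

```python
import re
from pathlib import Path

# (prompt file, word as written, word as heard).
# ASSUMPTION: the first word of each "X vs Y" pair is the written prompt
# and the second is what the speaker actually says.
CORRECTIONS = [
    ("text/part1/neut_parl_s01_0633.txt", "gagnerions", "gagnerons"),
    ("text/part1/neut_parl_s03_0462.txt", "oserions", "oserons"),
    ("text/part3/emph_parl_s01_0633.txt", "gagnerions", "gagnerons"),
]


def correct_prompt(text, written, heard):
    """Replace whole-word, case-sensitive matches; return (new_text, count)."""
    pattern = re.compile(r"\b" + re.escape(written) + r"\b")
    return pattern.subn(heard, text)


def apply_corrections(root, corrections=CORRECTIONS, dry_run=True):
    """Apply each correction to its file under root; return [(file, count)].

    With dry_run=True nothing is written, so the result can be reviewed first.
    """
    changed = []
    for rel_path, written, heard in corrections:
        path = Path(root) / rel_path
        if not path.is_file():
            continue  # Skip files missing from the local copy.
        new_text, count = correct_prompt(
            path.read_text(encoding="utf-8"), written, heard
        )
        if count:
            changed.append((rel_path, count))
            if not dry_run:
                path.write_text(new_text, encoding="utf-8")
    return changed
```

The dry run is the default so that the list of affected files can be reviewed before anything is overwritten. The phoneme labels would also need regenerating from the corrected text before re-training.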
If it would help you out, I have the prompt alignments too. I trained a French Kaldi model on these same IPA phonemes, and used the alignments in the training labels and to trim the WAV files. |
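For anyone wanting to reproduce the trimming step, a minimal sketch using only the standard library might look like this. The alignment format is not described in this thread, so the sketch simply assumes a start and end time in seconds is already known for each utterance. `trim_wav` is a hypothetical helper, not part of Kaldi.

```python
import wave


def trim_wav(src_path, dst_path, start_s, end_s):
    """Copy the [start_s, end_s) span of a PCM WAV file to dst_path.

    start_s and end_s are in seconds and are ASSUMED to come from the
    aligner output. Returns the number of frames written.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        rate = src.getframerate()
        total = src.getnframes()
        # Clamp both ends to the valid frame range.
        start = min(max(0, int(start_s * rate)), total)
        end = min(max(start, int(end_s * rate)), total)
        src.setpos(start)
        frames = src.readframes(end - start)
    with wave.open(dst_path, "wb") as dst:
        # The frame count in the header is corrected when the file is closed.
        dst.setparams(params)
        dst.writeframes(frames)
    return end - start
```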
In Siwis, the voice talent rarely respects the pronunciation of verbs in the conditional mood.
For example, she says "il tirait" instead of "il tirerait", so
despite the correct phonemes
I can hear "il tirait le premier".