German Voice for RHVoice #24

winman3000 · 2015-12-09T10:17:15Z

It would be nice to have RH Voice in German. What do you Need for German language?

abitrolly · 2015-12-10T12:52:09Z

Training data: text, audio files that read that text, and mapping that maps audio to the text.

winman3000 · 2015-12-10T13:05:25Z

If I understand you right, you need a nativ speaker who reads a text an
an audio file, right?

And what you mean with:

[...] and mapping that maps audio to the text.

You need the spoken text as a text file?

What criteria should have the read text?

Is it necessary that this text is recorded in a studio or could it be
with a headset too?

Sorry for the beginner questions, but I am a beginner in this aria

abitrolly · 2015-12-10T13:13:12Z

If I understand you right, you need a nativ speaker who reads a text an
an audio file, right?

Yes.

You need the spoken text as a text file?

Yes.

What criteria should have the read text?

That depends on a language. Basically the text should be chosen in a way that audio for it contains all possible combinations of sounds, or at least cover most popular.

Is it necessary that this text is recorded in a studio or could it be
with a headset too?

It is better to record in studio with highest quality, because then you can convert audio to different formats. But.. it is possible to record with headset too.

Mind you that software is dumb - it doesn't know where words starts in your audio, so you will have to mark audio files with text manually (mapping between text and audio).

winman3000 · 2015-12-10T13:21:37Z

OK, now I am closer to understand you. :)

Mind you that software is dump - it doesn't know where words starts in
your audio, so you will have to mark audio files with text.

How can I do this? I am a blind person. If I give you a spoken text and
the text file, whould it be possible that you can create the voice? I am
a beginner so I am not able to create a German Voice database.

If it is too difficult to create a German Voice Database, we need
someone who can create such database. I am not a right person to create
such one I think.

abitrolly · 2015-12-10T13:30:21Z

@winman3000 how do you use RHVoice primarily? Maybe there is already a voice for your platform or there might be available training bases. I think that any training material for HTS-type synthesizer will do.

And answering your question, me personally is unlikely to create the voice, because this project is not funded and I am afraid there is still a lot of missing bits to fill in that take time. But if we get training data, it is at least possible to see what is next.

winman3000 · 2015-12-10T14:18:34Z

how do you use RHVoice primarily?

Currently I don't use RHVoice as it is not in German, but I've tested the English ones with NVDA. If a German Voice would be available in the future, I'll use this Voice.

Maybe there is already a voice for your platform or there might be available training bases.

There should be training bases, as MaryTTS is using HTS based voices and MaryTTS has German voices. But I am not able to find the training data. If I could find the training data, it would be easier to find the next steps I think. As far as I know, the German voices are publiced as a BSD license, so I'm afraid that we cannot use these voices. It would be nice to have the training materials.

abitrolly · 2015-12-10T18:27:27Z

Yes, MaryTTS is what I thought of too. They also provide a nice diagram of the whole process, but details are still a bit elusive. Guys are also very responsive. So we should ask them. I think it is also possible to hire Olga, but I don't have contact with her. BSD license is not a problem. It is not hard to give proper credit in voice documentation. But we need better docs - targeted at simple users who are not linguists or mathematicians, but who want to hack on voice technologies. And I personally think that blind users should take the lead and explain to everybody else how it works and what should we do. =)

winman3000 · 2015-12-10T18:50:14Z

Have you an idea how to contact the developers of Mary TTS? I know there are in GitHub, but I don't know the way to contact the team leaders. The only thing I can do is only open a ticket.

nshmyrev · 2015-12-10T19:44:59Z

It's better to give a link to marytts/marytts#440

You can also contact marytts developers on the mailing list http://www.dfki.de/mailman/cgi-bin/listinfo/mary-users

Voice building for rhvoice is not trivial, you could probably better just use nvda plugin for openmary with openmary voices. Openmary also has better synthesis quality due to mixed excitation vocoder and advanced NLP components.

winman3000 · 2015-12-10T19:46:23Z

I've got a link from a developer or so for German. Can you work with it? http://www.voxforge.org/de/Downloads

winman3000 · 2015-12-10T19:48:56Z

The bad thing in MaryTTS is that the responsiveness is not fast as in RH Voice. The second thing is that you need Java.

abitrolly · 2015-12-10T22:54:33Z

As for Voxforge files, I need to check if audio files are annotated, and if RHVoice training pipeline can handle this format.

abitrolly · 2015-12-10T22:54:33Z

Right. RHVoice is optimized for responsiveness. But I don't understand why building voice for RHVoice should be harder than for any other HTS synthesizer. For now the main problem is the lack of complete picture.

winman3000 · 2015-12-10T23:00:43Z

As for Voxforge files, I need to check if audio files are annotated,

As far as I know, these files are annotated.

and if
RHVoice training pipeline can handle this format.

It would be very great when you can check this.

Thank you very much for your help!

abitrolly · 2015-12-11T16:44:41Z

Some analysis. Prompts.tgz contains text that should be dictated. Filename master_prompts_train_16kHz-16bit contains strings like:

ralfherzog-20080131-de71/mfc/de71-62 DIE AUSGABEN KONNTEN GESPART WERDEN
ralfherzog-20080131-de71/mfc/de71-63 MAN WIRD AUF DEN NÄCHSTEN ABSCHWUNG WARTEN MÜSSEN
ralfherzog-20080131-de71/mfc/de71-64 DA MUSS MAN AUF ANDERE EREIGNISSE WARTEN

I am not really sure what this label means ralfherzog-20080131-de71/mfc/de71-63. ralfherzog looks like a name of text and everything else is still a mystery.

abitrolly · 2015-12-11T16:46:57Z

Okay. http://www.repository.voxforge1.org/downloads/de/Trunk/Audio/Original/48kHz_16bit/ contains ralfherzog-20080131-de71.tgz Download is very slow, so I don't yet see what's inside. Looks like it should be audio for the text and name is just identifier what-when-shortid. mfc/de71-64 is still unclear. 64 looks like line number, de71 a short text identifier, but what mfc is - it is not clear.

winman3000 · 2015-12-13T19:16:41Z

Hmm. So we cannot build a voice for RHVoice with this stuff? ralfherzog is the name, but I don't know the rest. After this it is the sentence.

nshmyrev · 2015-12-14T10:10:50Z

I would not be that pessimistic and try well documented openmary voice import procedure first.

abitrolly · 2016-01-15T07:19:15Z

@nshmyrev is that it? http://mary.opendfki.de/wiki/VoiceImportToolsTutorial

nshmyrev · 2016-01-15T16:21:18Z

@abitrolly latest wiki is on github:

https://github.com/marytts/marytts/wiki/VoiceImportToolsTutorial

that one on opendfki might be slightly outdated.

abitrolly · 2016-01-16T05:50:29Z

@nshmyrev thanks. Just need to get some free time now.

rugk · 2020-10-16T21:19:41Z

Why not just use Common Voice?
https://commonvoice.mozilla.org/de/datasets has 19GB German-spoken data.

maniyax · 2020-10-16T21:55:24Z

@rugk Hello!
Sorry, but it's very bad quality.
Besides the text is missing in dataset.

I'm not sure about the German dataset, but russian dataset include a lot of different dictors.

beqabeqa473 · 2020-10-17T04:21:38Z

hello. common voice as i know is targetted for creation of rnn voices. the way they're built is a bit different. this method doesn't specifically need one voice as it builds acoustic model. but for rhvoice it is a requirement of good quality recording of one speaker to create a voice as technologies are different.

…

On 10/17/20, Artem Plaksin ***@***.***> wrote: @rugk Hello! Sorry, but it's very bad quality. Besides the text is missing in dataset. I'm not sure about the German dataset, but russian dataset include a lot of different dictors. -- You are receiving this because you are subscribed to this thread. Reply to this email directly or view it on GitHub: #24 (comment)

-- with best regards beqa

rugk · 2020-10-17T08:08:30Z

Ah okay, thanks, this makes sense. Of course, yeah, their idea, was to collect many voices…

winman3000 · 2021-02-09T11:29:13Z

Is there any progress on this in the meantime? Has anyone tried to import the voices from MaryTTS for RHVoice?

If there is no existing data we can use for this project, we will have to do everything from scratch. Unfortunately I have to repeat my questions:

How long does the audio need to be?
How exactly do you do the mapping if you have the text to go with it?
Is there really no material we could use for a German voice?

I would love to finish the project so that we finally have a German voice as well...

citizenserious · 2021-12-07T18:46:03Z

I have found some voices including German, I use them with VocalizerEx2 TTS. They are working fine, but I do not trust VocalizerEx2. Would be nice to have these voices in RHVoice (:

https://vocalizer-nvda.com/downloads

zstanecic · 2021-12-07T19:36:58Z

Not really! It will be the infringement of copyright, and it is not possible, because of the synthesis method. From: citizenserious ***@***.***> Sent: Tuesday, December 7, 2021 7:46 PM To: RHVoice/RHVoice ***@***.***> Cc: Subscribed ***@***.***> Subject: Re: [RHVoice/RHVoice] German Voice for RHVoice (#24) I have found some voices including German, I use them with VocalizerEx2 TTS. They are working fine, but I do not trust VocalizerEx2. Would be nice to have these voices in RHVoice (: https://vocalizer-nvda.com/downloads — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#24 (comment)> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACVCDE4DLTITFBTH6ZBDTY3UPZI7TANCNFSM4BWG54KA> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub> . <https://github.com/notifications/beacon/ACVCDE3FWPFE55BT7KTHKSDUPZI7TA5CNFSM4BWG54KKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOHLTG4DI.gif>

uncle-ben-devel · 2022-08-14T15:37:58Z

Any progress?
I'd like to get involved as I very much like the voice output from this application and it's openness.
I am a german native speaker. How can I contribute voice / training data? I have decent recording equipment also.

winman3000 · 2022-08-14T17:05:46Z

As far as I know, there is no progress on this. Apparently it is not enough to make language recordings, modules must also be adapted in C++, i.e. libraries must be developed for the German language.

nshmyrev · 2022-08-14T19:22:41Z

You need to ask Torsten https://github.com/thorstenMueller/Thorsten-Voice, he will make the voice for you.

winman3000 · 2022-08-15T12:01:53Z

Maybe we should contact Torsten Müller and ask if he would be willing to port his voice for RHVoice. We will definitely need someone to write the modules for German in C++. Unfortunately I do not have programming skills myself.

zstanecic · 2022-08-15T12:43:12Z

Hi, You don’t need to write anything in c++. Please see, for example, the polish language as an example. You can create the data only language, which is imported by RHVoice without knowing it by the engine. You need to learn how foma language regexes work. There are tutorials and regex references available. From: winman3000 ***@***.***> Sent: Monday, August 15, 2022 7:02 PM To: RHVoice/RHVoice ***@***.***> Cc: Zvonimir Stanečić ***@***.***>; Comment ***@***.***> Subject: Re: [RHVoice/RHVoice] German Voice for RHVoice (#24) Maybe we should contact Torsten Müller and ask if he would be willing to port his voice for RHVoice. We will definitely need someone to write the modules for German in C++. Unfortunately I do not have programming skills myself. — Reply to this email directly, view it on GitHub <#24 (comment)> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACVCDE3JV3XG3NOT3YNHJZTVZIWTXANCNFSM4BWG54KA> . You are receiving this because you commented. <https://github.com/notifications/beacon/ACVCDE2CG7K6KXRI45I3XT3VZIWTXA5CNFSM4BWG54KKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOJBVGZ7Q.gif> Message ID: ***@***.*** ***@***.***> >

thorstenMueller · 2023-01-07T14:18:24Z

Hi and sorry for joing late to discussion.

I tried to figure out required steps to add german language based on my Thorsten-Voice dataset. The "Polish" language was referenced but honestly i'm not sure what to do.
@zstanecic Would you mind helping me with the first steps?

zstanecic · 2023-01-07T14:29:24Z

Hi Thorsten, First of all, you will need to define the language rules in the foma scripting language, then compile these to fst, with the foma compiler and interpreter. The version which is needed is v 0.9.18. Newer version causes issues. Secondly, you will need to create the phonemes features in xml file, as defined for example for polish. For german you will need to modify thinks a bit, or probably from scratch. Thirdly, you will need to have labelling.xml. As the language you will create will be data only, you will need the language.conf and graph.tx.. This is for registering the letters and telling the synthesizer which of these are consonants, and which are wovels. Feel free to ask questions if these will arise. From: Thorsten Müller ***@***.***> Sent: Saturday, January 7, 2023 3:19 PM To: RHVoice/RHVoice ***@***.***> Cc: Zvonimir Stanečić ***@***.***>; Mention ***@***.***> Subject: Re: [RHVoice/RHVoice] German Voice for RHVoice (#24) Hi and sorry for joing late to discussion. I tried to figure out required steps to add german language based on my Thorsten-Voice dataset. The "Polish" language was referenced but honestly i'm not sure what to do. @zstanecic <https://github.com/zstanecic> Would you mind helping me with the first steps? — Reply to this email directly, view it on GitHub <#24 (comment)> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACVCDE2HW6XSYR6OQJPLVITWRF3LZANCNFSM4BWG54KA> . You are receiving this because you were mentioned. <https://github.com/notifications/beacon/ACVCDE45O46AMV4GVZZTWOTWRF3LZA5CNFSM4BWG54KKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOKHWSLBA.gif> Message ID: ***@***.*** ***@***.***> >

thorstenMueller · 2023-03-11T15:18:46Z

Hi @zstanecic ,

what i did so far:

Installed foma in version 0.9.18
Created folder /data/languages/German
In that "German" folder i've copied the following files from "Polish" directory: graph.txt, labelling.xml, language.conf, language.info, locale.info, phonemes.xml

Am i right, that ...

i have to adjust all these files (or less or more) to german?
running "foma" (do not know this tool at all, yet) will create several.fst and dt files?

Once this is done i guess i've to create a "Thorsten" folder in the /data/voices folder? And do whatever way of magic there ;-).

zstanecic · 2023-03-11T15:31:33Z

Hi Thorsten, That’s not the right way to start. You need to check the foma scripts first, and write the rules for your language. Scripts are located in rhvoice/scripts/language. You will need examples of data only languages. Let me contact privately on some whatsapp or email to explain this further. ***@***.*** ***@***.***> From: Thorsten Müller ***@***.***> Sent: Saturday, March 11, 2023 4:19 PM To: RHVoice/RHVoice ***@***.***> Cc: Zvonimir Stanečić ***@***.***>; Mention ***@***.***> Subject: Re: [RHVoice/RHVoice] German Voice for RHVoice (#24) Hi @zstanecic <https://github.com/zstanecic> , what i did so far: * Installed foma in version 0.9.18 * Created folder /data/languages/German * In that "German" folder i've copied the following files from "Polish" directory: graph.txt, labelling.xml, language.conf, language.info, locale.info, phonemes.xml Am i right, that ... * i have to adjust all these files (or less or more) to german? * running "foma" (do not know this tool at all, yet) will create several.fst and dt files? Once this is done i guess i've to create a "Thorsten" folder in the /data/voices folder? And do whatever way of magic there ;-). — Reply to this email directly, view it on GitHub <#24 (comment)> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACVCDE6RLCUZN7RQCO7BJKDW3SJWFANCNFSM4BWG54KA> . You are receiving this because you were mentioned. <https://github.com/notifications/beacon/ACVCDE4SFFKZIKTDZXC7ZFLW3SJWFA5CNFSM4BWG54KKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOK5IRWSQ.gif> Message ID: ***@***.*** ***@***.***> >

thorstenMueller · 2023-03-11T15:36:23Z

Thanks for your quick reply and support offer. As i've never worked with foma this is really highly appreciated.
Github mail doesn't show your mail please contact me here and we can talk by mail then.
https://www.thorsten-voice.de/en/contact/

zstanecic · 2023-03-11T15:53:47Z

Hi, I sent you a msg. please check your inbox From: Thorsten Müller ***@***.***> Sent: Saturday, March 11, 2023 4:37 PM To: RHVoice/RHVoice ***@***.***> Cc: Zvonimir Stanečić ***@***.***>; Mention ***@***.***> Subject: Re: [RHVoice/RHVoice] German Voice for RHVoice (#24) Thanks for your quick reply and support offer. As i've never worked with foma this is really highly appreciated. Github mail doesn't show your mail please contact me here and we can talk by mail then. https://www.thorsten-voice.de/en/contact/ — Reply to this email directly, view it on GitHub <#24 (comment)> , or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACVCDE5BA2BYU4FLSWCTIVTW3SLYFANCNFSM4BWG54KA> . You are receiving this because you were mentioned. <https://github.com/notifications/beacon/ACVCDEYGRXR5OUVRV5ORCSLW3SLYFA5CNFSM4BWG54KKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOK5ISO7Q.gif> Message ID: ***@***.*** ***@***.***> >

thorstenMueller · 2023-03-13T15:40:17Z

Hi,
thanks @zstanecic for the really nice and helpful videochat this sunday. Based on that i forked the repo and created a "german" branch. Inside that /src/scripts/German folder i copied some foma files from Polish language that have to be changed or created from scratch for German.
https://github.com/thorstenMueller/RHVoice/tree/german/src/scripts/German

g2p.foma
gpos.foma
lseq.foma
spell.foma
stress.foma

This basic work has nothing to do with the actual voice, it's more a basic grammar setup and rules for german language. Honestly i cannot work this out all by my self. I'll read foma regex reference in foma wiki and contribute my actual voice. But i need help (probably by german speaking people) to adjust the basic grammar and rules.

So any help on this is appreciate if we'd like to add german / Thorsten-Voice to RHVoice.

BluePixel4k · 2023-03-19T13:17:16Z

@thorstenMueller thx for your effort. Maybe I can help a bit with adjusting the basic grammar and rules. But I don't know how exactly.

svnpsc · 2023-08-26T07:01:05Z

May this be helpful?
https://github.com/ikekonglp/NLPLAB/tree/master/FOMA_Scripts

Foexle11 · 2023-11-23T21:24:47Z

any news?

thorstenMueller · 2023-11-25T08:14:55Z

Not from my side yet. Topic is absolutely interesting but remaining free time is (as mostly) is limited.

citizenserious · 2024-04-17T14:28:59Z

What about the Piper voices?

berce mentioned this issue Dec 24, 2017

Spanish argentinian voice #53

Open

alex19EP added (P5 - Long-term) Long-term WIP, may stay on the list for a while. <Documentation> internal info, manuals and help Data: Languages/Pronunciation Data: Voices labels Jun 16, 2020

winman3000 mentioned this issue Mar 13, 2021

How to add your own voices? #95

Open

IzzySoft mentioned this issue Jul 6, 2021

please add German language #334

Closed

winman3000 mentioned this issue Aug 15, 2022

Porting the German voice into RHVoice thorstenMueller/Thorsten-Voice#39

Open

thorstenMueller mentioned this issue Mar 17, 2023

[Feature request] Easier running of tts under Windows coqui-ai/TTS#2384

Closed

German Voice for RHVoice #24

German Voice for RHVoice #24

Comments

winman3000 commented Dec 9, 2015

abitrolly commented Dec 10, 2015

winman3000 commented Dec 10, 2015

abitrolly commented Dec 10, 2015

winman3000 commented Dec 10, 2015

abitrolly commented Dec 10, 2015

winman3000 commented Dec 10, 2015 via email

abitrolly commented Dec 10, 2015 via email

winman3000 commented Dec 10, 2015 via email

nshmyrev commented Dec 10, 2015

winman3000 commented Dec 10, 2015 via email

winman3000 commented Dec 10, 2015 via email

abitrolly commented Dec 10, 2015 via email

abitrolly commented Dec 10, 2015 via email

winman3000 commented Dec 10, 2015

abitrolly commented Dec 11, 2015

abitrolly commented Dec 11, 2015

winman3000 commented Dec 13, 2015 via email

nshmyrev commented Dec 14, 2015

abitrolly commented Jan 15, 2016

nshmyrev commented Jan 15, 2016

abitrolly commented Jan 16, 2016

rugk commented Oct 16, 2020

maniyax commented Oct 16, 2020

beqabeqa473 commented Oct 17, 2020 via email

rugk commented Oct 17, 2020

winman3000 commented Feb 9, 2021

citizenserious commented Dec 7, 2021

zstanecic commented Dec 7, 2021 via email

uncle-ben-devel commented Aug 14, 2022

winman3000 commented Aug 14, 2022

nshmyrev commented Aug 14, 2022 • edited

winman3000 commented Aug 15, 2022

zstanecic commented Aug 15, 2022 via email

thorstenMueller commented Jan 7, 2023

zstanecic commented Jan 7, 2023 via email

thorstenMueller commented Mar 11, 2023

zstanecic commented Mar 11, 2023 via email

thorstenMueller commented Mar 11, 2023

zstanecic commented Mar 11, 2023 via email

thorstenMueller commented Mar 13, 2023

BluePixel4k commented Mar 19, 2023

svnpsc commented Aug 26, 2023

Foexle11 commented Nov 23, 2023

thorstenMueller commented Nov 25, 2023

citizenserious commented Apr 17, 2024

nshmyrev commented Aug 14, 2022 •

edited