[Bug] Portuguese TTS model on XTTS is pronouncing the "." (dot) character when it happens in a text #2952

Subarasheese · 2023-09-15T21:50:13Z

Describe the bug

Hello,

It seems a bit of an "oopsie" was made when handling the Portuguese dataset: the PT-BR model now pronounces the "." character as "ponto" whenever we enter sentences like:

"Olá, sou seu novo clone de voz. Faça o possível para carregar um áudio de qualidade."

Here is the output: https://vocaroo.com/1404xnr0Vkmc

It was not supposed to say "ponto"...

It goes like:

"Olá, sou seu novo clone de voz ponto Faça o possível para carregar um áudio de qualidade ponto"

But it should not be like that.

To Reproduce

Set the client to Portuguese (pt), then type anything that includes "." (dot).

Expected behavior

The dot should not be pronounced. The purpose of "." is to indicate the end of a declarative sentence or to separate certain elements in written text.

Logs

None

Environment

git clone https://huggingface.co/spaces/coqui/xtts
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
python app.py

Additional context

No response

Subarasheese · 2023-09-15T21:56:14Z

@Edresson

Edresson · 2023-09-15T22:01:47Z

Hi @Subarasheese, thanks for reporting this bug. We plan to fix this issue soon. As a workaround, I noticed that adding a space between the word and the period fixes the issue.
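A minimal sketch of the workaround above, for anyone who wants to apply it before passing text to the model. The helper name `space_before_period` is hypothetical and not part of Coqui TTS; it assumes that only periods directly after a word character and at the end of a sentence need the extra space, so decimals like "3.14" and ellipses are left alone.

```python
import re

# Hypothetical helper, not part of Coqui TTS.
# Matches a period that follows a word character and is followed by
# whitespace or the end of the text, e.g. the "." in "voz. Faça".
_PERIOD_AFTER_WORD = re.compile(r"(?<=\w)\.(?=\s|$)")


def space_before_period(text: str) -> str:
    """Insert a space between a word and its sentence-ending period."""
    return _PERIOD_AFTER_WORD.sub(" .", text)


if __name__ == "__main__":
    sample = (
        "Olá, sou seu novo clone de voz. "
        "Faça o possível para carregar um áudio de qualidade."
    )
    print(space_before_period(sample))
```

The preprocessed string can then be passed as the `text` argument in place of the original.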

Subarasheese · 2023-09-15T22:10:29Z

> Hi @Subarasheese, thanks for reporting this bug. We plan to fix this issue soon. As a workaround, I noticed that adding a space between the word and the period fixes the issue.

Thank you.
I have a question, out of curiosity: can the dataset used to train the Portuguese model be found online, or did Coqui use a private/internal dataset for Portuguese?

stale · 2023-10-17T04:56:57Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

Inc44 · 2023-10-28T17:05:06Z

A similar error exists in other languages, such as French, Russian and Japanese.
The problem appears in model xtts_v1.1, coqui 0.19.0, python 3.11.5.

Subarasheese · 2023-11-07T22:29:16Z

@Edresson The workaround (space before the dot) is not working on XTTS v2... It is still saying "dot" (ponto).
Previously the workaround worked every time, if I recall correctly.

erogol · 2023-11-08T09:26:57Z

We don't actually know why it happens. If anyone has any ideas, let us know.

Dhrog · 2023-11-09T15:58:56Z

I experienced the same problem with xtts-v2 using German.

Subarasheese · 2023-11-09T17:12:10Z

> We don't actually know why it happens. If anyone has any ideas, let us know.

Are you guys sure there isn't an issue with the dataset? What were your sources?

brambox · 2023-11-09T17:12:27Z

I'm also getting 'ponto' when fine-tuning.

Dhrog · 2023-11-09T19:07:52Z

I used the example code and read the text from a file. I installed Coqui TTS yesterday, so it is still overwhelming right now.
The sound file is attached. At one point you can hear: "Punkt dot"
It quite often happens that there are long gaps between sentences. Not sure if there is a connection to this issue?


# -*- coding: utf-8 -*-
import sys
from pathlib import Path

import torch
from TTS.api import TTS

# Read the input text. The decode chain interprets backslash escapes
# (e.g. \n) in the file while keeping UTF-8 characters intact.
raw = Path(sys.argv[1]).read_bytes()
text = raw.decode('unicode_escape').encode('latin-1').decode('utf-8')
print(text)

file_output = sys.argv[2]

# Get device
device = "cuda" if torch.cuda.is_available() else "cpu"

# List available 🐸TTS models
# print(TTS().list_models())

# Init TTS
tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2").to(device)

# Run TTS
# ❗ Since this model is a multi-lingual voice cloning model, we must set the target speaker_wav and language
# Text to speech, with a list of amplitude values as output (not used further in this script)
wav = tts.tts(text=text, speaker_wav="Data/RefClips/4.wav", language="de")
# Text to speech to a file
tts.tts_to_file(text=text, speaker_wav="Data/RefClips/4.wav", language="de", file_path=file_output)

umlaut.zip

Inc44 · 2023-11-09T19:23:47Z

Temporarily it is possible to fix this problem by replacing dots "." with exclamations "!"

Edresson · 2023-11-13T12:49:04Z

> Temporarily it is possible to fix this problem by replacing dots "." with exclamations "!"

In general, using ".." instead of "." also works for Portuguese.

wonka929 · 2023-12-09T10:03:37Z

Italian has the same issue.
Apart from workarounds, did you find a stable fix?

The ".." method does not work, and neither does "!".

Thanks

PS: for Italian, replacing "." with "\n" works.

fcrescio · 2024-01-30T20:24:14Z

This bug is still present, at least for Italian. Another workaround is to replace "." with ";".

Fgabz · 2024-03-15T14:44:17Z

We have the same issue in French.

danielmzak · 2024-04-08T02:39:37Z

In Czech (xtts_v2 model) try replacing "." with ";\n" - this will make the ends of sentences sound more natural.
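The per-language replacements reported so far in this thread could be collected into one preprocessing step, sketched below. The table `REPORTED_REPLACEMENTS` and the function `replace_periods` are hypothetical names, not part of Coqui TTS, and the entries are only what individual users reported: none is a confirmed fix, and results may differ between model versions.

```python
import re

# Replacements reported by users in this thread (assumptions, not fixes).
REPORTED_REPLACEMENTS = {
    "pt": "..",   # ".." instead of "."
    "it": ";",    # ";" instead of "."
    "cs": ";\n",  # ";\n" instead of "."
}

# A single period (not part of "...") followed by whitespace or end of text.
_SENTENCE_PERIOD = re.compile(r"(?<!\.)\.(?=\s|$)")


def replace_periods(text: str, language: str) -> str:
    """Swap sentence-ending periods for the replacement reported for `language`.

    Text in languages without a reported replacement is returned unchanged.
    """
    replacement = REPORTED_REPLACEMENTS.get(language)
    if replacement is None:
        return text
    # A function is used as the replacement so the string is inserted literally.
    return _SENTENCE_PERIOD.sub(lambda _match: replacement, text)
```

Keeping the replacements in a table makes it easy to try another character for a language when one of them stops working.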

lincoln157nascimento · 2024-05-08T21:16:33Z

Does anyone have a solution to the problem?

abhisirka2001 · 2024-08-06T01:10:38Z

Solution: replacing the full stops (.) in the text with "|" works for Portuguese, and it also adds a pause after each sentence. Using a space instead of a full stop doesn't add a pause.
However, text with "|" instead of full stops won't work for longer inputs, so use shorter text prompts of fewer than 400 tokens with "|".
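A rough sketch of how the "|" replacement and the length limit could be combined. The function `chunk_with_pipes` is hypothetical and not part of Coqui TTS. It does not count model tokens; the `max_chars` character budget is only an assumed stand-in for the 400-token limit, and its default of 200 is an arbitrary value that would need tuning.

```python
import re

# A single period (not part of "...") followed by whitespace or end of text.
_SENTENCE_PERIOD = re.compile(r"(?<!\.)\.(?=\s|$)")


def chunk_with_pipes(text: str, max_chars: int = 200) -> list[str]:
    """Replace sentence-ending periods with "|" and group sentences into chunks.

    `max_chars` is an assumed character budget, not a real token count.
    A single sentence longer than the budget is kept as its own chunk.
    """
    sentences = [s.strip() for s in _SENTENCE_PERIOD.split(text) if s.strip()]
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        piece = sentence + "|"
        candidate = f"{current} {piece}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would exceed the budget: start a new chunk.
            chunks.append(current)
            current = piece
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be synthesized with its own call, for example one `tts.tts_to_file(...)` per chunk, and the resulting audio files joined afterwards.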

Subarasheese added the "bug" (Something isn't working) label Sep 15, 2023

stale bot added the "wontfix" (This will not be worked on but feel free to help.) label Oct 17, 2023

Edresson self-assigned this Oct 20, 2023

stale bot removed the "wontfix" (This will not be worked on but feel free to help.) label Oct 20, 2023

erogol closed this as completed Nov 23, 2023

Th3rdSergeevich mentioned this issue Dec 21, 2023

[Bug] dot pronounced a "punto" in italian :D #3445

Open
