Pronounciation and other finer control #35

Bikonja · 2023-01-07T11:18:17Z

Hi,
would it be possible to extend this to support defining pronounciations for specific words? I see Edge has a few different options in the XML and there is a step here that converts the passed in message to XML before sending of to Edge TTS so perhaps exposing another function to pass in the direct XML instead for "advanced" users might be an easy way to extend support for things that aren't implemented as nice, easy options?

rany2 · 2023-01-07T11:27:17Z

This function previously existed but was removed. It doesn't work properly due to Microsoft closing up the API and so I removed it. If you send XML that uses features not present in Edge browser the API will error out now.

rany2 · 2023-01-07T11:30:14Z

It expects XML documents with only XML document formats that would be generated by Edge Browser. I believe it checks if it matches the template the browser uses.

Bikonja · 2023-01-07T11:33:35Z

Ah, thanks for the info.
If the API errors out, can you gracefully handle the error so it's a "might not work if you try it, but it's there if you want to try it"? Or was the problem that you kept getting issues opened for things that Edge didn't support and you didn't want to support that?

rany2 · 2023-01-07T11:46:13Z

Well the issue was that the API didn't error out exactly, it just returned nothing. In any case it does raise an exception for when the API returns no audio.

Either way users of other libraries that depended on edge-tts made issues because the library that they were using was using custom SSML and was not working, so I removed it entirely.

I was never able to get it to do anything useful anyway. It used to support all the Azure Cognitive Services features but not anymore. It doesn't even support setting a custom pitch value.

Bikonja · 2023-01-07T11:49:57Z

Ah... That kind of sucks.
Let me see on Monday if I can find the team responsible for the Edge and see if they are planning to make the API more stable and support those extra features. Hold your fingers crossed, but temper your expectations :)
And thank you!

rany2 · 2023-01-07T11:53:38Z

The API is stable, it's just that using the paid Azure Cognitive Services features no longer works. At the start they don't do any sort of filtering.

rany2 · 2023-01-07T11:54:01Z

It's intentional, basically!

Bikonja · 2023-01-07T11:54:50Z

Oh, so they didn't shut it down, just moved it to a paid tier? Ok, I guess that makes sense... To be fair, it's one of the best TTS's around and the basica functionality is free so guess we can be happy with that at least :)

rany2 · 2023-01-07T11:59:13Z

Well it was always a paid service. Azure Cognitive Services is a paid service for TTS generation. Edge browser's online TTS is basically just Azure Cognitive Services with restrictions.

Anyway if pronunciation and other fine control is available functionality in Edge, then I could probably add it to the library. Otherwise I guess this issue needs to be closed.

Bikonja · 2023-01-07T12:00:16Z

Got it, so it's essentially not even supposed to be free, and we just shouldn't talk about it and be happy it works :)
Cool, thanks, closing the issue.

Bikonja closed this as completed Jan 7, 2023

Bikonja closed this as not planned Won't fix, can't repro, duplicate, stale Jan 7, 2023

Cohee1207 mentioned this issue Aug 5, 2023

Edge-TTS Pitch SillyTavern/SillyTavern-Extras#106

Closed

scott306lr mentioned this issue Aug 30, 2023

Adding pitch variable back #138

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pronounciation and other finer control #35

Pronounciation and other finer control #35

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

rany2 commented Jan 7, 2023 •

edited

Loading

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

rany2 commented Jan 7, 2023

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

Bikonja commented Jan 7, 2023

Pronounciation and other finer control #35

Pronounciation and other finer control #35

Comments

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

rany2 commented Jan 7, 2023 • edited Loading

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

rany2 commented Jan 7, 2023

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023

Bikonja commented Jan 7, 2023

rany2 commented Jan 7, 2023 •

edited

Loading