feat: add Edge TTS provider #30

xtmu · 2024-01-06T14:04:12Z

Edge TTS and Azure TTS are almost same,but Edge TTS don't require API Key because it's based on Edge read aloud functionality, it's free to use.

p0n1 · 2024-01-06T15:06:18Z

@xtmu Thanks for contributing. Love this! Will try to debug this on or after Monday.

I'm quite curious about if Edge TTS could be used to convert a whole book without being banned by Microsoft.

BTW, We have a tiny discord server https://discord.gg/pgp2G8zhS7. Invite you to join if you want to discuss anything.

xtmu · 2024-01-07T09:25:35Z

For now it could be used to converting a whole book.
I tested a book parsed 60 chapters among which max length is 27k Chinese characters and get a 45 mins audio. Note chapter is not split into chunks as Azure provider do so need to keep connection alive during converting, actually I get interrupted audio(10min max) if in proxy environment, I suppose it's the GFW bothering.

Here is some information:

edge-tts bypassed text length limit and seems won't be banned if conections are not thousand parallel.

retrieve title from fallback tag: <h1>,<h2>,<h3>.

p0n1 · 2024-01-09T12:54:25Z

Looks good. Will test, review and merge asap!

p0n1

Great Pull Request. I went through the code and conducted tests, and it's excellent. Thank you. In the past, I often used the voice zh-CN-YunyeNeural, but it's not supported by edge TTS. I wonder if there are any other similar recommendations.

p0n1 · 2024-01-11T13:03:27Z

audiobook_generator/book_parsers/epub_book_parser.py

@@ -45,7 +45,12 @@ def get_chapters(self, break_string) -> List[Tuple[str, str]]:
        for item in self.book.get_items_of_type(ebooklib.ITEM_DOCUMENT):
            content = item.get_content()
            soup = BeautifulSoup(content, "lxml")
-            title = soup.title.string if soup.title else ""
+            title = ""
+            title_levels = ['title', 'h1', 'h2', 'h3']


I'm a bit concerned about the possibility that items labeled as h1 h2 h3 could be non section title. However, it's not a big issue, and if there is indeed a problem, we can fix it later.

Thanks for testing. Yunjian could be an alternative, in addition, you can adjust voice_pitch and voice_rate, even for female voice, it could be sound like male if lowered by 50Hz. I use this script to try out voice.

I'm a bit concerned about the possibility that items labeled as h1 h2 h3 could be non section title. However, it's not a big issue, and if there is indeed a problem, we can fix it later.

Where did you get your audio book? I have converted my favorite two books, each page's <head> element content was somehow cleared, though, their section info is designed to be in the h1 or h2 element, so I still can get the exact section title, in the past code, I get the first 100 characters.

soup sample:

<?xml version='1.0' encoding='utf-8'?><!DOCTYPE html> <html epub:prefix="z3998: http://www.daisy.org/z3998/2012/vocab/structure/#" lang="en" xml:lang="en" xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops"> <head></head> <body> <h2>Section XX</h2>

feat: add Edge TTS provider

b60a9a6

xtmu added 2 commits January 8, 2024 19:52

fix: -h error

825e3d9

fix: retrieve title.

1be794c

retrieve title from fallback tag: <h1>,<h2>,<h3>.

xtmu marked this pull request as ready for review January 8, 2024 12:13

p0n1 approved these changes Jan 11, 2024

View reviewed changes

p0n1 merged commit 7432dc7 into p0n1:main Jan 11, 2024

xtmu deleted the dev branch January 21, 2024 20:33

p0n1 mentioned this pull request Feb 20, 2024

Better chapter title handling #47

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Edge TTS provider #30

feat: add Edge TTS provider #30

xtmu commented Jan 6, 2024 •

edited

p0n1 commented Jan 6, 2024

xtmu commented Jan 7, 2024

p0n1 commented Jan 9, 2024

p0n1 left a comment

p0n1 Jan 11, 2024

xtmu Jan 13, 2024

xtmu Jan 13, 2024

feat: add Edge TTS provider #30

feat: add Edge TTS provider #30

Conversation

xtmu commented Jan 6, 2024 • edited

p0n1 commented Jan 6, 2024

xtmu commented Jan 7, 2024

p0n1 commented Jan 9, 2024

p0n1 left a comment

Choose a reason for hiding this comment

p0n1 Jan 11, 2024

Choose a reason for hiding this comment

xtmu Jan 13, 2024

Choose a reason for hiding this comment

xtmu Jan 13, 2024

Choose a reason for hiding this comment

xtmu commented Jan 6, 2024 •

edited