Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] GUI Interface #6

Open
photkey opened this issue Mar 21, 2022 · 7 comments
Open

[Feature Request] GUI Interface #6

photkey opened this issue Mar 21, 2022 · 7 comments

Comments

@photkey
Copy link

photkey commented Mar 21, 2022

Functions similar to: https://azure.microsoft.com/en-us/services/cognitive-services/text-to-speech/#features
I made a simple sketch with this page and included another paragraph of reference data
界面

roleList = ['Default', 'YoungAdultFemale', 'YoungAdultMale', 'OlderAdultFemale', 'OlderAdultMale', 'SeniorFemale', 'SeniorMale', 'Girl', 'Boy']
voices = {
    'zh-CN': {
        'zh-CN-YunyeNeural': {
            'name': 'Yunye',
            'style': ['General', 'angry', 'calm', 'cheerful', 'disgruntled', 'embarrassed', 'fearful', 'sad', 'serious'],
            'role': roleList,
            'style_support': 'Optimized for story narrating, multiple role-play and styles available using SSML'},
        'zh-CN-XiaoxiaoNeural': {
            'name': 'Xiaoxiao',
            'style': ['General', 'affectionate', 'angry', 'assistant', 'calm', 'chat', 'cheerful', 'customerservice', 'disgruntled', 'fearful', 'gentle', 'lyrical', 'newscast', 'sad', 'serious'],
            'role': None,
            'style_support': 'General, multiple voice styles available using SSML'},
        'zh-CN-XiaohanNeural': {
            'name': 'Xiaohan',
            'style': ['General', 'affectionate', 'angry', 'calm', 'cheerful', 'disgruntled', 'embarrassed', 'fearful', 'gentle', 'sad', 'serious'],
            'role': None,
            'style_support': 'General, multiple styles available using SSML'},
        'zh-CN-XiaoruiNeural': {
            'name': 'Xiaorui',
            'style': ['General', 'angry', 'calm', 'fearful', 'sad'],
            'role': None,
            'style_support': 'Senior voice, multiple styles available using SSML'},
        'zh-CN-XiaomoNeural': {
            'name': 'Xiaomo',
            'style': ['General', 'affectionate', 'angry', 'calm', 'cheerful', 'depressed', 'disgruntled', 'embarrassed', 'envious', 'fearful', 'gentle', 'sad', 'serious'],
            'role': roleList,
            'style_support': 'General, multiple role-play and styles available using SSML'},
        'zh-CN-XiaochenNeural': {
            'name': 'Xiaochen',
            'style': None,
            'role': None,
            'style_support': 'Optimized for spontaneous conversation'},
        'zh-CN-XiaoqiuNeural': {
            'name': 'Xiaoqiu',
            'style': None,
            'role': None,
            'style_support': 'Optimized for narrating'}
    },

    'en-US': {
        'en-US-GuyNeural': {
            'name': 'Guy',
            'style': ['General', 'newscast'],
            'role': None,
            'style_support': 'General, multiple voice styles available using SSML'},
        'en-US-AriaNeural': {
            'name': 'Aria',
            'style': ['General', 'chat', 'cheerful', 'customerservice', 'empathetic', 'narration-professional', 'newscast-casual', 'newscast-formal'],
            'role': None, 'style_support': 'General, multiple voice styles available using SSML'},
        'en-US-JennyNeural': {
            'name': 'Jenny',
            'style': ['General', 'assistant', 'chat', 'customerservice', 'newscast'],
            'role': None,
            'style_support': 'General, multiple voice styles available using SSML'},
        'en-US-SaraNeural': {
            'name': 'Sara',
            'style': ['General', 'angry', 'cheerful', 'sad'],
            'role': None,
            'style_support': 'General, multiple voice styles available using SSML'}
    }
}

# role、style Data from:https://azure.microsoft.com/en-us/services/cognitive-services/text-to-speech/#features
# Style_support Data from:https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/language-support#text-to-speech

Supports accepting command line arguments to start: srt_file ,out_file, voice.

A SSML template needs to be built in, e.g.

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis"
       xmlns:mstts="https://www.w3.org/2001/mstts" xml:lang="{lang}">
    <voice name="{voice}">
        <mstts:express-as role="{role}" style="{style}" styledegree="{styledegree}">
        <prosody pitch="{pitch}" rate="{rate}" volume="{volume}">
            {text}
        </prosody>
        </mstts:express-as>
    </voice>
</speak>
@rany2
Copy link
Owner

rany2 commented Mar 21, 2022 via email

@photkey
Copy link
Author

photkey commented Mar 21, 2022

Your development speed, every time, is amazingly fast, looking forward to your good news.
Please let me know if you need me to do the tedious and unskilled work of organizing voice lists, supported styles, etc. I will be happy to do.

@rany2
Copy link
Owner

rany2 commented May 29, 2022

I'm sorry but I've been really busy these few weeks. I'll try to do something about this soon but no ETA or promises :(

@rany2
Copy link
Owner

rany2 commented Aug 4, 2022

Sadly a lot of the features which were previously available had to remove because of Microsoft increasing security of their API endpoint. For example, now custom SSML is no longer available; so for now we have to get rid of a lot of these features and simplify everything to only what Edge could offer (very sad..)

@photkey
Copy link
Author

photkey commented Aug 4, 2022

This is indeed a bit of a shame, and I noticed the change when edge-tts was updated. Hopefully the update to edge-srt-to-speech will now be synchronized to remove the parts that are no longer working (sadly, but no way).
I've been looking forward to the gui, hopefully when you're not busy you can still implement it, thanks.

@photkey
Copy link
Author

photkey commented Oct 10, 2022

@rany2 If you have time to hopefully update this library, declare: absolutely this is absolutely not a rush, just afraid you will forget this library, remind it Oh. Thank you.

@rany2
Copy link
Owner

rany2 commented Jan 5, 2023

I haven't forgotten about this, but I'm not able to work on this now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants