It seems the output is limited to 10 minutes of audio #132

Closed
boltomli opened this issue Sep 10, 2019 · 5 comments

Comments

@boltomli
Contributor

Try BatchSynthesis, which supports inputs of 50+ sentences, such as long paragraphs or chapters.

Here's a sample: https://github.com/Azure-Samples/Cognitive-Speech-TTS/tree/master/CustomVoice-API-Samples/CSharp

It seems the output is limited to 10 minutes of audio (at least using the neural option). What if I want to process a long text file, like a required reading or a chapter of a book?

Originally posted by @phly95 in #128 (comment)

@phly95

phly95 commented Sep 10, 2019

Why does it stay in the "NotStarted..." status for so long in this mode?

@phly95

phly95 commented Sep 10, 2019

Also, is there a way to get the file later when it's finished instead of leaving the command window open the whole time?

@boltomli
Contributor Author

Also, is there a way to get the file later when it's finished instead of leaving the command window open the whole time?

This depends on the details of your implementation. You could create a service, call the API asynchronously, run the job in the background, and so on.
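A minimal sketch of the "check back later" idea, assuming the synthesis job keeps running on the service side once it has been submitted and can be looked up again by an ID. Everything named here is hypothetical and not part of the samples: the `synthesis_job.json` file, the `save_job`, `load_job`, and `wait_for_job` helpers, and the `get_status` callable, which stands in for whatever status lookup your own script performs. The status strings other than `NotStarted` are assumptions too.

```python
import json
import time
from pathlib import Path
from typing import Callable, Optional

# Hypothetical file used to remember the job between sessions.
STATE_FILE = Path("synthesis_job.json")


def save_job(job_id: str, path: Path = STATE_FILE) -> None:
    """Write the job ID to disk so it survives closing the terminal."""
    path.write_text(json.dumps({"job_id": job_id}))


def load_job(path: Path = STATE_FILE) -> Optional[str]:
    """Return the saved job ID, or None if nothing was saved."""
    if not path.exists():
        return None
    return json.loads(path.read_text())["job_id"]


def wait_for_job(
    job_id: str,
    get_status: Callable[[str], str],
    interval_s: float = 30.0,
    max_checks: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll get_status(job_id) until it reports a final state.

    "Succeeded" and "Failed" are assumed names for the final states.
    Returns "TimedOut" if no final state is seen within max_checks polls.
    """
    for _ in range(max_checks):
        status = get_status(job_id)
        if status in ("Succeeded", "Failed"):
            return status
        sleep(interval_s)
    return "TimedOut"
```

The idea is to call `save_job` right after submitting, then on a later session call `load_job` and pass the result to `wait_for_job` together with your own status lookup. If the assumption holds, the machine that submitted the job does not need to stay on while the audio is being generated.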

@phly95

phly95 commented Sep 11, 2019

I am a complete beginner at this, so I don't know what you mean by any of those. I'm using the Python script at https://github.com/Azure-Samples/Cognitive-Speech-TTS/tree/master/CustomVoice-API-Samples/Python and got this line working:

python voiceclient.py -key [censored] -region eastus -file tts.txt -locale en-US -voiceId 2eafe49c-e4d6-4c49-be92-ba55f6ab3971 -format audio-24khz-160kbitrate-mono-mp3 --submit --concatenateResult

But if I'm doing this on my laptop and have to turn it off between classes or something, then the work is just wasted, because by the time it's processed I will have lost the session.

@boltomli
Contributor Author

The samples are kept very simple, as basic demos that show only how to call the API, and may not be suitable for production use.

Since the problem you are experiencing is not directly related to the speech services, we may not be able to offer a more precise solution here. It is still a good question for sites such as Stack Overflow.
