Using REST or SDK

Jump to bottom

szhaomsft edited this page Feb 4, 2024 · 4 revisions

Recommendation:

We recommend using SDK to call TTS service instead of REST:

SDK can support events like word boundary / visemes / bookmarks etc.
easy-to-use streaming mode to allow you get audio chunks before whole request is synthesized.
out-of-box audio player so you can play the audio in supported platforms without additional codes.
SDK provides a unified interface for both online cloud TTS and embedded ones such like Hybrid TTS in EV cars.

Reference:

Word boundary: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/enable-read-aloud-for-your-application-with-azure-neural-tts/ba-p/2301422

Viseme: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-neural-text-to-speech-extended-to-support-lip-sync-with/ba-p/2356748

Hybrid TTS: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-neural-tts-now-available-on-devices-for-disconnected-and/ba-p/3716797

https://azure.microsoft.com/en-us/blog/accelerate-the-invehicle-digital-experience-with-azure-cognitive-services/

Azure TTS: Empower every person and every organization on the planet to have a delightful digital voice!
Azure Custom Voice: Build your one-of-a-kind Custom Voice and close to human Neural TTS in cloud and edge!