Using REST or SDK

szhaomsft edited this page Feb 4, 2024 · 4 revisions

Recommendation:

We recommend calling the TTS service through the SDK rather than the REST API:

  • The SDK supports synthesis events such as word boundaries, visemes, and bookmarks.
  • Its streaming mode is easy to use and lets you receive audio chunks before the whole request has been synthesized.
  • It includes an out-of-the-box audio player, so you can play audio on supported platforms without writing additional code.
  • It provides a unified interface for both online cloud TTS and embedded TTS, such as Hybrid TTS in EV cars.
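As a rough illustration of the first two points, the sketch below subscribes to word boundary and streaming events using the Python Speech SDK (`azure-cognitiveservices-speech`). The helper names `describe_word_boundary` and `synthesize_with_events`, and the `key` and `region` parameters, are hypothetical and exist only for this example. It assumes that `audio_offset` is reported in 100-nanosecond ticks and that each `synthesizing` event carries one audio chunk; treat it as a starting point rather than a reference implementation.

```python
from types import SimpleNamespace

TICKS_PER_MS = 10_000  # assumption: audio_offset is in 100-nanosecond ticks


def describe_word_boundary(evt):
    """Format a word boundary event as a short, readable line."""
    offset_ms = evt.audio_offset / TICKS_PER_MS
    return (f"{offset_ms:.0f} ms: {evt.text!r} "
            f"(text offset {evt.text_offset}, length {evt.word_length})")


def synthesize_with_events(text, key, region):
    """Synthesize `text`, printing word boundaries and collecting audio chunks.

    Hypothetical helper: `key` and `region` are your own Speech resource values.
    """
    # Imported lazily so the rest of this file works without the SDK installed.
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    # audio_config=None keeps the audio in memory instead of playing it.
    synthesizer = speechsdk.SpeechSynthesizer(
        speech_config=speech_config, audio_config=None)

    chunks = []
    # Word boundary events fire as the service reports word timings.
    synthesizer.synthesis_word_boundary.connect(
        lambda evt: print(describe_word_boundary(evt)))
    # Synthesizing events fire while audio is still being generated.
    synthesizer.synthesizing.connect(
        lambda evt: chunks.append(evt.result.audio_data))

    result = synthesizer.speak_text_async(text).get()
    if result.reason != speechsdk.ResultReason.SynthesizingAudioCompleted:
        raise RuntimeError(f"Synthesis did not complete: {result.reason}")
    return chunks


# Offline demo with a stand-in event object; no service call is made here.
fake_evt = SimpleNamespace(
    audio_offset=12_500_000, text="hello", text_offset=0, word_length=5)
print(describe_word_boundary(fake_evt))
# prints: 1250 ms: 'hello' (text offset 0, length 5)
```

To run against the real service, call `synthesize_with_events("Hello world", key, region)` with your own resource key and region. Visemes and bookmarks follow the same pattern through the synthesizer's `viseme_received` and `bookmark_reached` events.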

Reference:

Word boundary: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/enable-read-aloud-for-your-application-with-azure-neural-tts/ba-p/2301422

Viseme: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-neural-text-to-speech-extended-to-support-lip-sync-with/ba-p/2356748

Hybrid TTS: https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-neural-tts-now-available-on-devices-for-disconnected-and/ba-p/3716797

https://azure.microsoft.com/en-us/blog/accelerate-the-invehicle-digital-experience-with-azure-cognitive-services/
