Optimization of XTTS latency #559

abentabib · 2026-02-11T11:48:33Z

abentabib
Feb 11, 2026

I wanted to know if it'is possible to optimise latency generation. I don't know if there is an option to preload the speak_wav voice or embeddings ?
in "tts_models/multilingual/multi-dataset/xtts_v2" while using the tts_to_file function ?

Answered by eginhard

Feb 11, 2026

Yes, you can cache the embeddings for later use: https://coqui-tts.readthedocs.io/en/latest/cloning.html#usage

Otherwise XTTS also supports streaming synthesis: https://coqui-tts.readthedocs.io/en/latest/models/xtts.html#streaming-manually

View full answer

eginhard · 2026-02-11T12:44:20Z

eginhard
Feb 11, 2026
Maintainer

Yes, you can cache the embeddings for later use: https://coqui-tts.readthedocs.io/en/latest/cloning.html#usage

Otherwise XTTS also supports streaming synthesis: https://coqui-tts.readthedocs.io/en/latest/models/xtts.html#streaming-manually

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimization of XTTS latency #559

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Optimization of XTTS latency #559

Uh oh!

abentabib Feb 11, 2026

Replies: 1 comment

Uh oh!

eginhard Feb 11, 2026 Maintainer

abentabib
Feb 11, 2026

eginhard
Feb 11, 2026
Maintainer