I have a workaround for slow inference on CPU: set the environment variable OMP_NUM_THREADS=1 before running the Python script. When PyTorch is allowed to set the thread count equal to the number of CPU cores, synthesizing text takes 10x longer.
This is really a problem with PyTorch and the BLAS libraries, not with TTS, but it creates the perception that TTS inference is slow. I would suggest documenting it in the README.
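For anyone who would rather set this inside the script than in the shell, here is a minimal sketch of the same workaround. The helper name `limit_cpu_threads` is my own and not part of TTS or PyTorch. It assumes the variable must be set before PyTorch is imported, since OpenMP reads it at startup. The `torch.set_num_threads` call is PyTorch's own setting for intra-op CPU threads and is added here as an extra precaution.

```python
import os


def limit_cpu_threads(num_threads: int = 1) -> None:
    """Cap the OpenMP thread count (hypothetical helper, not a TTS API).

    Call this before importing torch so the OpenMP runtime sees the value.
    """
    os.environ["OMP_NUM_THREADS"] = str(num_threads)


# Set the variable first, then import the heavy libraries.
limit_cpu_threads(1)

try:
    import torch  # imported only after OMP_NUM_THREADS is set

    # Also cap PyTorch's intra-op CPU thread pool from inside the process.
    torch.set_num_threads(1)
except ImportError:
    # PyTorch is not installed here; the environment variable is still set.
    torch = None
```

From the shell, the equivalent is prefixing the command with `OMP_NUM_THREADS=1` when launching your script. Whether 1 is the best value probably depends on the machine, so it may be worth trying a few small values.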