Important! The current model (FastSpeech) does not work well with short phrases (e.g. "hi", "how are you").
This package provides a fully functional, cross-platform text-to-speech engine that uses deep learning models, integrated into Unity with C#!
You can find the example repository here.
The model that we use for TTS is FastSpeech. The TFLite model was converted from a pre-trained model found in the TensorFlowTTS repository.
To prevent Unity from freezing while the TFLite model runs inference, we run the inference process in a new thread and play the audio in the main thread once it is ready.
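The threading approach described above can be sketched roughly as follows. This is a minimal illustration, not the package's actual code: `BackgroundSynthesizer`, `Speak`, and `TryGetAudio` are hypothetical names, and the `synthesize` delegate stands in for the real TFLite inference call. The sketch uses only standard .NET types so that it can be read and run outside Unity.

```csharp
using System;
using System.Collections.Concurrent;
using System.Threading;

// Hypothetical helper (not part of this package): runs a slow synthesis
// function on a worker thread and hands the finished samples back to
// whichever thread polls for them.
public sealed class BackgroundSynthesizer
{
    // Stand-in for the TFLite inference step: text in, audio samples out.
    private readonly Func<string, float[]> synthesize;

    // Thread-safe queue used to pass results from the worker to the main thread.
    private readonly ConcurrentQueue<float[]> ready = new ConcurrentQueue<float[]>();

    public BackgroundSynthesizer(Func<string, float[]> synthesize)
    {
        this.synthesize = synthesize ?? throw new ArgumentNullException(nameof(synthesize));
    }

    // Starts inference on a new background thread and returns immediately,
    // so the caller (the main thread) is never blocked by the model.
    public Thread Speak(string text)
    {
        var worker = new Thread(() => ready.Enqueue(synthesize(text)))
        {
            IsBackground = true
        };
        worker.Start();
        return worker;
    }

    // Non-blocking poll, intended to be called from the main thread.
    // Returns true and the samples once a result is ready, false otherwise.
    public bool TryGetAudio(out float[] samples)
    {
        return ready.TryDequeue(out samples);
    }
}
```

In a Unity project, one plausible way to use this is to call `TryGetAudio` from a `MonoBehaviour`'s `Update()` method and, when it returns true, copy the samples into an `AudioClip` with `AudioClip.SetData` and play it through an `AudioSource`. Unity's audio API must be called from the main thread, which is why only the inference step is moved to the worker.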
- Install OpenUPM-CLI
- Run the following command in your project folder:
openupm add com.github.voxell.unitytts
External dependencies:
- Download the TFLite models from Google Drive and import them into Unity (place them inside the Assets/StreamingAssets folder).
  - The mapper file should be moved outside Assets/StreamingAssets.
- And you are ready to go!
This repository as a whole is licensed under the GNU General Public License, Version 3. Individual files may have a different but compatible license.
See license file for details.
- FastSpeech
- MelGAN
- None of the TFLite model inference would be possible without the help of the Unity TFLite repository.