These source files are gui for users who use the coqui-TTS vits model.
- It is designed to make it easy to use the model obtained by performing voice synthesis with Vits.
- Model file (.pth), speakers (.pth), language_ids (.json), config(.json) are required.
- Model file, speakers file, language_ids file, config file must be in the same location.
- Just copy the
UI.py
file and thedesign.ui
file to the location wheresetup.py
is located. - When you run the
UI.py
file, the gui runs.
- pyqt5
- playsound==1.2.2
pip install pyqt5
pip install playsound==1.2.2
- Press the path button to select the model file.
- Select speaker and language from the box .
- Type text in the text box for voice synthesis.
- If necessary, write the name of the wav file to be printed in the output file name. The default value is output.wav.
- If necessary, check the running voice box. If checked, play the voice as soon as the synthesis is complete.
- Press the systhesis button
- Total time is the time taken from the time the synthesis button is clicked to the time the wav file is generated.
- Audio time is the length of the generated audio.
- The output file is located in the same location as the model.
- The filename of speakers file, language_ids file, config file follows the default value. If you want to change the file name above, please modify the values of config_name, lang_id_name, and speak_id_name in the code.
Ability to preload models to reduce the amount of time before voice is output