[Bug] Cant run any of the xtts models using the TTS Command Line Interface (CLI) #3270

240db · 2023-11-20T01:33:34Z

Describe the bug

Hello I just started playing with the TTS library and I am running tests using the TTS Command Line Interface (CLI).
I was able to try capacitron, vits (english and portuguese) and tacotron2 successfully. But when I tried any of the xtts models, I get the same error that suggests I have yet to set a language option.

To Reproduce

I tried running the following and it issues the error

tts --text "Welcome. This is a TTS test." --model_name "tts_models/multilingual/multi-dataset/xtts_v2" --language en --out_path TTS_english_test_xtts_output2.wav

tts --text "Welcome. This is a TTS test." --model_name "tts_models/multilingual/multi-dataset/xtts_v1.1" --language en --out_path TTS_english_test_xtts_output2.wav

I tried these commands on multiple systems yet I get the same error
AssertionError: ❗ Language None is not supported. Supported languages are ['en', 'es', 'fr', 'de', 'it', 'pt', 'pl', 'tr', 'ru', 'nl', 'cs', 'ar', 'zh-cn', 'hu', 'ko', 'ja']

Expected behavior

No response

Logs

No response

Environment

- TTS installed from pip install TTS
- Linux OS

Additional context

My guess is that --language en is ignored and perhaps the xtts_v2 and xtts_v1.1 models are required to run in Python? I wanted to try a multilingual model through the command line interface (CLI) are there any missing steps I am missing here?

I was able to run bark using

tts --text "Welcome. This is a TTS test." --model_name "tts_models/multilingual/multi-dataset/bark" --language en --out_path TTS_english_test_bark_output2.wav

The text was updated successfully, but these errors were encountered:

WeberJulian · 2023-11-20T08:38:43Z

~$ tts --help
usage: tts [-h] [--list_models [LIST_MODELS]]
           [--model_info_by_idx MODEL_INFO_BY_IDX]
           [--model_info_by_name MODEL_INFO_BY_NAME] [--text TEXT]
           [--model_name MODEL_NAME] [--vocoder_name VOCODER_NAME]
           [--config_path CONFIG_PATH] [--model_path MODEL_PATH]
           [--out_path OUT_PATH] [--use_cuda USE_CUDA] [--device DEVICE]
           [--vocoder_path VOCODER_PATH]
           [--vocoder_config_path VOCODER_CONFIG_PATH]
           [--encoder_path ENCODER_PATH]
           [--encoder_config_path ENCODER_CONFIG_PATH] [--cs_model CS_MODEL]
           [--emotion EMOTION] [--language LANGUAGE] [--pipe_out [PIPE_OUT]]
           [--speed SPEED] [--speakers_file_path SPEAKERS_FILE_PATH]
           [--language_ids_file_path LANGUAGE_IDS_FILE_PATH]
           [--speaker_idx SPEAKER_IDX] [--language_idx LANGUAGE_IDX]
           [--speaker_wav SPEAKER_WAV [SPEAKER_WAV ...]]
           [--gst_style GST_STYLE]
           [--capacitron_style_wav CAPACITRON_STYLE_WAV]
           [--capacitron_style_text CAPACITRON_STYLE_TEXT]
           [--list_speaker_idxs [LIST_SPEAKER_IDXS]]
           [--list_language_idxs [LIST_LANGUAGE_IDXS]]
           [--save_spectogram SAVE_SPECTOGRAM] [--reference_wav REFERENCE_WAV]
           [--reference_speaker_idx REFERENCE_SPEAKER_IDX]
           [--progress_bar PROGRESS_BAR] [--source_wav SOURCE_WAV]
           [--target_wav TARGET_WAV] [--voice_dir VOICE_DIR]

As you can see in the help, or in the documentation, the proper argument name for specifying the language is --language_idx

…ment In multilingual models, the target language is specified via the `--language_idx` argument. However, the `tts` CLI also accepts a `--language` argument for use with Coqui Studio, so it is easy to choose the wrong one, resulting in the following confusing error at synthesis time: ``` AssertionError: ❗ Language None is not supported. Supported languages are ['en', 'es', 'fr', 'de', 'it', 'pt', 'pl', 'tr', 'ru', 'nl', 'cs', 'ar', 'zh-cn', 'hu', 'ko', 'ja'] ``` This commit adds a better error message when `--language` is passed for a non-studio model. Fixes coqui-ai#3270, fixes coqui-ai#3291

…ment (#3294) In multilingual models, the target language is specified via the `--language_idx` argument. However, the `tts` CLI also accepts a `--language` argument for use with Coqui Studio, so it is easy to choose the wrong one, resulting in the following confusing error at synthesis time: ``` AssertionError: ❗ Language None is not supported. Supported languages are ['en', 'es', 'fr', 'de', 'it', 'pt', 'pl', 'tr', 'ru', 'nl', 'cs', 'ar', 'zh-cn', 'hu', 'ko', 'ja'] ``` This commit adds a better error message when `--language` is passed for a non-studio model. Fixes #3270, fixes #3291

240db added the bug Something isn't working label Nov 20, 2023

WeberJulian closed this as completed Nov 20, 2023

eginhard mentioned this issue Nov 23, 2023

More informative error for wrong --language argument #3294

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Cant run any of the xtts models using the TTS Command Line Interface (CLI) #3270

[Bug] Cant run any of the xtts models using the TTS Command Line Interface (CLI) #3270

240db commented Nov 20, 2023

WeberJulian commented Nov 20, 2023

[Bug] Cant run any of the xtts models using the TTS Command Line Interface (CLI) #3270

[Bug] Cant run any of the xtts models using the TTS Command Line Interface (CLI) #3270

Comments

240db commented Nov 20, 2023

Describe the bug

To Reproduce

Expected behavior

Logs

Environment

Additional context

WeberJulian commented Nov 20, 2023