Add studio speakers to open source XTTS! #3405

WeberJulian · 2023-12-11T17:57:58Z

Add pseudo speaker and language manager to XTTS.
Now these commands work in the CLI:

tts --model_name tts_models/multilingual/multi-dataset/xtts_v2 --list_language_idx
 > tts_models/multilingual/multi-dataset/xtts_v2 is already downloaded.
 > Using model: xtts
 > Available language ids: (Set --language_idx flag to one of these values to use the multi-lingual model.
['en', 'es', 'fr', 'de', 'it', 'pt', 'pl', 'tr', 'ru', 'nl', 'cs', 'ar', 'zh-cn', 'hu', 'ko', 'ja', 'hi']

tts --model_name tts_models/multilingual/multi-dataset/xtts_v2 --list_speaker_idx
 > tts_models/multilingual/multi-dataset/xtts_v2 is already downloaded.
 > Using model: xtts
 > Available speaker ids: (Set --speaker_idx flag to one of these values to use the multi-speaker model.
dict_keys(['Claribel Dervla', 'Daisy Studious', 'Gracie Wise', 'Tammie Ema', 'Alison Dietlinde', 'Ana Florence', 'Annmarie Nele', 'Asya Anara', 'Brenda Stern', 'Gitta Nikolina', 'Henriette Usha', 'Sofia Hellen', 'Tammy Grit', 'Tanja Adelina', 'Vjollca Johnnie', 'Andrew Chipper', 'Badr Odhiambo', 'Dionisio Schuyler', 'Royston Min', 'Viktor Eka', 'Abrahan Mack', 'Adde Michal', 'Baldur Sanjin', 'Craig Gutsy', 'Damien Black', 'Gilberto Mathias', 'Ilkin Urbano', 'Kazuhiko Atallah', 'Ludvig Milivoj', 'Suad Qasim', 'Torcull Diarmuid', 'Viktor Menelaos', 'Zacharie Aimilios', 'Nova Hogarth', 'Maja Ruoho', 'Uta Obando', 'Lidiya Szekeres', 'Chandra MacFarland', 'Szofi Granger', 'Camilla Holmström', 'Lilya Stainthorpe', 'Zofija Kendrick', 'Narelle Moon', 'Barbora MacLean', 'Alexandra Hisakawa', 'Alma María', 'Rosemary Okafor', 'Ige Behringer', 'Filip Traverse', 'Damjan Chapman', 'Wulf Carlevaro', 'Aaron Dreschner', 'Kumar Dahl', 'Eugenio Mataracı', 'Ferran Simen', 'Xavier Hayasaka', 'Luis Moray', 'Marcos Rudaski'])

tts --model_name tts_models/multilingual/multi-dataset/xtts_v2 --speaker_idx 'Ana Florence' --language_idx en --text Hello
 > tts_models/multilingual/multi-dataset/xtts_v2 is already downloaded.
 > Using model: xtts
 > Text: Hello
 > Text splitted to sentences.
['Hello']
 > Processing time: 2.065340518951416
 > Real-time factor: 1.077734722711064
 > Saving output to tts_output.wav
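
For scripting, the third command above can be assembled programmatically. The following is a minimal sketch, not part of this PR: `build_tts_command` and `known_speakers` are hypothetical names introduced here, and only the flags shown in the commands above (`--model_name`, `--speaker_idx`, `--language_idx`, `--text`) are used. The language list is copied from the `--list_language_idx` output.

```python
import shlex

MODEL_NAME = "tts_models/multilingual/multi-dataset/xtts_v2"

# Language ids copied from the --list_language_idx output above.
LANGUAGES = ['en', 'es', 'fr', 'de', 'it', 'pt', 'pl', 'tr', 'ru', 'nl',
             'cs', 'ar', 'zh-cn', 'hu', 'ko', 'ja', 'hi']


def build_tts_command(text, speaker, language, known_speakers=None):
    """Build the argument list for the `tts` CLI (hypothetical helper).

    `known_speakers` is optional; when given (for example, the names printed
    by --list_speaker_idx), the speaker is validated against it.
    """
    if language not in LANGUAGES:
        raise ValueError(f"unknown language id: {language!r}")
    if known_speakers is not None and speaker not in known_speakers:
        raise ValueError(f"unknown speaker: {speaker!r}")
    return [
        "tts",
        "--model_name", MODEL_NAME,
        "--speaker_idx", speaker,
        "--language_idx", language,
        "--text", text,
    ]


cmd = build_tts_command("Hello", "Ana Florence", "en")
# shlex.join quotes arguments containing spaces, such as the speaker name.
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` on a machine where the `tts` CLI is installed; passing a list rather than a shell string avoids quoting problems with speaker names that contain spaces.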

TTS/tts/models/xtts.py

Edresson

All looks good to me :)

Edresson · 2023-12-12T12:24:35Z

@WeberJulian @erogol I added docs for the new functionality. Now we can merge it :).

thpham · 2024-01-08T00:02:06Z

TTS/.models.json

@@ -45,7 +46,7 @@
                    "hf_url": [
                        "https://coqui.gateway.scarf.sh/hf/bark/coarse_2.pt",
                        "https://coqui.gateway.scarf.sh/hf/bark/fine_2.pt",
-                        "https://app.coqui.ai/tts_model/text_2.pt",
+                        "https://coqui.gateway.scarf.sh/hf/text_2.pt",


https://coqui.gateway.scarf.sh/hf/bark/text_2.pt
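
The mismatch pointed out here (the added URL lacks the `bark/` directory that the neighbouring entries share) is the kind of slip a small check could catch. A minimal sketch, assuming the entries of one `hf_url` list are expected to live in the same directory; `odd_directory_urls` is a hypothetical helper, not something in the TTS code base:

```python
from collections import Counter
from posixpath import dirname
from urllib.parse import urlsplit


def odd_directory_urls(urls):
    """Return the URLs whose parent directory differs from the most common one.

    Only the path component is compared; the host is ignored.
    """
    parents = [dirname(urlsplit(u).path) for u in urls]
    if not parents:
        return []
    common, _ = Counter(parents).most_common(1)[0]
    return [u for u, p in zip(urls, parents) if p != common]


# The three entries visible in the diff hunk above, after the change.
hf_url = [
    "https://coqui.gateway.scarf.sh/hf/bark/coarse_2.pt",
    "https://coqui.gateway.scarf.sh/hf/bark/fine_2.pt",
    "https://coqui.gateway.scarf.sh/hf/text_2.pt",
]

for url in odd_directory_urls(hf_url):
    print("directory differs from the other entries:", url)
```

With the URL suggested in this comment substituted for the third entry, the function returns an empty list.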

WeberJulian added 5 commits December 11, 2023 11:29

Download speaker file

0a136a8

Add basic speaker manager

36143fe

rename manager

a5c0d97

rename speaker file

0a90359

Make CLI work

e3c9dab

Edresson reviewed Dec 11, 2023

TTS/tts/models/xtts.py (outdated)

Edresson approved these changes Dec 11, 2023

WeberJulian and others added 10 commits December 11, 2023 20:21

Fix API and CI

5cd750a

Remove coqui studio integration from TTS

8c20a59

Fix CI

5ab228d

Fix CI readme

ecc3889

Remove models that require app.coqui.ai

b40750d

Remove tortoise

605a857

Make comments in .model.json valid

d47b6df

Fix read_json_with_comments

61b67ef

Add docs

b6e1ac6

Update docs

4b33699

Update .models.json

4dc0722

Edresson approved these changes Dec 12, 2023

erogol added 2 commits December 12, 2023 13:30

Update test_models.py

8999780

Update .models.json

8e6a7cb

aedocw mentioned this pull request Dec 12, 2023

Other languages? aedocw/epub2tts#71

Closed

erogol merged commit 8c1a8b5 into dev Dec 12, 2023
49 checks passed

erogol deleted the studio_speakers branch December 12, 2023 15:10

akx mentioned this pull request Dec 13, 2023

CI: remove apparently no-op check_skip steps #3129

Closed

thpham reviewed Jan 8, 2024
