GLaDOS voice #187

apiote · 2023-08-28T14:00:40Z

Is there a GLaDOS voice for Piper, as there was for Larynx (rhasspy/larynx#56)? Or possibly an easy way to convert one to the other?
I added phonemes and the missing entries to the JSON file, but phonemes are still missing and the model raises errors:

onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Got invalid dimensions for input: scales for the following indices
 index: 0 Got: 3 Expected: 2
 Please fix either the inputs or the model.


rmcpantoja · 2023-09-04T01:41:37Z

Tomorrow I will train on a GLaDOS dataset, but what worries me is the license for publishing it.

apiote · 2023-09-04T09:46:11Z

That's what I was afraid of. Would instructions to train a dataset on one's own be more in the clear? I have no idea about hardware requirements, though.

rmcpantoja · 2023-09-04T12:44:27Z

To make things easier, I use Colab notebooks, since I don't have the hardware. To run it locally, you would need an NVIDIA GPU, and the parameters (e.g. batch_size) can be set according to the capabilities of your GPU.

apiote · 2023-09-11T09:46:12Z

I don't have the hardware either. And I guess that even if detailed instructions were published, they could still get DMCA'd, as tools like yt-dl were.

dnhkng · 2023-11-04T20:33:26Z

@rmcpantoja any update on the Glados training?

rmcpantoja · 2023-11-08T16:14:53Z

> @rmcpantoja any update on the Glados training?

Hi @dnhkng,
I have two GLaDOS models, one in Spanish and one in English, trained with my Colab notebooks. Unfortunately, because the datasets are large, the models need more training, and I do not have the resources to buy Colab Pro. They are the following:
English and Spanish

dnhkng · 2023-11-08T16:25:58Z

@rmcpantoja The English link doesn't work. I was going to try a finetune on the original game voice data. I have 2x 4090s, so I should have enough compute.

I could rip the voices from https://theportalwiki.com/wiki/GLaDOS_voice_lines but is there a dataset with this already prepared? Happy to share the results!

rmcpantoja · 2023-11-10T13:15:52Z

Hi @dnhkng,
That's strange; I can open the Drive folder with the English model without problems. Anyway, here is a model exported to ONNX.

The model was trained using this dataset, but I had to fix many incorrect transcriptions myself.

dnhkng · 2023-11-13T14:17:21Z

@rmcpantoja
Thanks for the export! I found the checkpoint file eventually though, sorry! Sounds pretty good, I see it trained on colab for 2.25 hours.

I scraped the GLaDOS dataset (using only the Portal 2 voice lines and DLC), manually filtered out all the WAV files that contained extras (laughing, telephones, beeping, etc.), and also fixed all the text. That gave me about 1 hour of high-quality data. I have currently fine-tuned for 15 hours on a 4090; it sounds very good, and the loss is still decreasing. I will train for 24 hours and see how the loss curves look.

EDIT: Here are samples after 24 hours of fine-tuning. 'a' is the generated sample, 'b' is an unseen sample from the game.
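To check how much audio is left after filtering (the "about 1 hour" figure above), the durations of the remaining clips can be summed. This is a minimal sketch, not the script used in this thread: the function name is made up for illustration, and it assumes the clips are uncompressed PCM WAV files, which is what Python's standard `wave` module can read.

```python
import wave
from pathlib import Path


def total_wav_seconds(directory: Path) -> float:
    """Sum the playing time of every .wav file directly inside `directory`."""
    total = 0.0
    for path in sorted(directory.glob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            # duration in seconds = number of frames / frames per second
            total += wav.getnframes() / wav.getframerate()
    return total
```

Calling `total_wav_seconds(Path("glados_wavs")) / 3600` would give the dataset size in hours; `glados_wavs` is a placeholder directory name.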
https://drive.google.com/drive/folders/1WVpS2zlJ9JqXIYV8Fkjoy5Fjz-eWPaEh?usp=sharing

I think the generated sample is better! Kudos to Piper, this is amazing!

RoxBlox3 · 2023-11-26T10:55:35Z

@dnhkng Hello, is it possible to get the model?

dnhkng · 2023-11-28T14:20:37Z

> @dnhkng Hello is it possible to get the model ?

Yes, I will share it in the next few days. Doing a big refactor on the inference code.

takov751 · 2023-12-07T10:16:38Z

Sign me up as well

RoxBlox3 · 2023-12-10T20:24:57Z

@dnhkng Any update on the model?

dnhkng · 2023-12-11T08:28:26Z

OK, the model is available here:
https://github.com/dnhkng/GlaDOS

You can find the GlaDOS model in the models directory.

It includes my new code base for using the voice. See the Jupyter notebook for how to use it.

If you instead want to use it with Piper, just take a medium-size model, copy its .onnx.json file, and rename the copy to glados.onnx.json; the model will then run with Piper.
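The copy-and-rename step above can be sketched in a few lines of Python. This is only an illustration: `reuse_voice_config` is a hypothetical helper, and the donor file name in the usage note is a placeholder for whichever medium-size Piper voice config you already have.

```python
import shutil
from pathlib import Path


def reuse_voice_config(donor_config: Path, model: Path) -> Path:
    """Copy a donor voice's .onnx.json next to `model` as `<model file name>.json`."""
    # glados.onnx -> glados.onnx.json, in the same directory as the model
    target = model.with_name(model.name + ".json")
    shutil.copyfile(donor_config, target)
    return target
```

For example, `reuse_voice_config(Path("some-medium-voice.onnx.json"), Path("glados.onnx"))` would write `glados.onnx.json` beside the model.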

takov751 · 2023-12-11T09:27:27Z

Thank you very much for your work @dnhkng 👍👍👍

csukuangfj · 2023-12-13T04:05:52Z

For those of you who want to run the GlaDOS ONNX model on iOS, Android, Raspberry Pi, Windows, Linux, macOS, etc., or use it from C++, C, Go, C#, Kotlin, Swift, Python, or Java, please have a look at https://github.com/k2-fsa/sherpa-onnx

We provide a Colab notebook that shows how to convert the GlaDOS model for sherpa-onnx:
https://colab.research.google.com/drive/1m3Zr8H1RJaoZu4Y7hpQlav5vhtw3A513?usp=sharing

The following is a sample command that uses the converted model with sherpa-onnx:

# You can also use sherpa-onnx-offline-tts-play

sherpa-onnx-offline-tts \
  --vits-model=./glados.onnx \
  --vits-tokens=./tokens.txt \
  --vits-data-dir=./espeak-ng-data \
  --output-filename=./test-glados.wav \
  "How are you doing? This is a text-to-speech application using next generation Kaldi."
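If you want to drive the same command from a script, one option is to build the argument list in Python and hand it to `subprocess`. This is a sketch under assumptions: the helper names are hypothetical, only the flags shown in the command above are used, and the `sherpa-onnx-offline-tts` binary is assumed to be on your `PATH` with the model files in `model_dir`.

```python
import subprocess
from pathlib import Path


def build_tts_command(text: str, output: Path, model_dir: Path = Path(".")) -> list[str]:
    """Build the sherpa-onnx-offline-tts argument list from the sample command."""
    return [
        "sherpa-onnx-offline-tts",
        f"--vits-model={model_dir / 'glados.onnx'}",
        f"--vits-tokens={model_dir / 'tokens.txt'}",
        f"--vits-data-dir={model_dir / 'espeak-ng-data'}",
        f"--output-filename={output}",
        text,  # the sentence to synthesize is the final positional argument
    ]


def synthesize(text: str, output: Path, model_dir: Path = Path(".")) -> None:
    """Run the command; raises CalledProcessError if the tool exits with an error."""
    subprocess.run(build_tts_command(text, output, model_dir), check=True)
```

Passing the arguments as a list avoids shell quoting problems when the text contains quotes or question marks.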

test-glados.mov

csukuangfj · 2023-12-13T07:36:54Z

By the way, I just managed to build Android APKs for the pre-trained GLaDOS models mentioned in this issue, i.e., for the following two models:

A Spanish model from https://drive.google.com/file/d/12tNCCyd0Hf5jsyqCw8828kLSHHx5LOw9/view?usp=drive_link
An English model from https://github.com/dnhkng/GlaDOS

You can find the APKs at
https://k2-fsa.github.io/sherpa/onnx/tts/apk.html

For your convenience, the download address is given below:

If you are interested in how we build the APKs, please read the following documentation:
https://k2-fsa.github.io/sherpa/onnx/android/index.html

You can also try the models in your browser in the following Hugging Face space:

http://huggingface.co/spaces/k2-fsa/text-to-speech

csukuangfj mentioned this issue Dec 13, 2023

Add two GLaDOS TTS models k2-fsa/sherpa-onnx#481

Merged
