New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Adding TTS Tutorials #1584

Merged

erogol merged 7 commits into dev from tutorials

Jun 2, 2022

Contributor

Aya-AlJafari commented May 20, 2022

No description provided.

erogol and others added 2 commits

May 13, 2022 14:58


          Merge pull request #1574 from coqui-ai/update_badge

f237e4c

Update CI badges


          Adding inferencing notebook

0e69367

Aya-AlJafari requested review from reuben and erogol

May 20, 2022 09:46

WeberJulian force-pushed the dev branch from 74f5c3f to ee99a6c Compare

May 20, 2022 13:53

reuben approved these changes

View reviewed changes

Contributor

reuben left a comment

Discussed feedback on Element. Looking good 🚀


          added multispeaker explanation and usecase and renamed the file

d06a730

Member

erogol commented May 25, 2022

Looking good but notebooks are not testable. So far any notebook we released as a tutorial could not be maintained. We need a way to have this notebook in the CI tests.

Contributor

reuben commented May 25, 2022

They are testable, we test our notebooks in the STT CI. Can probably copy that and adapt.

Aya-AlJafari changed the title ~~Adding inferencing notebook~~ Adding TTS Tutorials


          Adding training tutorial

441222a

Member

erogol commented May 29, 2022

They are testable, we test our notebooks in the STT CI. Can probably copy that and adapt.

can you link me where in the STT?

Member

erogol commented May 29, 2022

@Aya-AlJafari I see you are still committing. Should I wait for more?

erogol approved these changes

View reviewed changes

Contributor

reuben commented May 29, 2022

They are testable, we test our notebooks in the STT CI. Can probably copy that and adapt.

can you link me where in the STT?

Sorry I shared with Aya on chat but forgot to add here.

the STT notebook CI I referred to in the PR is here and here
the gist of it is that you can use jupyter nbconvert --to notebook --execute to run all cells of a notebook programmatically
there are also ways to replace variables or disable cells in certain cases but I'm not too familiar, I can do some research if we need that

reuben reviewed

View reviewed changes

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                  "\n",
+                  "So, let's jump right in!\n",
+                  "\n",
+                  "*PS - If you just want a working, off-the-shelf model, check out the [🐸 Model Zoo](https://www.coqui.ai/models)*"

Contributor

reuben May 29, 2022

Model zoo doesn't have TTS models.

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                  "\n",
+                  "If you have a single audio file and you need to **split** it into clips. It is also important to use a lossless audio file format to prevent compression artifacts. We recommend using **wav** file format.\n",
+                  "\n",
+                  "The data format we will be adopting for this tutorial is taken from widely-used the **LJSpeech** dataset, where **waves** are collected under a folder:\n",

Contributor

reuben May 29, 2022

Suggested change

      
                "The data format we will be adopting for this tutorial is taken from widely-used the **LJSpeech** dataset, where **waves** are collected under a folder:\n",
          
                "The data format we will be adopting for this tutorial is taken from the widely-used **LJSpeech** dataset, where **waves** are collected under a folder:\n",

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                  "\n",
+                  "### **First things first**: we need some data.\n",
+                  "\n",
+                  "We're training a Text-to-Speech model, so we need some _text_ and we need some _speech_. Specificially, we want _transcribed speech_. The speech must be divided into audio clips and each clip needs transcription. \n",

Contributor

reuben May 29, 2022

There's also many other requirements in terms of the recording characteristics, background noise, vocabulary coverage, etc. Even if going into details is not appropriate here we should at least link to more extensive documentation.

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                  "<span style=\"color:purple;font-size:15px\">\n",
+                  "/wavs<br /> \n",
+                  " &emsp;| - audio1.wav<br /> \n",
+                  " &emsp;| - udio2.wav<br /> \n",

Contributor

reuben May 29, 2022

Suggested change

      
                " &emsp;| - udio2.wav<br /> \n",
          
                " &emsp;| - audio2.wav<br /> \n",

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                  "  ...<br /> \n",
+                  "</span>\n",
+                  "\n",
+                  "and a **metdata.txt** file will have the audioname in parallel to the transcript, delimeted by `|`: \n",

Contributor

reuben May 29, 2022

Suggested change

      
                "and a **metdata.txt** file will have the audioname in parallel to the transcript, delimeted by `|`: \n",
          
                "and a **metadata.txt** file will have the audio file name in parallel to the transcript, delimited by `|`: \n",

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                  "## ⏳️ Loading your dataset\n",
+                  "Load one of the dataset supported by 🐸TTS.\n",
+                  "\n",
+                  "For this tutorial we will be using LJSpeech dataset.\n",

Contributor

reuben May 29, 2022

This was already said above.

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb

+                  "    os.makedirs(output_path)\n",
+                  "\n",
+                  "dataset_config = BaseDatasetConfig(\n",
+                  "    name=\"ljspeech\", meta_file_train=\"metadata.csv\", path=os.path.join(output_path, \"LJSpeech-1.1/\")\n",

Contributor

reuben May 29, 2022

In the examples above the metadata file has a .txt extension.

Contributor Author

Aya-AlJafari May 30, 2022 •

edited

Loading

Oh that's a bug in the documentation as well. @erogol it should be CSV right? as opposed to what's in this page
And for a CSV, should we add a header of audioname|text?

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                  "dataset_config = BaseDatasetConfig(\n",
+                  "    name=\"ljspeech\", meta_file_train=\"metadata.csv\", path=os.path.join(output_path, \"LJSpeech-1.1/\")\n",
+                  ")\n",
+                  "# You need to download LJSpeech inside output_path\n"

Contributor

reuben May 29, 2022

We should make the notebook do this instead of asking people to do it.

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb

Comment on lines +375 to +376

		" --model_path $test_ckpt \\\n",
		" --config_path $test_config \\\n",

Contributor

reuben May 29, 2022

Jupyter lets you access Python variables in inline shell calls like this, so you don't have to set them in os.environ above, just create normal Python variables test_ckpt and test_config.

notebooks/Tutorial_2_train_your_first_TTS_model.ipynb Outdated

+                 "metadata": {},
+                 "source": [
+                  "## 🎉 Congratulations! 🎉 You now have trained your first TTS model! \n",
+                  "Follow up with the next tutorials to learn more adnavced material."

Contributor

reuben May 29, 2022

Suggested change

      
                "Follow up with the next tutorials to learn more adnavced material."
          
                "Follow up with the next tutorials to learn more advanced material."

Contributor Author

Aya-AlJafari commented May 30, 2022

@Aya-AlJafari I see you are still committing. Should I wait for more?

@erogol yes I will be adding one more tutorial today

Aya-AlJafari added 3 commits

May 30, 2022 16:14


          fixed dummy paths

878a95d


          fixed review comments

f729fc0


          fixed metadata extension

cfec154

Member

erogol commented Jun 1, 2022

@TrycsPublic interesting way to send commits :)

How about sending a PR? It is challenging this way to see what you changed.

Contributor

reuben commented Jun 1, 2022

You can even make a PR for another PR by setting the base branch to tutorials instead of dev :)

erogol merged commit 68cef28 into dev

erogol deleted the tutorials branch

June 2, 2022 11:59

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet