Skip to content

Latest commit

 

History

History
30 lines (21 loc) · 1.65 KB

README.md

File metadata and controls

30 lines (21 loc) · 1.65 KB

SM-Dictionaries

Base dictionaries

These dictionaries are given as parameter to Montreal Forced Aligner.

Four different languages are supported for the moment:

  • French
  • English
  • German
  • Spanish

Consonant-vowel dicts

These dictionaries indicates for a given base dictionary which phones are consonants and which phonems are vowels.

This is used by Sentence Mixing library

Add a new language

If you want to use a language that is supported by an MFA pretrained model but not present in SM-Dictionaries, you can follow this procedure:

  1. Find a text dictionary in the targetted language. This dictionary should be a big list of words
  2. Use MFA's g2p application:
    1. Download the corresponding g2p pretrained model
    2. Format you text dictionary by putting every word in lowercase, on the same line, separated by spaces
    3. Run command mfa g2p [g2p model] [dictionary] output.dict
    4. Put every word in output.dict in uppercase
  3. Create a consonant-vowel dict where you specify which phonemes are consonants and which phonemes are vowels. You should take example on the supported languages of this repository. Do not forget to add the line SPACE sp

Do noy hesitate to pull request your changes.