Replies: 2 comments
-
|
I am not sure this info helpful or not. G2P: |
Beta Was this translation helpful? Give feedback.
-
|
Your info. is fully correct dear. this approach will work to add multilingual support in any phoneme based tts like styletts2 & piper both. for regular expression you text should be like this in the case of if you want to add language switching styletts2 or piper or kokoro. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
We’d like to explore how to make piper models multilingual so they can handle both English and Farsi seamlessly. This discussion aims to collect ideas, best practices, and potential technical approaches for adding multilingual capabilities.
Some open points to consider:
Architecture design: Should we train a single unified model for multiple languages, or separate models with a shared front-end?
Data requirements: What datasets are available for Farsi? How can we balance the amount of English and Farsi data?
Tokenization strategies: Is it better to use a shared tokenizer (e.g., SentencePiece/BPE covering both scripts) or language-specific ones?
Inference pipeline: Should the system auto-detect the language, or require explicit language selection from the user?
Evaluation metrics: How do we ensure quality across both languages (e.g., WER, BLEU, MOS for speech/text outputs)?
Deployment considerations: Performance implications of supporting multiple languages in a single model.
We invite contributions, suggestions, and references to prior work or experiments. If anyone has experience building multilingual systems (especially with low-resource languages like Farsi), your input would be extremely valuable.
Beta Was this translation helpful? Give feedback.
All reactions