Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create new language #105

Open
dhm42 opened this issue Mar 2, 2017 · 8 comments
Open

Create new language #105

dhm42 opened this issue Mar 2, 2017 · 8 comments

Comments

@dhm42
Copy link

dhm42 commented Mar 2, 2017

Hi, I want to use Merlin with French langage. Can you guide me throught the process of creating a new langage (and voice).This will be helpfull for people who want to participate in Merlin developpement.
I have a corpus of wav files with their text transcription. What does Merlin need as data (apart from alignment files and audio files).
What are the tools needed to do the linguistic and lexical analysis of the text files and what do they generate. Is there any lexical or linguistic features that needs to be developed for French Langage?
Any help will be appreciated, thanks in advance for your help.

@dreamk73
Copy link

dreamk73 commented Mar 2, 2017

You need a linguistic frontend to process the text to get relevant linguistic features for Merlin. You can start with a very simple set of features defined in the question file and see how it goes from there. Typically if you start from scratch, I would see if there is any French voice available for Festival and use that. At the very least you need information about each phoneme, accents, phrase boundaries, and counters for how many phonemes there are in the syllable, how many syllables in the phrase, etc (counting both forward and backward).

If you can't find it, I would write a script from scratch using the input transcriptions and having a small number of function words which never receive an accent and use commas in the sentences to denote phrase boundaries.

@shartoo
Copy link
Contributor

shartoo commented May 10, 2017

@dreamk73 i want to use Merlin with Chinese language ,which is same as French when constructing from scratch.There is Chinese voice dataset like 'THCHS30'.Please share an example or tutorial .

@Jackiexiao
Copy link
Contributor

@shartoo I want to use Merlin with Chinese language too. Do you have any idea now?

@shartoo
Copy link
Contributor

shartoo commented Jun 29, 2017

@Jackiexiao not yet.I'm trying,but not focusing on this topic.I have to do image processing work. You can keep my contact QQ:604135528 or gmail: shartoo518@gmail.com

@Jackiexiao
Copy link
Contributor

@dhm42 I strongly recommend the tutorial from Columbia University. It's the best tutorial for speech synthesis I have ever seen ( for a new beginner) !

@chazo1994
Copy link

@Jackiexiao I cannot access this link Merlin Instructions and Troubleshooting in your tutorial.

@Jackiexiao
Copy link
Contributor

@chazo1994 sorry, only columbia student can access it

@ecooper7
Copy link

ecooper7 commented Aug 28, 2017 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants