Skip to content

ychfan/deep-tts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

deep-tts

Deep Learning based Text-to-Speech Synthesis (under construction)

##To Do List

  1. Feature extraction
    1. Linguistic: ourselves (or Festival as backup)
    2. Acoustic: WORLD
  2. State alignment:
    1. Kaldi (Refer to Merlin: The frame alignment and state information was obtained from forced alignment using a monophone HMM-based system with 5 emitting states per phone. Based on my previous experience, obejective model measurements highly depend on a accurate alignment.)
    2. HTS as backup
  3. Neural network: MXNet (or CNTK as backup)

About

Deep Learning based Text-to-Speech Synthesis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published