Paper: Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach
python3.5+
tensorflow>=1.6
numpy
pandas
scikit-learn
gensim
convert
.utf-8
raw files to prosody tagged files
trans prosody tagged files to dataset
into models
use bilstm_cbow to do prosody prediction
use alignment to do prosody prediction