/
INSTALL
58 lines (39 loc) · 2.71 KB
/
INSTALL
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
* Installation of HTS-demo_CMU-ARCTIC-SLT_STRAIGHT
==================================================
1. HTS-demo_CMU-ARCTIC-SLT_STRAIGHT requires Festival, SPTK-3.2, HTS-2.1, hts_engine_API-1.0, STRAIGHT, and MATLAB.
Please install them before running this demo.
You can download part of them from the following websites:
Festival: http://www.festvox.org/festival/
SPTK: http://sourceforge.net/projects/sp-tk/
HTS & hts_engine API: http://hts.sp.nitech.ac.jp/
STRAIGHT: http://www.wakayama-u.ac.jp/~kawahara/STRAIGHTtrial/
Note that this package doesn't contain raw audio and utterance files.
Please copy them from the Non-STRAIGHT version of this demonstration.
2. Setup HTS-demo_CMU-ARCTIC-SLT_STRAIGHT by running configure script:
% cd HTS-demo_CMU-ARCTIC-SLT_STRAIGHT
% ./configure --with-matlab-search-path=/usr/local/matlab/bin \
--with-straight-path=/home/zen/work/STRAIGHTV40 \
--with-fest-search-path=/usr/local/festival/examples \
--with-sptk-search-path=/usr/local/SPTK-3.2/bin \
--with-hts-search-path=/usr/local/HTS-2.1_for_HTK-3.4/bin \
--with-hts-engine-search-path=/usr/local/hts_engine_API-1.0/src/bin
Please adjust the above directories for your environment.
Note that you should specify festival/examples rather than festival/bin.
You can change various parameters such as speech analysis conditions and model training conditions
through ./configure arguments. For example
% ./configure MGCORDER=39 MGCLSP=0 GAMMA=0 FREQWARP=0.0 (39-th order cepstrum)
% ./configure MGCORDER=39 MGCLSP=0 GAMMA=0 FREQWARP=0.42 (39-th order Mel-cepstrum)
% ./configure MGCORDER=39 MGCLSP=0 GAMMA=3 FREQWARP=0.0 (39-th order generalized cepstrum)
% ./configure MGCORDER=39 MGCLSP=0 GAMMA=3 FREQWARP=0.42 (39-th order Mel-generalized cepstrum)
% ./configure MGCORDER=39 MGCLSP=1 GAMMA=1 FREQWARP=0.0 LNGAIN=0 (39-th order LSP, linear gain)
% ./configure MGCORDER=39 MGCLSP=1 GAMMA=1 FREQWARP=0.0 LNGAIN=1 (39-th order LSP, log gain)
% ./configure MGCORDER=39 MGCLSP=1 GAMMA=1 FREQWARP=0.42 LNGAIN=1 (39-th order Mel-LSP, log gain)
% ./configure MGCORDER=39 MGCLSP=1 GAMMA=3 FREQWARP=0.42 LNGAIN=1 (39-th order MGC-LSP, log gain)
% ./configure NSTATE=7 NITER=10 WFLOOR=5 (# of HMM states=7, # of EM iterations=10, mix weight floor=5)
Please refer to the help message for details:
% ./configure --help
3. Start running demonstration as follows:
% cd HTS-demo_CMU-ARCTIC-SLT_STRAIGHT
% make
After composing training data, HMMs are estimated and speech waveforms are synthesized.
It takes about 12 to 24 hours :-)