emSpeech is the speech module of the e-magyar language processing system.
The speech activity detection module wraps the SHOUT.
The speaker diarization module wraps the 'shout_segment' and 'shout_cluster' programs of SHOUT, with added audio conversion using the SoX (Sound eXchange) utility. For details on the usage run
python speaker_diarization/em-dia.py --help
-
Install SHOUT. Instructions can be found here.
-
Install sox. sox should be available on most Linux distribution (e.g.
apt-get install sox
). -
Check out this repository.
-
Set the
SHOUT_DIR
environmental variable to the directory where you isntalled SHOUT:export SHOUT_DIR=/path/to/shout
This will be the default place to look for the model files SHOUT needs.
Please cite Marijn Huijbregts's dissertation on SHOUT.