Name		Name	Last commit message	Last commit date
parent directory ..
conf		conf
data_simu		data_simu
fbank/tr_spatialized_all		fbank/tr_spatialized_all
local		local
README.md		README.md
cmd.sh		cmd.sh
path.sh		path.sh
run.sh		run.sh
steps		steps
utils		utils

README.md

End-to-End Far-Field Speech Recognition with Uniﬁed Dereverberation and Beamforming

This recipe provides source code for data simulation and models related to the paper "End-to-End Far-Field Speech Recognition with Uniﬁed Dereverberation and Beamforming".

Prerequisite Installation

Kaldi and ESPnet (v.0.5.3 in this repository)
MATLAB (for data simulation)

Steps

This recipe uses spatialized wsj1-2mix as the dataset, and the data simulation scripts and instructions can be found in data_simu/.
After generating the spatialized wsj1-2mix data (16 kHz, max version), just run run.sh to start data preparation, training and evaluation. You can specify some arguments to control which stages to run, e.g.

./run.sh --stage 4 --stop-stage 5

Note: You may need to modify the default paths specified in run.sh to make it work.

Note

The implementation of frontend modules (both WPE+MVDR and WPD) can be found in https://github.com/Emrys365/espnet/blob/wsj1_mix_spatialized/espnet/nets/pytorch_backend/frontends/frontend.py and https://github.com/Emrys365/espnet/blob/wsj1_mix_spatialized/espnet/nets/pytorch_backend/frontends/frontend_wpd.py respectively.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

asr1

asr1

conf

conf

data_simu

data_simu

fbank/tr_spatialized_all

fbank/tr_spatialized_all

local

local

README.md

README.md

cmd.sh

cmd.sh

path.sh

path.sh

run.sh

run.sh

steps

steps

utils

utils

README.md

End-to-End Far-Field Speech Recognition with Uniﬁed Dereverberation and Beamforming

Prerequisite Installation

Steps

Note

Files

asr1

Directory actions

More options

Directory actions

More options

Latest commit

History

asr1

Folders and files

parent directory

End-to-End Far-Field Speech Recognition with Uniﬁed Dereverberation and Beamforming

Prerequisite Installation

Steps

Note