KALDI：apply-cmvn-sliding #535

wanglong001 · 2020-04-12T14:09:51Z

🚀 Feature

Apply sliding-window cepstral mean (and optionally variance)
normalization per utterance.

Motivation

Acoustic features are extracted based on Kaldi. I want to use torchaudio instead, but there is no cmvn， I wrote a torch version of cmvn according to Kaldi

stonelazy · 2021-08-31T11:03:26Z

Dear @wanglong001 I fail to understand where exactly we would be making use of torchaudio.transforms.SlidingWindowCmn is it to normalize the output of STFT/MFCC at a window level ?
Would it be possible for you to explain on this ? Am not familiar with Kaldi.

wanglong001 · 2021-09-03T06:32:43Z

Dear @wanglong001 I fail to understand where exactly we would be making use of torchaudio.transforms.SlidingWindowCmn is it to normalize the output of STFT/MFCC at a window level ?
Would it be possible for you to explain on this ? Am not familiar with Kaldi.

Yes, normalize the output of cepstral (STFT/MFCC...) at a window level, Mainly to reduce the impact of environmental noise.

https://kaldi-asr.org/doc/apply-cmvn-sliding_8cc.html

wanglong001 mentioned this issue Apr 14, 2020

add cmvn #540

Merged

vincentqb changed the title ~~KALID：apply-cmvn-sliding~~ KALDI：apply-cmvn-sliding Apr 15, 2020

vincentqb closed this as completed in #540 Apr 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KALDI：apply-cmvn-sliding #535

KALDI：apply-cmvn-sliding #535

wanglong001 commented Apr 12, 2020

stonelazy commented Aug 31, 2021

wanglong001 commented Sep 3, 2021

KALDI：apply-cmvn-sliding #535

KALDI：apply-cmvn-sliding #535

Comments

wanglong001 commented Apr 12, 2020

🚀 Feature

Motivation

stonelazy commented Aug 31, 2021

wanglong001 commented Sep 3, 2021