Skip to content

CoESML/gfcc-speech-kaldi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gfcc-speech-kaldi

If you use this code or part of it, please cite us!

The success of any ASR system is dependent on the availability of its training data. However, output in low resource-speaking language is deteriorated due to the absence of sufficient signals processing features. In case of Indian tonal languages like Punjabi, one such difficulty is nearly zero resource conditions and language differences exist due to speaking and vocal tract length differences between children and audlt speech data.

The code attempts to create the Punjabi Children ASR structure with mismatched settings, with rigorous sound methods such as the Mel frequency cepstral factor (MFCC) and more noise robust methods employing gammatone frequency cepstral factor (GFCC).

  1. Puneet Bawa , Virender Kadyan, "Noise robust in-domain children speech enhancement for automatic Punjabi recognition system under mismatched conditions" doi: https://doi.org/10.1016/j.apacoust.2020.107810

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published