Skip to content

Latest commit

 

History

History
45 lines (33 loc) · 1.04 KB

SpecAugment.md

File metadata and controls

45 lines (33 loc) · 1.04 KB

SpecAugment

Questions

  • SpecAugment is suitable for traditional asr ?
  • What does time-warping do ?

Key Concepts

  • warping the features(time warp)
  • masking blocks of frequency
  • masking blocks of time steps
  • LAS(listen, attend and spell)
  • Similiar to unsupervised pre-training
  • Label smoothing

Methods

  • Vocal Tract Length Normalization(VTLN)
  • Noisy Data Synthesised
  • Speed Perturbation
  • Acoustic Room Simulator

Reference Repo

Reference Blog

TODO

  • Reference paper
  • Implement SpecAugment with numpy