- Is SpecAugment suitable for traditional ASR?
- What does time warping do?
- Warping the features (time warp)
- Masking blocks of frequency channels
- Masking blocks of time steps
- LAS (Listen, Attend and Spell)
- Similar to unsupervised pre-training
- Label smoothing
- Vocal Tract Length Normalization (VTLN)
- Noisy data synthesis
- Speed Perturbation
- Acoustic Room Simulator
- Reference paper
- Implement SpecAugment with NumPy
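
As a rough illustration of the two masking steps listed above (masking blocks of frequency channels and masking blocks of time steps), here is a minimal NumPy sketch. It is not the reference implementation: the function name `spec_augment`, its parameter names, the default mask sizes, and the choice to fill masked regions with `0.0` (which assumes a mean-normalized log-mel spectrogram) are all assumptions made for this example. Time warping is deliberately left out.

```python
import numpy as np


def spec_augment(spec, num_freq_masks=1, max_freq_width=27,
                 num_time_masks=1, max_time_width=100,
                 fill_value=0.0, rng=None):
    """Apply frequency and time masking to a spectrogram.

    spec: 2-D array of shape (num_freq_bins, num_frames).
    All names and defaults here are illustrative assumptions.
    Time warping is not implemented in this sketch.
    """
    rng = np.random.default_rng() if rng is None else rng
    out = np.array(spec, dtype=float, copy=True)  # leave the input untouched
    n_freq, n_time = out.shape

    # Frequency masking: blank out a band of consecutive frequency bins.
    for _ in range(num_freq_masks):
        width = int(rng.integers(0, min(max_freq_width, n_freq) + 1))
        start = int(rng.integers(0, n_freq - width + 1))
        out[start:start + width, :] = fill_value

    # Time masking: blank out a run of consecutive frames.
    for _ in range(num_time_masks):
        width = int(rng.integers(0, min(max_time_width, n_time) + 1))
        start = int(rng.integers(0, n_time - width + 1))
        out[:, start:start + width] = fill_value

    return out


if __name__ == "__main__":
    # Hypothetical input: 80 mel bins by 200 frames of random data.
    demo = np.random.default_rng(0).normal(size=(80, 200))
    augmented = spec_augment(demo, rng=np.random.default_rng(1))
    print(augmented.shape)
```

Each mask width is drawn uniformly between zero and the configured maximum, so a given call may mask nothing at all. Passing a seeded `rng` makes the augmentation reproducible.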