ASR Papers

2021

MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition

Notes

Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition

Notes

2020

Enhancing Monotonic Multihead Attention for Streaming ASR

Notes

A Better and Faster End-to-End Model for Streaming ASR

A review of on-device fully neural end-to-end automatic speech recognition algorithms

2019

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Abstract

We present SpecAugment, a simple data augmentation method for speech recognition. SpecAugment is applied directly to the feature inputs of a neural network (i.e., filter bank coefficients). The augmentation policy consists of warping the features, masking blocks of frequency channels, and masking blocks of time steps. We apply SpecAugment on Listen, Attend and Spell networks for end-to-end speech recognition tasks. We achieve state-of-the-art performance on the LibriSpeech 960h and Switchboard 300h tasks, outperforming all prior work. On LibriSpeech, we achieve 6.8% WER on test-other without the use of a language model, and 5.8% WER with shallow fusion with a language model. This compares to the previous state-of-the-art hybrid system of 7.5% WER. For Switchboard, we achieve 7.2%/14.6% on the Switchboard/CallHome portion of the Hub5'00 test set without the use of a language model, and 6.8%/14.1% with shallow fusion, which compares to the previous state-of-the-art hybrid system at 8.3%/17.3% WER.
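
The two masking steps named in the abstract (blocks of frequency channels and blocks of time steps) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `mask_spectrogram`, the default mask counts and widths, and the mask value of 0.0 are all assumptions chosen for illustration, and the time-warping step is left out entirely. Features are represented as a plain list of frames, each a list of filter bank values; a real pipeline would more likely use an array library.

```python
import random


def mask_spectrogram(features, num_freq_masks=1, max_freq_width=2,
                     num_time_masks=1, max_time_width=3,
                     mask_value=0.0, rng=None):
    """Return a masked copy of `features` (a list of frames, each a list of bins).

    Hypothetical helper: all parameter names and defaults are illustrative
    assumptions, not values taken from the SpecAugment abstract.
    """
    if rng is None:
        rng = random.Random()

    # Copy so the caller's features are left untouched.
    masked = [list(frame) for frame in features]
    num_frames = len(masked)
    num_bins = len(masked[0]) if masked else 0

    # Frequency masking: overwrite a contiguous block of bins in every frame.
    for _ in range(num_freq_masks):
        width = rng.randint(0, min(max_freq_width, num_bins))
        start = rng.randint(0, num_bins - width)
        for frame in masked:
            for b in range(start, start + width):
                frame[b] = mask_value

    # Time masking: overwrite a contiguous block of whole frames.
    for _ in range(num_time_masks):
        width = rng.randint(0, min(max_time_width, num_frames))
        start = rng.randint(0, num_frames - width)
        for t in range(start, start + width):
            masked[t] = [mask_value] * num_bins

    return masked


if __name__ == "__main__":
    # Toy input: 20 frames of 8 filter bank values each.
    toy = [[1.0] * 8 for _ in range(20)]
    augmented = mask_spectrogram(toy, rng=random.Random(0))
    print(sum(v == 0.0 for frame in augmented for v in frame), "values masked")
```

Masking with 0.0 only makes sense if the features are normalized to roughly zero mean; with unnormalized features, the mean feature value would be a more natural fill.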

Notes

2018

Streaming End-to-End Speech Recognition for Mobile Devices

2017

2016 and before

2012

will-rice/asr-papers