Speech-Recognition-Paper-List
A List of classic or instructive paper in speech recognition.
Conventional Methods
End-to-end Models for speech recognition
CTC
Attention
RNN-Transducer
- Graves, Alex. “Sequence Transduction with Recurrent Neural Networks.” ArXiv Preprint ArXiv:1211.3711, 2012.
- Sak, Hasim, et al. “Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping.” Interspeech 2017, 2017, pp. 1298–1302.
- Rohit, et al. "A comparison ofsequence-to-sequence models for speech recognition." Interspeech2017
- Rao K, Sak H, Prabhavalkar R. Exploring architectures, data andunits for streaming end-to-end speech recognition withRNN-transducer[C]//Automatic Speech Recognition and Understanding Workshop(ASRU), 2017 IEEE. IEEE, 2017: 193-199.