Permalink
Browse files

Merge pull request #18 from hlthu/master

 Add librispeech results of TDNNF
  • Loading branch information...
syhw committed Aug 7, 2018
2 parents 8f1496a + b7315b3 commit 2b66dc0b0ee7945b7a1c8d74538328e7fcc10fc4
Showing with 2 additions and 0 deletions.
  1. +2 −0 README.md
View
@@ -12,6 +12,7 @@ WER are we? An attempt at tracking states of the art(s) and recent results on sp
| :------------- | :------------- | :------------- | :-------- | :-----: |
| 5.83% | 12.69% | [Deep Speech 2: End-to-End Speech Recognition in English and Mandarin](http://arxiv.org/abs/1512.02595v1) | December 2015 | *Humans* |
| 3.19% | 7.64% | [The CAPIO 2017 Conversational Speech Recognition System](https://arxiv.org/abs/1801.00059) | April 2018 | TDNN + TDNN-LSTM + CNN-bLSTM + Dense TDNN-LSTM across two kinds of trees
| 3.80% | 8.76% | [Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks](http://www.danielpovey.com/files/2018_interspeech_tdnnf.pdf) | Interspeech, Sept 2018 |[Kaldi recipe](https://github.com/kaldi-asr/kaldi/blob/master/egs/librispeech/s5/local/chain/tuning/run_tdnn_1d.sh), 17-layer TDNN-F + iVectors|
| 3.82% | 12.76% | [Improved training of end-to-end attention models for speech recognition](https://www-i6.informatik.rwth-aachen.de/publications/download/1068/Zeyer--2018.pdf) | Interspeech, Sept 2018 | encoder-attention-decoder end-to-end model |
| 4.28% | | [Purely sequence-trained neural networks for ASR based on lattice-free MMI](http://www.danielpovey.com/files/2016_interspeech_mmi.pdf) | September 2016 | HMM-TDNN trained with MMI + data augmentation (speed) + iVectors + 3 regularizations |
| 4.83% | | [A time delay neural network architecture for efficient modeling of long temporal contexts](http://speak.clsp.jhu.edu/uploads/publications/papers/1048_pdf.pdf) | 2015 | HMM-TDNN + iVectors |
@@ -123,6 +124,7 @@ TODO?
* DNN: deep neural network
* CNN: convolutional neural network
* DBN: deep belief network (RBM-based DNN)
* TDNN-F: a factored form of time delay neural networks (TDNN)
* RNN: recurrent neural network
* LSTM: long short-term memory
* CTC: connectionist temporal classification

0 comments on commit 2b66dc0

Please sign in to comment.