Switchboard

Results (WER, %) on the Switchboard datasets.

Conformer+Transformer rescoring

Reported in "Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers"
AM: Conformer with 52M parameters. SpecAug and 3-way perturbation are applied.
"Trans." in the table denotes interpolation of the 4-gram LM with the Transformer LM.
The data used for rescoring the phone-based and wp-based systems, including data/lang_{phn,bpe} and the N-best lists, is publicly available on Google Drive.

Unit	LM	SW	CH	Eval2000	Notes
phone	4-gram	7.9	16.1	12.1	---
phone	Trans.	6.9	14.5	10.7	N-best rescoring, N=40, weight=0.8
wp	4-gram	8.7	16.5	12.7	---
wp	Trans.	7.2	14.8	11.1	N-best rescoring, N=60, weight=0.8
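
The "Trans." rows combine two LM scores for every hypothesis in an N-best list and then pick the best-scoring hypothesis. The snippet below is only a minimal sketch of that idea, not the recipe's actual implementation (see local/pytorchnn for that). It assumes that scores are sentence-level log-scores, that the interpolation is linear in the log domain, and that "weight" is the share given to the Transformer LM. The names Hypothesis, interpolate_lm and rescore_nbest are hypothetical, and the numbers are made up for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    """One entry of an N-best list (hypothetical structure for illustration)."""
    text: str
    am_score: float     # acoustic log-score (higher is better)
    ngram_score: float  # 4-gram LM log-score
    trans_score: float  # Transformer LM log-score


def interpolate_lm(ngram_score: float, trans_score: float, weight: float) -> float:
    # Assumption: "weight" is the share of the Transformer LM,
    # and the 4-gram LM receives the remaining (1 - weight).
    return weight * trans_score + (1.0 - weight) * ngram_score


def rescore_nbest(hyps: List[Hypothesis], weight: float = 0.8) -> Hypothesis:
    # Return the hypothesis with the highest combined AM + interpolated LM score.
    return max(
        hyps,
        key=lambda h: h.am_score + interpolate_lm(h.ngram_score, h.trans_score, weight),
    )


# Toy 2-best list: the 4-gram LM prefers the first entry,
# the Transformer LM prefers the second.
nbest = [
    Hypothesis("i can here you", am_score=-10.0, ngram_score=-5.0, trans_score=-9.0),
    Hypothesis("i can hear you", am_score=-10.5, ngram_score=-7.0, trans_score=-6.0),
]

best_4gram_only = rescore_nbest(nbest, weight=0.0)
best_interpolated = rescore_nbest(nbest, weight=0.8)
print(best_4gram_only.text)
print(best_interpolated.text)
```

With weight=0.0 the ranking uses the 4-gram LM alone, which mirrors the "4-gram" rows. Raising the weight moves the decision toward the Transformer LM, so the top hypothesis can change.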

Experiment

For rescoring with "Trans.", please refer to local/pytorchnn/readme.

Unit	SW	CH
phone	9.9	19.4