This language model (LM) can be used together with the attention model. See here for an example.
A pretrained model can be downloaded here.
The vocab used for the LmDataset has a custom format, different from the attention model
(which you need, if you want to train it yourself).
It should be straightforward to convert from one to the other,
or to add support for the other format in LmDataset.
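As a rough illustration of such a conversion, here is a minimal sketch. Both formats are assumptions made for this example and are not confirmed by this README: one vocab file is assumed to hold a Python dict literal mapping symbol to index, the other is assumed to hold one "symbol index" pair per line. The function names are hypothetical. Check the actual files before relying on this.

```python
import ast


def dict_vocab_to_lines(dict_text):
    """Convert a dict-literal vocab (assumed format) into 'symbol index' lines (assumed format)."""
    vocab = ast.literal_eval(dict_text)  # safely parses a literal, no code execution
    # Sort by index so the output is ordered and reproducible.
    items = sorted(vocab.items(), key=lambda kv: kv[1])
    return "\n".join("%s %d" % (sym, idx) for sym, idx in items) + "\n"


def lines_to_dict_vocab(lines_text):
    """Convert 'symbol index' lines (assumed format) back into a dict."""
    vocab = {}
    for line in lines_text.splitlines():
        if not line.strip():
            continue  # skip empty lines
        # Split on the last whitespace run: the index is the final field.
        sym, idx = line.rsplit(None, 1)
        vocab[sym] = int(idx)
    return vocab
```

The two functions are inverses of each other for vocabs whose symbols contain no whitespace, so a round trip can serve as a quick sanity check after converting.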
The LM vocab file can be downloaded here.
The train files (data_files in config) are generated from the LibriSpeech LM training data.