RNNLMPara

Why this tool?

The parallel RNN trainer implements the two-stage class RNNs and parallel RNNs proposed in the following paper:

Z. H. Huang, G. Zweig, M. Levit, B. Dumoulin, B. Oguz and S. Chang, Accelerating Recurrent Neural
Network Training via Two Stage Classes and Parallelization, in Automatic Speech Recognition and
Understanding (ASRU), 2013.

Two-stage class RNNs use two stages of classes (super classes and classes) instead of a single class layer. The parallel RNN trainer splits the training data into batches and dispatches jobs to multiple CPUs/nodes to train slave models. Two-stage class RNNs and parallel RNNs not only achieve equal or lower WERs than the original RNNs, but also accelerate training by 2 and 10 times, respectively. The code is based on RNNLM 0.3e (Tomas Mikolov). The following changes were made:

  1. Separate the vocabulary handling into its own Vocab class

  2. Add two-stage classes (super classes and classes) to speed up training. Super classes can be generated in two ways: evenly or based on frequency (a rough sketch of the frequency-based option follows this list).

  3. New maxent feature hash function

  4. Explicit RNN constructors to initialize either randomly or from a model file

  5. Submit HPC jobs to train slave RNN models

  6. Update the master model once HPC slave RNN model training has finished
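
For intuition only, here is a minimal sketch of frequency-based grouping by cumulative mass, which the two-stage option in item 2 can apply at both stages (words into classes, then classes into super classes). The function name assignByFrequency and the toy counts are hypothetical illustrations, not the actual classes or API in this repository.

```cpp
// Hypothetical sketch of frequency-based grouping; not this repository's code.
#include <algorithm>
#include <cstdio>
#include <vector>

// Assign each item to one of nBuckets groups by cumulative frequency so that
// every group covers roughly the same share of the total mass. Items are
// expected to be sorted by descending frequency.
std::vector<int> assignByFrequency(const std::vector<double>& freq, int nBuckets) {
    double total = 0.0;
    for (double f : freq) total += f;

    std::vector<int> bucket(freq.size());
    double accum = 0.0;  // mass accumulated before the current item
    for (std::size_t i = 0; i < freq.size(); ++i) {
        bucket[i] = std::min(static_cast<int>(accum / total * nBuckets), nBuckets - 1);
        accum += freq[i];
    }
    return bucket;
}

int main() {
    // Toy unigram counts, sorted by descending frequency.
    std::vector<double> wordFreq = {30, 25, 15, 10, 8, 6, 4, 2};

    // Stage 1: words -> classes.
    std::vector<int> wordClass = assignByFrequency(wordFreq, 4);

    // Stage 2: classes -> super classes, using the mass accumulated per class.
    std::vector<double> classFreq(4, 0.0);
    for (std::size_t w = 0; w < wordFreq.size(); ++w) classFreq[wordClass[w]] += wordFreq[w];
    std::vector<int> superClass = assignByFrequency(classFreq, 2);

    for (std::size_t w = 0; w < wordFreq.size(); ++w)
        std::printf("word %zu -> class %d -> super class %d\n",
                    w, wordClass[w], superClass[wordClass[w]]);
    return 0;
}
```

Since each group then carries a similar share of the frequency mass, the output layer can normalize over super classes, then the classes inside the chosen super class, then the words inside the chosen class, rather than the full vocabulary; that is roughly the intuition behind the training speed-up.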

Usage

Build

To build, run build.sh; the binary is generated at Release/RNNLMPara.

Experiments

See RNNOrigExp/runPennTreebank.sh and RNNParaExp/readme.txt for experiments.

Contact

Please send your questions/comments to zhiheng.huang@gmail.com.
