Word-level language modeling RNN

This example trains a multi-layer RNN (Elman, GRU, LSTM or subLSTM) on a language modeling task. By default, the training script uses the provided Penn Treebank (PTB) dataset. The trained model can then be used by the generate script to generate new text.

# Train a subLSTM on PTB with CUDA (reaches a perplexity of 136.90 after 15 epochs)
python main.py --cuda --emsize 650 --nhid 650 --dropout 0.5 --epochs 100 --lr 0.001 --optim adam
# Generate samples from the trained subLSTM model.
python generate.py
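
Generation proceeds word by word: the script loads the saved model, feeds it a word, samples the next word from the predicted distribution, and feeds that sample back in. A rough sketch of such a sampling loop is shown below; the names (model, corpus, temperature) and tensor shapes are illustrative assumptions, not the exact code in generate.py.

import torch

# Hedged sketch of word-by-word sampling; model, corpus and temperature are
# placeholders for objects the real generate.py builds from the checkpoint.
model.eval()
hidden = model.init_hidden(1)                                  # batch size of 1
word_idx = torch.randint(len(corpus.dictionary), (1, 1))       # random seed word
words = []
with torch.no_grad():
    for _ in range(100):                                       # generate 100 words
        output, hidden = model(word_idx, hidden)               # scores over the vocabulary
        probs = output.squeeze().div(temperature).exp()        # temperature-scaled weights
        next_word = torch.multinomial(probs, num_samples=1).item()
        word_idx.fill_(next_word)                              # feed the sample back in
        words.append(corpus.dictionary.idx2word[next_word])
print(' '.join(words))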

The model uses the nn.RNN module (and its sister modules nn.GRU and nn.LSTM, plus this repository's sublstm.SubLSTM); the built-in modules will automatically use the cuDNN backend if run on CUDA with cuDNN installed.
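
The recurrent layer is chosen according to the --model argument. The sketch below shows one plausible way that dispatch could be written; the build_rnn helper and the SubLSTM constructor signature are assumptions for illustration, not necessarily the repository's exact code.

import torch.nn as nn
# from sublstm import SubLSTM   # assumption: import path for this repository's subLSTM

# Hedged sketch: choose the recurrent module based on the --model flag.
def build_rnn(model_type, emsize, nhid, nlayers, dropout):
    if model_type in ('LSTM', 'GRU'):
        # nn.LSTM / nn.GRU dispatch to cuDNN kernels when run on CUDA with cuDNN.
        return getattr(nn, model_type)(emsize, nhid, nlayers, dropout=dropout)
    if model_type in ('RNN_TANH', 'RNN_RELU'):
        nonlinearity = 'tanh' if model_type == 'RNN_TANH' else 'relu'
        return nn.RNN(emsize, nhid, nlayers, nonlinearity=nonlinearity, dropout=dropout)
    # subLSTM comes from this repository rather than torch.nn; these constructor
    # arguments are an assumption.
    return SubLSTM(emsize, nhid, nlayers, dropout=dropout)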

During training, if a keyboard interrupt (Ctrl-C) is received, training is stopped and the current model is evaluated against the test dataset.
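
This is the usual pattern of wrapping the epoch loop in a try/except block. A minimal sketch is given below; train(), evaluate(), test_data and args stand in for the script's actual functions and objects.

import math

# Hedged sketch of the Ctrl-C handling; train() and evaluate() are placeholders
# for the script's real training and evaluation routines.
try:
    for epoch in range(1, args.epochs + 1):
        train()                                   # one pass over the training set
except KeyboardInterrupt:
    print('Exiting from training early')

test_loss = evaluate(test_data)                   # final evaluation on the test set
print('test perplexity {:8.2f}'.format(math.exp(test_loss)))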

The main.py script accepts the following arguments; a short sketch after the list shows how the --optim choice maps onto torch.optim:

optional arguments:
  -h, --help         show this help message and exit
  --data DATA        location of the data corpus
  --model MODEL      type of recurrent net (RNN_TANH, RNN_RELU, LSTM, GRU,
                     subLSTM)
  --emsize EMSIZE    size of word embeddings
  --nhid NHID        number of hidden units per layer
  --nlayers NLAYERS  number of layers
  --lr LR            initial learning rate
  --clip CLIP        gradient clipping
  --optim OPTIM      learning rule, supports
                     adam|sparseadam|adamax|rmsprop|sgd|adagrad|adadelta
  --epochs EPOCHS    upper epoch limit
  --batch_size N     batch size
  --bptt BPTT        sequence length
  --dropout DROPOUT  dropout applied to layers (0 = no dropout)
  --tied             tie the word embedding and softmax weights
  --seed SEED        random seed
  --cuda             use CUDA
  --log-interval N   report interval
  --save SAVE        path to save the final model
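
The names accepted by --optim correspond to the standard optimizers in torch.optim. A hedged sketch of how the string might be resolved into an optimizer instance follows; the dictionary and the use of args.lr are assumptions, not necessarily the script's exact code.

import torch.optim as optim

# Hedged sketch: map the --optim string to a torch.optim optimizer class.
optimizers = {
    'adam': optim.Adam,
    'sparseadam': optim.SparseAdam,
    'adamax': optim.Adamax,
    'rmsprop': optim.RMSprop,
    'sgd': optim.SGD,
    'adagrad': optim.Adagrad,
    'adadelta': optim.Adadelta,
}
optimizer = optimizers[args.optim](model.parameters(), lr=args.lr)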