This repository contains an implementation of an LSTM model for language modeling on the Penn Treebank dataset. More details on LSTM network architectures for state-of-the-art language models may be found in *On the State of the Art of Evaluation in Neural Language Models*.
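The command-line flags below configure a fairly standard word-level LSTM language model. As a point of reference, here is a minimal sketch of that kind of architecture, assuming a PyTorch implementation; the class and argument names are illustrative and not taken from this repo's code:

```python
import torch.nn as nn

class LSTMLanguageModel(nn.Module):
    """Minimal word-level LSTM language model with optional weight tying."""

    def __init__(self, vocab_size, hidden_size, num_layers=2,
                 dropout=0.5, tie_weights=True):
        super().__init__()
        self.drop = nn.Dropout(dropout)
        self.embed = nn.Embedding(vocab_size, hidden_size)
        self.lstm = nn.LSTM(hidden_size, hidden_size, num_layers,
                            dropout=dropout, batch_first=True)
        self.decoder = nn.Linear(hidden_size, vocab_size)
        if tie_weights:
            # Share the input embedding and output projection matrices
            # (both have shape vocab_size x hidden_size). This is the
            # behavior that a flag like --no-wt would disable.
            self.decoder.weight = self.embed.weight

    def forward(self, tokens, hidden=None):
        x = self.drop(self.embed(tokens))          # (batch, seq, hidden)
        output, hidden = self.lstm(x, hidden)
        logits = self.decoder(self.drop(output))   # (batch, seq, vocab)
        return logits, hidden
```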
To train a model, clone the repo and run `main.py`:
```
usage: main.py [-h] [--model DIR] [--lr N] [--hs N] [--nlayers N] [--no-wt]
               [--maxnorm N] [--dropout N] [-v N] [--data DATA] [-b N]
               [--bptt N] [--epochs N] [--ngram] [-e] [-p] [--sample SAMPLE]

Language Model

optional arguments:
  -h, --help       show this help message and exit
  --model DIR      path to model
  --lr N           learning rate
  --hs N           size of hidden state
  --nlayers N      number of layers in rnn
  --no-wt          disable weight tying in network
  --maxnorm N      maximum gradient norm for clipping
  --dropout N      dropout probability
  -v N             vocab size
  --data DATA      path to data
  -b N             batch size
  --bptt N         backprop through time length (sequence length)
  --epochs N       number of epochs
  --ngram          use ngram language model
  -e, --evaluate   run model only on validation set
  -p, --predict    save predictions on final input data
  --sample SAMPLE  number of sentences to sample
```
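Two of these flags govern the standard tricks for training RNN language models: `--bptt` sets the truncated backpropagation-through-time window, and `--maxnorm` sets the gradient-clipping threshold. A sketch of how one training step typically applies them; the function and variable names here are illustrative, not taken from `main.py`:

```python
import torch

def detach_hidden(hidden):
    """Detach the hidden state so gradients stop at the BPTT boundary."""
    return tuple(h.detach() for h in hidden)

def train_step(model, criterion, optimizer, inputs, targets, hidden, max_norm):
    # inputs/targets: (batch, bptt) slices of the corpus, offset by one token
    hidden = detach_hidden(hidden)  # truncate backprop at --bptt tokens
    logits, hidden = model(inputs, hidden)
    loss = criterion(logits.reshape(-1, logits.size(-1)), targets.reshape(-1))
    optimizer.zero_grad()
    loss.backward()
    # --maxnorm: rescale gradients whose global norm exceeds max_norm,
    # which keeps long-sequence training from blowing up
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm)
    optimizer.step()
    return loss.item(), hidden
```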
For training, we found that the following hyperparameters worked well:

```
python main.py -b 128 --bptt 64 --epochs 20 --nlayers 2
```
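Once a model has been trained, the evaluation and sampling flags can be pointed at the saved checkpoint. The checkpoint path below is hypothetical; substitute whatever `--model` resolves to in your setup:

```
python main.py --model model.pt -e          # report loss on the validation set
python main.py --model model.pt --sample 5  # sample 5 sentences from the model
```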