DeFINE-AWD-LSTM

PyTorch implementation of the paper "DeFINE: Deep Factorized Input Token Embeddings for Neural Sequence Modeling", combined with the AWD-LSTM from the paper "Regularizing and Optimizing LSTM Language Models".

This implementation supports only language modeling on Penn Treebank. Fine-tuning for AWD-LSTM has not been implemented.

Training

To train the model, run:

python train.py
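
When launching several runs from a script, it can help to assemble the command line programmatically. The following is a minimal sketch, not part of this repository: `build_train_command` is a hypothetical helper, and the option values shown are illustrative placeholders rather than the script's defaults or recommended settings. Only the flag names come from the option list in this README.

```python
import shlex
import sys


def build_train_command(script="train.py", **options):
    """Return an argv list such as [python, 'train.py', '--epochs', '10'].

    Hypothetical helper: each keyword argument becomes a `--name value` pair.
    """
    argv = [sys.executable, script]
    for name, value in options.items():
        argv += [f"--{name}", str(value)]
    return argv


# Placeholder values chosen only for illustration.
cmd = build_train_command(epochs=10, batch_size=20, bptt=70)
print(shlex.join(cmd))

# To actually start training from the repository root:
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Building the command as a list and passing it to `subprocess.run` avoids shell quoting issues when paths contain spaces.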

Optional arguments

--lr               Learning rate
--epochs           Number of epochs over the whole dataset
--k                Size of the last layer of the DeFINE embeddings
--embed_dim        Embedding size for adaptive shared inputs
--h_size           Hidden size for the LSTM
--layers           Number of splits for the vocabulary
--n_layers         Number of LSTM layers
--path_train       Path to the training file
--path_dev         Path to the validation file
--path_test        Path to the test file
--dropouti         Variational dropout for the input to the first LSTM layer
--dropouth         Variational dropout for the input to the other LSTM layers
--dropout          Variational dropout for the output of the final LSTM layer
--dropout_embed    Dropout for embeddings
--N                Number of layers in DeFINE
--t1               First partition frequency for the vocabulary
--t2               Second partition frequency for the vocabulary
--n0               Minimum validation runs for non-monotonic ASGD
--log_interval     Logging interval during training
--batch_size       Batch size for training
--bptt             Mean sequence length for training
--alpha            Scaling factor for Activation Regularization
--beta             Scaling factor for Temporal Activation Regularization
--dropoutw         DropConnect for hidden-to-hidden LSTM weights
--dev_batch_size   Batch size for development
--m                Final output embedding size for DeFINE embeddings
--clip             Gradient norm for clipping
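
Two of these options, `--bptt` and `--n0`, are easier to understand with a small example. The sketch below follows the procedure described in the AWD-LSTM paper (variable-length backpropagation through time and the non-monotonic trigger for switching to ASGD). It is an assumption that `train.py` behaves the same way. The function names, the 0.95 probability, the standard deviation of 5, and the minimum length of 5 are taken from the paper's description or chosen for illustration, not read from this repository.

```python
import random


def sample_seq_len(bptt, rng=random, p_full=0.95, std=5.0, min_len=5):
    """Draw a BPTT window length whose mean is close to `bptt`."""
    # Use the full length most of the time; occasionally halve it.
    base = bptt if rng.random() < p_full else bptt / 2.0
    # Jitter around the base length, never going below `min_len`.
    return max(min_len, int(rng.gauss(base, std)))


def should_switch_to_asgd(past_val_losses, current_val_loss, n0):
    """Non-monotonic trigger for switching from SGD to ASGD.

    Returns True when more than `n0` validation runs have been logged and
    the current loss is worse than the best loss among all logged runs
    except the most recent `n0`.
    """
    if n0 < 1:
        raise ValueError("n0 must be at least 1")
    if len(past_val_losses) <= n0:
        return False
    return current_val_loss > min(past_val_losses[:-n0])


# Validation has stopped improving relative to older runs, so switch.
print(should_switch_to_asgd([5.0, 4.0, 4.5, 4.6], 4.7, n0=2))
# Validation is still improving, so keep using plain SGD.
print(should_switch_to_asgd([5.0, 4.8, 4.6, 4.4], 4.2, n0=2))
```

A larger `n0` makes the switch more conservative, because the current loss is compared only against older validation runs.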

Repository: pranavajitnair/DeFINE-AWD-LSTM