Compressing Word Embeddings via Deep Compositional Code Learning (ICLR 2018)

PyTorch implementation and Keras for testing

I got comparable results for sentiment analysis in the best configuration. I did not test it for Machine Translation.

https://openreview.net/forum?id=BJRZzFlRb

Dependencies

Keras (for testing in the LSTM IMDB sentiment analysis classification)
tensorflow (for testing in the LSTM IMDB sentiment analysis classification)
PyTorch
tqdm
torchwordemb
numpy
Pre-trained GloVe vectors (Download glove.42B.300d.zip from https://nlp.stanford.edu/projects/glove/)
git
unzip

Execution

git clone <this_project>
cd compositional_code_learning
wget http://nlp.stanford.edu/data/glove.42B.300d.zip
# Install all dependencies
unzip glove.42B.300d.zip
# The follow line generates a dataset containing only words and vectors found in IMDB and in GloVe
python gen_intersect_imdb_embeddings.py
# Learn the compact representation (please consult help for more options)
python gumbel_softmax_ae.py --path_output_codes <path> --path_output_reconstruction <path> --version <version_name>
# Test vectors using a LSTM Model for IMDB Sentiment Analysis Classification
python lstm_sent.py

Any concerns or suggestions please contact me

Credits for the implementation: Max Raphael Sobroza Marques Thanks you Raphael Shu for answer some questions about the paper

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
README.md		README.md
compositional_image.png		compositional_image.png
dl4nlp.yml		dl4nlp.yml
gen_intersect_imdb_embeddings.py		gen_intersect_imdb_embeddings.py
gumbel_softmax_ae.py		gumbel_softmax_ae.py
lstm_sent.py		lstm_sent.py
modules.py		modules.py
script.sh		script.sh
tensorflow35.yml		tensorflow35.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Compressing Word Embeddings via Deep Compositional Code Learning (ICLR 2018)

Dependencies

Execution

About

Releases

Packages

Languages

msobroza/compositional_code_learning

Folders and files

Latest commit

History

Repository files navigation

Compressing Word Embeddings via Deep Compositional Code Learning (ICLR 2018)

Dependencies

Execution

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages