GitHub - vijaydwivedi75/Beyond_word2vec: Deep learning architectures to embed multi-word units into a vector space maximizing similarity between units of different sizes.

IIITH NLP Lab Summer Research Project

Building deep learning architectures to embed multi-word units into a vector space maximizing similarity between units of different sizes.

Implemented a Siamese MLP architecture with following best results.

* Training on 263000 samples, Testing on 113000 samples
* Accuracy on training set: 91.06%
* Accuracy on test set: 74.93%

Also, implemented a Siamese LSTM architecture with following best results.

* Training on 263000 samples, Testing on 113000 samples
* Accuracy on training set: 93.20%
* Accuracy on test set: 76.65%

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
README.md		README.md
data_pair_neg		data_pair_neg
data_pair_pos		data_pair_pos
evals.py		evals.py
preprocess_data.py		preprocess_data.py
preprocess_data_lstm.py		preprocess_data_lstm.py
results.txt		results.txt
siamese_lstm.py		siamese_lstm.py
siamese_mlp.py		siamese_mlp.py
word2VecTF.py		word2VecTF.py