Deep learning architectures to embed multi-word units into a vector space maximizing similarity between units of different sizes.

vijaydwivedi75/Beyond_word2vec

IIITH NLP Lab Summer Research Project

Beyond Word2Vec

  • Vijay Prakash Dwivedi
  • Dr Manish Shrivastava

Description

This project builds deep learning architectures that embed multi-word units into a vector space while maximizing similarity between units of different sizes.

Work

Implemented a Siamese MLP architecture, with the following best results.

[Figure: Siamese MLP Architecture]

* Training on 263000 samples, Testing on 113000 samples
* Accuracy on training set: 91.06%
* Accuracy on test set: 74.93%
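
The README does not spell out the layer sizes, pooling, or similarity function of the Siamese MLP, so the following is only a minimal sketch of the general idea: two inputs pass through the same weights and the resulting embeddings are compared. Everything specific here is an assumption made for illustration: mean pooling of word vectors to handle units of different lengths, tanh layers, cosine similarity, and the hypothetical names `make_layer`, `encode`, and `siamese_mlp_score`. It uses random, untrained weights and only shows the forward pass.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Return (weights, biases) for one dense layer with small random weights."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def dense(x, layer, activation):
    """Apply one dense layer followed by an element-wise activation."""
    weights, biases = layer
    return [activation(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, biases)]


def encode(x, layers):
    """Shared MLP tower: both inputs of the Siamese pair go through this."""
    for layer in layers:
        x = dense(x, layer, math.tanh)
    return x


def mean_vector(vectors):
    """Assumed pooling: average the word vectors of a unit of any length."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def siamese_mlp_score(unit_a, unit_b, layers):
    """Embed two units with the SAME layers and compare the embeddings."""
    return cosine(encode(mean_vector(unit_a), layers),
                  encode(mean_vector(unit_b), layers))


rng = random.Random(0)
DIM = 8  # hypothetical word-vector size
layers = [make_layer(DIM, 16, rng), make_layer(16, 4, rng)]

# A three-word unit and a one-word unit, with random stand-in word vectors.
phrase = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(3)]
word = [[rng.gauss(0, 1) for _ in range(DIM)]]

score = siamese_mlp_score(phrase, word, layers)
print(f"similarity between the 3-word unit and the 1-word unit: {score:.3f}")
```

Because both towers share `layers`, the score is symmetric in its arguments, and a unit compared with itself scores 1. Training would adjust the shared weights so that matching units of different sizes score high.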

Also implemented a Siamese LSTM architecture, with the following best results.

[Figure: Siamese LSTM Architecture]

* Training on 263000 samples, Testing on 113000 samples
* Accuracy on training set: 93.20%
* Accuracy on test set: 76.65%
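
As with the MLP, the README gives no details of the Siamese LSTM, so this is a hedged sketch rather than the project's implementation. It assumes a single-layer LSTM whose final hidden state is the embedding of a unit, and it borrows the `exp(-L1 distance)` similarity used in Manhattan-LSTM style models, which is a swapped-in choice and not something the source confirms. The names `init_lstm`, `lstm_encode`, and `siamese_lstm_score` are hypothetical, and the weights are random and untrained.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def init_lstm(input_dim, hidden_dim, rng):
    """One (weights, biases) pair per gate, each acting on [x_t ; h_prev]."""
    width = input_dim + hidden_dim
    scale = 1.0 / math.sqrt(width)

    def matrix():
        return [[rng.uniform(-scale, scale) for _ in range(width)]
                for _ in range(hidden_dim)]

    return {gate: (matrix(), [0.0] * hidden_dim) for gate in ("i", "f", "o", "g")}


def affine(gate_params, v):
    """Compute W v + b for one gate."""
    weights, biases = gate_params
    return [sum(w * x for w, x in zip(row, v)) + b
            for row, b in zip(weights, biases)]


def lstm_encode(sequence, params, hidden_dim):
    """Run the LSTM over a sequence of word vectors; return the last hidden state."""
    h = [0.0] * hidden_dim
    c = [0.0] * hidden_dim
    for x in sequence:
        v = x + h  # list concatenation: [x_t ; h_prev]
        i = [sigmoid(a) for a in affine(params["i"], v)]    # input gate
        f = [sigmoid(a) for a in affine(params["f"], v)]    # forget gate
        o = [sigmoid(a) for a in affine(params["o"], v)]    # output gate
        g = [math.tanh(a) for a in affine(params["g"], v)]  # candidate cell
        c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
    return h


def siamese_lstm_score(unit_a, unit_b, params, hidden_dim):
    """Encode both units with the SAME LSTM and map L1 distance into (0, 1]."""
    h_a = lstm_encode(unit_a, params, hidden_dim)
    h_b = lstm_encode(unit_b, params, hidden_dim)
    return math.exp(-sum(abs(a - b) for a, b in zip(h_a, h_b)))


rng = random.Random(0)
DIM, HIDDEN = 8, 6  # hypothetical sizes
params = init_lstm(DIM, HIDDEN, rng)

# Units of different lengths: the LSTM consumes one word vector per step.
long_unit = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(4)]
short_unit = [[rng.gauss(0, 1) for _ in range(DIM)] for _ in range(2)]

lstm_score = siamese_lstm_score(long_unit, short_unit, params, HIDDEN)
print(f"similarity between the 4-word and 2-word units: {lstm_score:.3f}")
```

Unlike the mean-pooled MLP sketch, the recurrent encoder is sensitive to word order, and it maps a unit of any length to a fixed-size vector without a separate pooling step.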
