Skip to content

eftekhar-hossain/Word-Embedding-on-Bangla-Text

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Word Embedding on a Bengali Text Corpus

  • Created a word embedding model for Bangla text corpus.
  • Used Word2Vec algorithm.
  • Used a publicly availabe dataset of 0.1 Milion Bangla news articles.
  • Visualized the word similarity using t-sne plot.

Word Similarity Output of the developed model-

similar

similar

Visualization Using T-SNE Plot

word2vec

References:

  1. Bangla News Dataset
  2. The AI University

Releases

No releases published

Packages

No packages published