sentiment-analysis

Sentiment analysis on the IMDB dataset. The code uses TensorFlow to implement a sequence2vec model that generates a paragraph embedding for each review, based on the paper "Distributed Representations of Sentences and Documents". The paragraph embeddings are then used as features for three classifiers: random forest, gradient-boosted decision trees (GBDT), and a support vector classifier (SVC). Of the three, SVC works best, with an accuracy of 0.82940. This is a baseline model with little tuning, so there is plenty of room for parameter adjustment.
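The classification step described above can be sketched roughly as follows. This is a minimal illustration, not the code in seq2vec.py: it assumes scikit-learn is installed, uses randomly generated vectors as a stand-in for the real paragraph embeddings, and the function name `compare_classifiers` and all hyperparameters are hypothetical choices made for the example.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC


def compare_classifiers(embeddings, labels, seed=0):
    """Fit three classifiers on paragraph embeddings; return test accuracy per model."""
    x_train, x_test, y_train, y_test = train_test_split(
        embeddings, labels, test_size=0.25, random_state=seed, stratify=labels
    )
    # Hyperparameters are illustrative defaults, not the repository's settings.
    models = {
        "random_forest": RandomForestClassifier(n_estimators=100, random_state=seed),
        "gbdt": GradientBoostingClassifier(random_state=seed),
        "svc": SVC(kernel="rbf", C=1.0),
    }
    scores = {}
    for name, model in models.items():
        model.fit(x_train, y_train)
        scores[name] = model.score(x_test, y_test)  # mean accuracy on the held-out split
    return scores


# Synthetic stand-in for real paragraph embeddings (assumption: 50-dimensional vectors).
rng = np.random.default_rng(0)
n_docs, dim = 400, 50
labels = rng.integers(0, 2, size=n_docs)
# Shift the positive class slightly so the toy problem is learnable.
embeddings = rng.normal(size=(n_docs, dim)) + 0.5 * labels[:, None]

scores = compare_classifiers(embeddings, labels)
for name, accuracy in scores.items():
    print(f"{name}: {accuracy:.3f}")
```

With real data, `embeddings` would be the matrix of learned paragraph vectors and `labels` the positive/negative review labels. The accuracies printed for the synthetic data say nothing about the results reported above.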

Below is a plot of the word vectors, with t-SNE used to reduce the dimensionality to 2.

Below is a plot of the paragraph vectors, with t-SNE used to reduce the dimensionality to 2.
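A projection like the ones described for the word and paragraph vectors could be produced along these lines. This is a sketch under assumptions: it uses scikit-learn's `TSNE`, random vectors in place of the learned embeddings, and a hypothetical helper named `project_2d`. The repository's own plotting code may differ.

```python
import numpy as np
from sklearn.manifold import TSNE


def project_2d(vectors, seed=0):
    """Reduce high-dimensional vectors to 2-D points with t-SNE."""
    # Perplexity must be smaller than the number of samples; cap it for small inputs.
    perplexity = min(30.0, (len(vectors) - 1) / 3.0)
    tsne = TSNE(n_components=2, perplexity=perplexity, init="pca", random_state=seed)
    return tsne.fit_transform(vectors)


# Random stand-in for learned word or paragraph vectors (assumption: 50 dimensions).
rng = np.random.default_rng(0)
vectors = rng.normal(size=(200, 50))

points = project_2d(vectors)
print(points.shape)
```

The two columns of `points` can then be passed to any scatter-plot routine, for example colored by sentiment label for paragraph vectors or annotated with the word for word vectors.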

Repository contents: img, README.md, seq2vec.py, util.py

saber1988/sentiment-analysis
