Sentiment Analysis using word embedding and Convolutional Neural Networks on Keras

Imdb has released a database of 50,000 movie reviews classified in two categories: Negative and Positive. This is a typical sequence binary classification problem.

The Movie Review Data is a collection of movie reviews retrieved from the imdb.com website in the early 2000s by Bo Pang and Lillian Lee. The reviews were collected and made available as part of their research on natural language processing.

The reviews were originally released in 2002, but an updated and cleaned up version were released in 2004, referred to as “v2.0”.

The dataset is comprised of 1,000 positive and 1,000 negative movie reviews drawn from an archive of the rec.arts.movies.reviews newsgroup hosted at imdb.com. The authors refer to this dataset as the “polarity dataset.”

The dataset will be downloaded automatically if not present on your disk.

Training should take ~20min. Accuracy should be ~88%.

Data Preparation

There are 3 things done in this step.

Separation of data into training and test sets.
Loading and cleaning the data to remove punctuation and numbers.
Defining a vocabulary of preferred words.

Training the model

The model summary used for the sentiment analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE		LICENSE
README.md		README.md
sentiment_analysis.ipynb		sentiment_analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

sentiment_analysis.ipynb

sentiment_analysis.ipynb

Repository files navigation

Sentiment Analysis using word embedding and Convolutional Neural Networks on Keras

Data Preparation

Training the model

About

Releases

Packages

Languages

License

Suraj-Bhor/Sentiment-Analysis

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis using word embedding and Convolutional Neural Networks on Keras

Data Preparation

Training the model

About

Resources

License

Stars

Watchers

Forks

Languages