Run: `python3 predictEmotions.py`
Different LSTM architectures, varying in number of layers and regularization tweaks, are used to perform multi-class sentiment analysis. Accuracy and loss are analyzed to compare these architectures. The implementation is in Keras.
- Take the Amazon Review dataset as input, since it contains a 5-level rating along with the review text.
- Generate a vocabulary of all words.
- Build a word-frequency table that maps each word to its frequency.
- Assign each word an index based on sorted frequency (only the top 'n' words are considered).
- Encode each review as the sequence of indices of its top 'n' frequent words; the remaining words are ignored (see the data-preparation sketch after this list).
- Train single-layer and double-layer LSTM models, each layer containing on the order of a hundred LSTM units (see the model sketch after this list).
- Tune for higher accuracy by changing the number of units in each layer and comparing the performance of the resulting architectures.
- Draw error plots of both training and test loss for each architecture to check whether the model is overfitting (see the plotting sketch after this list).
- Apply regularization such as Dropout, L1, L2, L1L2, or a combination of these to reduce overfitting.
- Draw conclusions based on the accuracy and plots obtained on the test data.
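The data-preparation steps above (vocabulary, frequency table, frequency-sorted indices, top-'n' encoding) map directly onto Keras' `Tokenizer`, which sorts its word index by descending frequency. Below is a minimal sketch of that pipeline; the `Reviews.csv` file name and the `Text`/`Score` columns come from the Kaggle dataset, while `TOP_N` and `MAX_LEN` are illustrative values, not necessarily those used in `predictEmotions.py`.

```python
import pandas as pd
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

df = pd.read_csv("Reviews.csv")           # CSV from the Kaggle dataset
texts = df["Text"].astype(str).tolist()   # review bodies
labels = df["Score"].values - 1           # shift 1-5 star ratings to 0-4 class ids

TOP_N = 5000    # only the top-n most frequent words are kept (illustrative)
MAX_LEN = 200   # reviews are padded/truncated to a fixed length (illustrative)

# Tokenizer builds the vocabulary and the word-frequency table, then
# assigns each word an index sorted by descending frequency.
tokenizer = Tokenizer(num_words=TOP_N)
tokenizer.fit_on_texts(texts)

# Encode each review as the sequence of indices of its top-n words;
# words outside the top-n are dropped by texts_to_sequences.
sequences = tokenizer.texts_to_sequences(texts)
padded = pad_sequences(sequences, maxlen=MAX_LEN)
```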
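The following is a sketch of the single- and double-layer architectures with the regularization options listed above (Dropout, and L1/L2/L1L2 via `kernel_regularizer`). It continues from the data-preparation sketch (`padded`, `labels`, `TOP_N`); the embedding size, unit counts, and training hyperparameters are assumptions for illustration, not the project's exact settings.

```python
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, Dense, Dropout
from tensorflow.keras.regularizers import l1_l2

def build_model(double_layer=False, units=100, dropout=0.0, l1=0.0, l2=0.0):
    """Single- or double-layer LSTM classifier over the 5 rating classes."""
    model = Sequential()
    model.add(Embedding(input_dim=TOP_N, output_dim=64))  # 64 is illustrative
    if double_layer:
        # The first LSTM layer must return the full sequence so that a
        # second LSTM layer can be stacked on top of it.
        model.add(LSTM(units, return_sequences=True,
                       kernel_regularizer=l1_l2(l1=l1, l2=l2)))
    model.add(LSTM(units, kernel_regularizer=l1_l2(l1=l1, l2=l2)))
    if dropout > 0.0:
        model.add(Dropout(dropout))  # dropout as an overfitting countermeasure
    model.add(Dense(5, activation="softmax"))  # one output per star rating
    model.compile(loss="sparse_categorical_crossentropy",
                  optimizer="adam", metrics=["accuracy"])
    return model

model = build_model(double_layer=True, units=100, dropout=0.5)
history = model.fit(padded, labels, validation_split=0.2,
                    epochs=5, batch_size=128)
```

Varying `units` and `double_layer` reproduces the architecture comparison, and passing `dropout`, `l1`, or `l2` applies the regularization tweaks described above.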
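A sketch of the error plots, using the `fit` history from the sketch above: overlaying training and validation loss per epoch makes an overfitting gap visible.

```python
import matplotlib.pyplot as plt

# Error plot: training vs. validation loss per epoch. Training loss that
# keeps falling while validation loss flattens or rises signals overfitting.
plt.plot(history.history["loss"], label="train loss")
plt.plot(history.history["val_loss"], label="validation loss")
plt.xlabel("epoch")
plt.ylabel("loss")
plt.title("Train vs. validation loss")
plt.legend()
plt.show()
```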
Amazon Fine Food Review Dataset: https://www.kaggle.com/snap/amazon-fine-food-reviews
The Amazon Fine Food Reviews dataset consists of reviews of fine foods from Amazon.
Number of reviews : 568,454
Number of users : 256,059
Number of products : 74,258
Timespan: Oct 1999 to Oct 2012
Number of Attributes/Columns in data: 10
- Three architectures with single-layer and double-layer LSTMs were trained on the frequency-encoded Amazon Review dataset.
- The double-layer LSTM architecture obtained the highest accuracy on the validation dataset.
- A single layer of 100 LSTM units (M1) fetched a commendable validation accuracy of 89.99%.
- The validation accuracy of the multi-layer, unit-dense LSTM stack (M3) fell to 89.97%, even though its training accuracy rose to 93.7%. The increase in training accuracy combined with the drop in validation accuracy points to overfitting on the training data.
- The slight improvement in accuracy from Model 2 may not be worth the extra time spent training such a densely stacked model; however, when higher accuracy matters, Model 2 can be used.






