Sentiment-Analysis-CNN

Text classification using Convolutional Neural Networks

Dataset Description

The dataset contains reviews from the following 3 websites, Amazon, Imdb, Yelp. There are 1000 reviews for each website, 500 of which are positive. In total we have 1500 positive reviews and 1500 negative reviews.

Update: Added new dataset consisting of movie reviews with 5531 positive training examples and 5531 negative training examples.

Model Description

The model takes inspiration from the paper, "Sentence Classification using Convolutional Neural Networks" by Yoon Kim. Paper

Kim CNN:

CONVOLUTIONAL LAYER

Multiple filters of varying window size convolved over each training example.
Each filter generates a feature map.
Filters with different window sizes capture context and relation between words.

MAX-POOLING LAYER

Max-pooling operation performed on each feature map to get one feature per filter.
The idea is to capture the most important feature necessary for classification.
Naturally deals with variable length sentences

FULLY CONNECTED LAYER

SOFTMAX LAYER

Probability distribution over labels obtained.

Hyperparameter Tuning

Tunable hyperparameters:

Word vector size (embedding size)
Sequence length (after padding or truncation)
Filter sizes
Number of filters of each type
Learning rate
Reguralization constant
Num_epochs
minibatch_size

Learning Curves

1)Batch size too small. 2)Near optimal hyperparameters

Results

Instructions for use:

Data_prepartion.ipynb used to convert tab separated data into csv format [sentence,category].
Final_Code_CNN.ipynb uses csv input as mentioned above and trains the CNN model.
CNN_code_raw folder contains older versions of the Final_Code_CNN with various intermediate blocks to print output for better visualization, understanding and debugging. (Final_Code_CNN contains only necessary blocks to train the model.)
If trying to reproduce results on the same dataset, no need to run Data_preparation.ipynb, data in proper format already present in Processed_Data folder.

More work to be done

Domain adaptation: I plan to train the model using data from a specific domain like movie reviews and then test it on reviews from other domains like Electronics, Food, Travel etc.
Work on CNN + LSTM. Both models will be trained independently and the output will be concatenated.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
.ipynb_checkpoints		.ipynb_checkpoints
CNN_code_raw		CNN_code_raw
Datasets		Datasets
Learning_curves		Learning_curves
Processed_Data		Processed_Data
Results		Results
__MACOSX		__MACOSX
Data_preparation.ipynb		Data_preparation.ipynb
Final_Code_CNN.ipynb		Final_Code_CNN.ipynb
LICENSE		LICENSE
README.md		README.md

License

Shubhammawa/Sentiment-analysis-cnn

Folders and files

Latest commit

History

Repository files navigation

Sentiment-Analysis-CNN

Dataset Description

Model Description

CONVOLUTIONAL LAYER

MAX-POOLING LAYER

FULLY CONNECTED LAYER

SOFTMAX LAYER

Hyperparameter Tuning

Learning Curves

Results

Instructions for use:

More work to be done

About

Topics

Resources

License

Stars

Watchers

Forks

Languages