sarcasm-detector-by-CNN

It predicts whether a given statement is sarcastic or not.
It uses neural network made of Convolutional1D layers, maxpooling, dropout and pretained GloVe embeddings in Embedding layer at first.

GloVe word embeddings can be downloaded at https://nlp.stanford.edu/projects/glove/
I used this one : http://nlp.stanford.edu/data/glove.6B.zip (6B tokens, 400K vocab, uncased, 50d, 100d, 200d, & 300d vectors, 822 MB download)
Little data preprocessing is done to make data cleaner.
Most of the code and some preprocessing is in first.ipynb. Little preprocessing is in second.ipynb too.

Possible future improvements

Data Preprocessing still can be done to get more accurate results. We can remove statements which are very difficult to predict. by doing that, we are giving a clean, more classified data to our neural network. And results will be better.

Structure of neural network also can be modified. Adding some more Conv1d layers won't cause any harm !! (But yes, it may become computationally expensive. As it will take longer time for training. And burn more GPU power too !!)

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.directory		.directory
README.md		README.md
Train_v1.tsv		Train_v1.tsv
data_handler.py		data_handler.py
emoji_unicode_names_final.txt		emoji_unicode_names_final.txt
first.ipynb		first.ipynb
first.py		first.py
neg.txt		neg.txt
neg1.txt		neg1.txt
norm_text_rm-stop.txt		norm_text_rm-stop.txt
norm_text_rm-stop11.txt		norm_text_rm-stop11.txt
norm_text_rm-stopdemo1.txt		norm_text_rm-stopdemo1.txt
pos.txt		pos.txt
pos1.txt		pos1.txt
preprocessing.ipynb		preprocessing.ipynb
simple_2xdense_acc64.h5		simple_2xdense_acc64.h5
simple_acc62.h5		simple_acc62.h5
simple_withdropout_acc64.h5		simple_withdropout_acc64.h5
word_list.txt		word_list.txt
word_list_freq.txt		word_list_freq.txt
word_split.txt		word_split.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

sarcasm-detector-by-CNN

Possible future improvements

About

Releases

Packages

Languages

prashant-kikani/sarcasm-detector-by-CNN

Folders and files

Latest commit

History

Repository files navigation

sarcasm-detector-by-CNN

Possible future improvements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages