Predicting the Semantic Orientation of Communication over Social Networking.

The world needs a channel through which any post, message, tweet, or comment is scrutinized before it is broadcast.

Abstract

This project aims to develop a state-of-the-art sentiment analysis system that detects the sentiment of short messages such as tweets, SMS, and Facebook posts and comments (message-level task), as well as the sentiment of a word or phrase within a message (term-level task). The system is based on a supervised statistical text classification approach that leverages a variety of surface-form, semantic, and sentiment features. The sentiment features are derived primarily from large sentiment lexicons, which are collected explicitly from cyberspace: sentiment-bearing hashtags, plain words, and emoticons found on Facebook, Instagram, and other social media. To adequately capture the sentiment of words in negated contexts, a separate sentiment lexicon is generated for negated words.
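The negated-context handling described in the abstract can be illustrated with a minimal sketch in C. It assumes one common convention: every token that follows a negator, up to the next clause-ending punctuation mark, is tagged with a `_NEG` suffix so that it can be looked up in the separate negated-word lexicon. The negator list, the `_NEG` suffix, and the function names are hypothetical and are not taken from the project's code.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical negator list (lowercase only); a real system would use a fuller one. */
static const char *NEGATORS[] = {"not", "no", "never", "cannot"};

static int is_negator(const char *tok) {
    size_t n = sizeof NEGATORS / sizeof NEGATORS[0];
    for (size_t i = 0; i < n; i++)
        if (strcmp(tok, NEGATORS[i]) == 0) return 1;
    return 0;
}

/*
 * Writes a copy of `text` to `out` in which every token inside a negated
 * context carries a "_NEG" suffix. A negated context starts after a negator
 * and ends at a token that ends in . , ; ! or ? (that punctuation is dropped).
 * Input longer than 511 characters is truncated in this sketch.
 */
void mark_negation(const char *text, char *out, size_t out_size) {
    char buf[512];
    int negated = 0;
    size_t used = 0;

    if (out_size == 0) return;
    out[0] = '\0';
    strncpy(buf, text, sizeof buf - 1);
    buf[sizeof buf - 1] = '\0';

    for (char *tok = strtok(buf, " "); tok != NULL; tok = strtok(NULL, " ")) {
        size_t len = strlen(tok);
        int closes = 0;

        /* Strip trailing punctuation and remember that the clause ended. */
        while (len > 0 && strchr(".,;!?", tok[len - 1]) != NULL) {
            tok[--len] = '\0';
            closes = 1;
        }
        if (len > 0) {
            int tag = negated && !is_negator(tok);
            int n = snprintf(out + used, out_size - used, "%s%s%s",
                             used ? " " : "", tok, tag ? "_NEG" : "");
            if (n < 0 || (size_t)n >= out_size - used) {
                out[used] = '\0';   /* output buffer full: drop the partial token */
                break;
            }
            used += (size_t)n;
            if (is_negator(tok)) negated = 1;
        }
        if (closes) negated = 0;
    }
}
```

With this sketch, the text `I do not like this movie, but the music is good` becomes `I do not like_NEG this_NEG movie_NEG but the music is good`, so `like_NEG` can receive a different score from `like`.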

Derived Classifier.

The classifier is a Naive Bayes (NB) classifier, derived from Bayes' rule: P(c | d) = P(c) P(d | c) / P(d), where c is a sentiment class and d is the message being classified.
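As a rough illustration of how Bayes' rule turns into a classifier, the sketch below computes P(c | d) for two classes from toy word counts. It assumes the usual naive independence assumption, P(d | c) as a product of per-word likelihoods, with add-one (Laplace) smoothing. The vocabulary, the counts, and all function names are invented for illustration and do not come from the project's datasets or code. A production system would normally sum log-probabilities instead of multiplying, to avoid underflow on long messages.

```c
#include <string.h>

#define NUM_CLASSES 2   /* 0 = negative, 1 = positive */
#define VOCAB_SIZE  4

/* Toy training statistics, invented for illustration only. */
static const char *VOCAB[VOCAB_SIZE] = {"good", "great", "bad", "awful"};
static const int WORD_COUNT[NUM_CLASSES][VOCAB_SIZE] = {
    {1, 0, 4, 3},   /* word counts in negative messages */
    {5, 3, 1, 0},   /* word counts in positive messages */
};
static const int DOC_COUNT[NUM_CLASSES] = {10, 10};

static int vocab_index(const char *word) {
    for (int i = 0; i < VOCAB_SIZE; i++)
        if (strcmp(word, VOCAB[i]) == 0) return i;
    return -1;   /* unknown word */
}

/* P(w | c) with add-one smoothing. */
static double word_likelihood(int w, int c) {
    int total = 0;
    for (int i = 0; i < VOCAB_SIZE; i++) total += WORD_COUNT[c][i];
    return (WORD_COUNT[c][w] + 1.0) / (total + VOCAB_SIZE);
}

/* Fills post[c] with P(c | d) = P(c) P(d | c) / P(d) for each class c. */
void nb_posterior(const char *tokens[], int n, double post[NUM_CLASSES]) {
    int total_docs = 0;
    double evidence = 0.0;   /* P(d), the sum of the joint over all classes */

    for (int c = 0; c < NUM_CLASSES; c++) total_docs += DOC_COUNT[c];
    for (int c = 0; c < NUM_CLASSES; c++) {
        double joint = (double)DOC_COUNT[c] / total_docs;   /* prior P(c) */
        for (int i = 0; i < n; i++) {
            int w = vocab_index(tokens[i]);
            if (w >= 0) joint *= word_likelihood(w, c);     /* skip unknown words */
        }
        post[c] = joint;
        evidence += joint;
    }
    for (int c = 0; c < NUM_CLASSES; c++) post[c] /= evidence;
}

/* Returns the class with the highest posterior probability. */
int nb_classify(const char *tokens[], int n) {
    double post[NUM_CLASSES];
    int best = 0;
    nb_posterior(tokens, n, post);
    for (int c = 1; c < NUM_CLASSES; c++)
        if (post[c] > post[best]) best = c;
    return best;
}
```

Under these toy counts, `nb_classify` returns 1 (positive) for the tokens `good great` and 0 (negative) for `bad awful`. Because P(d) is the same for every class, the division only normalizes the scores and does not change which class wins.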

System Requirements

Hardware Requirements:
- Processor: Intel(R) Core(TM)2 Quad CPU Q8400 @ 2.66GHz
- RAM: 2GB (operating system + software)
- Hard disk: 1GB (datasets)
Software Requirements:
- Linux Operating System (Recommended)
- GCC (GNU Compiler Collection)
- GTK (GIMP Toolkit)
- Glade (a user interface designer)

Compilation & Execution:

$ sudo apt-get install libgtk-3-dev
$ gcc -o <output-file-name> <source-file-name>.c -Wall `pkg-config --cflags --libs gtk+-3.0` -rdynamic
$ ./<output-file-name>

BlackBox Functioning:

Enter some text (a post, message, comment, tweet, or anything else), and the software predicts the sentiment of that text.
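The text-in, label-out behaviour can be pictured with the following minimal sketch. It is a simplified stand-in that sums scores from a tiny hand-written lexicon rather than running the trained classifier. The lexicon entries, the scores, and the function names are hypothetical, and tokens are split on whitespace only, so a word with attached punctuation such as `good.` is not matched.

```c
#include <ctype.h>
#include <string.h>

/* Hypothetical mini-lexicon: a token and its sentiment score. */
struct lexicon_entry { const char *word; int score; };
static const struct lexicon_entry LEXICON[] = {
    {"love", 2}, {"good", 1}, {":)", 1},
    {"hate", -2}, {"bad", -1}, {":(", -1},
};

static int lexicon_score(const char *tok) {
    size_t n = sizeof LEXICON / sizeof LEXICON[0];
    for (size_t i = 0; i < n; i++)
        if (strcmp(tok, LEXICON[i].word) == 0) return LEXICON[i].score;
    return 0;   /* token not in the lexicon */
}

/* Returns "positive", "negative", or "neutral" for the given text. */
const char *predict_sentiment(const char *text) {
    char buf[512];
    int score = 0;
    size_t i;

    /* Work on a lowercase copy; input beyond 511 characters is ignored. */
    for (i = 0; text[i] != '\0' && i < sizeof buf - 1; i++)
        buf[i] = (char)tolower((unsigned char)text[i]);
    buf[i] = '\0';

    for (char *tok = strtok(buf, " \t\n"); tok != NULL; tok = strtok(NULL, " \t\n"))
        score += lexicon_score(tok);

    if (score > 0) return "positive";
    if (score < 0) return "negative";
    return "neutral";
}
```

For example, `predict_sentiment("I LOVE this phone :)")` yields `positive`, while a text with no lexicon words yields `neutral`.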

Dataset Collection:

Mendeley Datasets: A collection of millions of entries, organized by sentiment, for Hindi and English.
MT: A large collection of sentences with their respective words.
Stanford dataset: A dataset for binary sentiment classification containing substantially more data than previous benchmark datasets. It provides 25,000 highly polar movie reviews for training and 25,000 for testing, plus additional unlabeled data. Raw text and preprocessed bag-of-words formats are provided. See the README file contained in the release for more details.

Contribution:

Fork the repository.
Clone the project to your machine.
Add relevant data and push it as a new branch.
Add your name and profile address to the Developers and Contributors list below.

Developers and Contributors:

Harshal Mittal
Shubhangi Srivastava
Mansi Sahu
