Skip to content

Full term project for the exam of parallel computing, University of Florence.

Notifications You must be signed in to change notification settings

CosimoGiani/SentimentAnalysis_LambdaArchitecture

Repository files navigation

Lambda Architecture for Sentiment Analysis

This project contains the implementation of a Lambda Architecture that allows to perform Twitter real-time sentiment analysis. To simulate the stream of tweets it was used the Twitter API, accessible through the Twitter4J library.
In this scenario the architecture makes use of Apache Hadoop for the Batch Layer, Apache Storm for the Speed Layer and Apache HBase for the Serving Layer. The system exploits the LingPipe tool kit for processing text using computational linguistics to classify tweets. Then, to show the results it was implemented a graphical interface.

More information and implementation details are available in the paper.

Software requirements

Datasets

The following datasets were used during the development of this project:

Usage

Get your personal Twitter Developer credentials and write them the TwitterCredentials.txt file.
After having configured correctly and started Hadoop, Storm and HBase, execute in the following order:

  • Classifier setting the datasets paths and the the file in which save the classifier
  • Topology setting as args the keywords for the query
  • Driver no parameters need in args
  • GUIinterface no parameters need in args

Graphical User Interface

About

Full term project for the exam of parallel computing, University of Florence.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published