This project contains the implementation of a Lambda Architecture that allows to perform Twitter real-time sentiment analysis. To simulate the stream
of tweets it was used the Twitter API, accessible through the Twitter4J library.
In this scenario the architecture makes use of Apache Hadoop for the Batch Layer, Apache Storm for the Speed Layer and Apache HBase for the
Serving Layer. The system exploits the LingPipe tool kit for processing text using computational linguistics to classify tweets. Then, to show the
results it was implemented a graphical interface.
More information and implementation details are available in the paper.
The following datasets were used during the development of this project:
Get your personal Twitter Developer credentials and write them the TwitterCredentials.txt file.
After having configured correctly and started Hadoop, Storm and HBase, execute in the following order:
Classifiersetting the datasets paths and the the file in which save the classifierTopologysetting as args the keywords for the queryDriverno parameters need in argsGUIinterfaceno parameters need in args

