Stratosphere PACT Program to run statistics over Tweets
It parses tweets records, cleaning the texts, computing sentiment analysis and collecting statistics about words popularity evolution over time.
It uses also a customized version of SentiStrength, which needs to be downloaded separately from the official website. SentiStrength Data dir is also needed, and the flow should be configured accordingly.
Configuration is present in the maven file.