Skip to content

Baselines

Vanessa Tschichold edited this page May 26, 2020 · 4 revisions

Business Perspective:

What other readily available option would a company or private customer have to do twitter sentiment analysis?

Sentiment 140

Based on (apparently) first paper on twitter sentiment analysis by some standford people. One needs to register and can then send a .txt file to be classified from the command line.

Google API

Should be one of the better available API's for general sentiment analysis. Hard to gauge how much time it would take to score our test dataset with it. Probably boils down to creating a google cloud account and writing a short script.

MonkeyLearn

I think it's the newer Sentiment 140. Looks pretty nice.

Academic Perspective:

What other approaches have been proposed in the literature?

Coding Exercise in Series 6

Compute glove embeddings for each token, average tokens of one tweet. Classify the obtained feature vector using logistic regression, SVM. Cloud probably be done in flair to reduce work and possibility of bugs.

Simple Bert Classifier

As a "harder" baseline we could use the simple pretrained bert classifier. Would distinguish more clearly between our implemented improvements and the existing state of the art.

Sources: