Skip to content

This project analyzes tweets about electronic products using ReBERTa, Kafka, Logstash, Elasticsearch, and Kibana for sentiment analysis and data visualization.

Notifications You must be signed in to change notification settings

oussafik/Data-Pipeline-Twitter-Sentiment-Analysis-ELK_STACK

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data Pipeline - Twitter Sentiment Analysis

Data pilpeline architecture

The goal of this project is to extract tweets about various electronic products using the Tweepy Python library. Once the tweets are collected, we apply a pre-trained NLP model called ReBERTa to analyze the sentiment of each tweet. The sentiment analysis results are categorized as negative, neutral, or positive.

The tweets and their corresponding sentiment analysis results are then sent to a Kafka topic. The consumer of the Kafka topic in our case is an ETL tool called Logstash. The role of Logstash is to transform the messages received from the Kafka topic into a JSON format and load the documents into Elasticsearch.

Finally, the data stored in Elasticsearch is visualized on a Kibana dashboard.

About

This project analyzes tweets about electronic products using ReBERTa, Kafka, Logstash, Elasticsearch, and Kibana for sentiment analysis and data visualization.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages