Data Pipeline - Twitter Sentiment Analysis

The goal of this project is to extract tweets about various electronic products using the Tweepy Python library. Once the tweets are collected, we apply a pre-trained NLP model called ReBERTa to analyze the sentiment of each tweet. The sentiment analysis results are categorized as negative, neutral, or positive.

The tweets and their corresponding sentiment analysis results are then sent to a Kafka topic. The consumer of the Kafka topic in our case is an ETL tool called Logstash. The role of Logstash is to transform the messages received from the Kafka topic into a JSON format and load the documents into Elasticsearch.

Finally, the data stored in Elasticsearch is visualized on a Kibana dashboard.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
dd.png		dd.png
logstash.conf		logstash.conf
streaming.py		streaming.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Pipeline - Twitter Sentiment Analysis

About

Releases

Packages

Languages

oussafik/Data-Pipeline-Twitter-Sentiment-Analysis-ELK_STACK

Folders and files

Latest commit

History

Repository files navigation

Data Pipeline - Twitter Sentiment Analysis

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages