Big Data Technology Project

Twitter-Kafka-Spark-HBase Integration

Fetch a stream of tweets from twitter, push it to Kafka, ingest the stream into Spark Stream and save it to Hive

Getting Started

1. Kakfa Installation

Download and extract Kafka to your machine from https://downloads.apache.org/kafka/3.2.1/kafka_2.13-3.2.1.tgz
Extra it and locate it to your home directory, added the bin location to your system environment.

2. Runing Kafka

run *.sh on linux Or *.bat on windows to make sure that no existsing kafka
run ZooKeeper, after you run zookeeper you should get.

./zookeeper-server-start.sh ../config/zookeeper.properties

binding to port 0.0.0.0/0.0.0.0:2181
run Kafka Server, after you run kafka-server you should get.

./kafka-server-start.sh ../config/server.properties

Started socket server acceptors and processors

3. Create Kafka Topic

Creating kafka topic where we receive our stream.

./kafka-topics.sh --create --topic=twitter-topic-new --bootstrap-server localhost:9092 --replication-factor=1 --partitions=1

Created topic twitter-topic-new.

To list current topics (Testing)

./kafka-topics.sh --list --bootstrap-server localhost:9092

4. Test Kafka Consumer (Testing)

./kafka-console-consumer.sh --bootstrap-server localhost:9092 --topic twitter-topic-new --from-beginning

Tools

Kafka Kafka

References

Kafka Documentation Kafka Documentation

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Kafka-Prod		Kafka-Prod
SparkConsumer		SparkConsumer
.gitignore		.gitignore
Commands.txt		Commands.txt
Hive Table Creation.txt		Hive Table Creation.txt
Integration Power – CS523.pptx		Integration Power – CS523.pptx
LICENSE		LICENSE
README.md		README.md
hive-apache-jdbc-0.13.1-9.jar		hive-apache-jdbc-0.13.1-9.jar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Big Data Technology Project

Twitter-Kafka-Spark-HBase Integration

Getting Started

1. Kakfa Installation

2. Runing Kafka

3. Create Kafka Topic

4. Test Kafka Consumer (Testing)

Tools

References

About

Releases

Packages

Languages

License

mohamedsaleh1984/twitter-spark

Folders and files

Latest commit

History

Repository files navigation

Big Data Technology Project

Twitter-Kafka-Spark-HBase Integration

Getting Started

1. Kakfa Installation

2. Runing Kafka

3. Create Kafka Topic

4. Test Kafka Consumer (Testing)

Tools

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages