GitHub - Azer0s/kafka-spark-streaming-demo: A simple Spark Streaming demo with Kafka

kafka-spark-streaming-demo

This application streams tweets (in the aggregator written in Ruby) and produces Kafka messages (yes, I know...Spark Streaming is able to stream from Twitter directly...but I wanted to test Spark Streaming with Kafka). The Spark Streaming job looks for hashtags and users and creates relationships between them in Neo4j.

In Neo4j this looks like so:

aggregator environment variables

TWITTER_CONSUMER_KEY="<your Twitter developers consumer key>"
TWITTER_CONSUMER_SECRET="<your Twitter developers consumer secret>"
TWITTER_ACCESS_TOKEN="<your Twitter developers access token specific to your 'app'>"
TWITTER_ACCESS_TOKEN_SECRET="<your Twitter developers access token secret specific to your 'app'>"
KAFKA_BROKERS="<Kafka brokers, comma seperated>"
KAFKA_CLIENT_ID="<Kafka client id>"
KAFKA_TOPIC="twitter-news-topic"

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
aggregator		aggregator
assets		assets
docker		docker
spark-hashtags		spark-hashtags
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

aggregator

aggregator

assets

assets

docker

docker

spark-hashtags

spark-hashtags

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

kafka-spark-streaming-demo

aggregator environment variables

About

Releases

Packages

Languages

License

Azer0s/kafka-spark-streaming-demo

Folders and files

Latest commit

History

Repository files navigation

kafka-spark-streaming-demo

aggregator environment variables

About

Resources

License

Stars

Watchers

Forks

Languages