Skip to content

jakeeqsb/EntreeETLPipeline_spark_streaming

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

EntreeETLPipeline_spark_streaming

screen shot 2018-04-03 at 2 07 46 am

To run the program

1. Run the zookeeper

zkServer start

2. Run the kafka

kafka-server-start <... server.properties>

3. Run the producer script

python3 EntreeKafkaProd.py

4. Run the pipeline script

spark-submit --jars <jars...> EntreePipeline.py