Read events from topic, apply function on downloaded bundles

kshakhmatova/hadoop-kafka

hadoop-kafka

Purpose:

  1. Read events from a Kafka topic and return only those of the specified type (default: AvailableForProcessing)
  2. Download the URI (using cURL), apply the specified function, return the function's result, and clean up the downloaded files (enabled by default)
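
The download, apply, clean-up cycle in step 2 can be sketched as follows. This is a minimal illustration and not the project's actual code: `BundleSketch`, `curlFetch`, `withDownloaded`, and the `cleanup` and `fetch` parameters are hypothetical names, and it assumes a `curl` binary is available on the PATH.

```scala
import java.io.File
import java.nio.file.Files
import scala.sys.process._

object BundleSketch {
  // Hypothetical default fetcher: shells out to cURL.
  // -f fails on HTTP errors, -sS is quiet but still prints errors, -L follows redirects.
  def curlFetch(uri: String, target: File): Unit = {
    val exit = Seq("curl", "-fsSL", "-o", target.getAbsolutePath, uri).!
    if (exit != 0) throw new RuntimeException(s"curl exited with code $exit for $uri")
  }

  // Downloads `uri` into a temporary file, applies `f` to it, and returns f's result.
  // The temporary file is deleted afterwards unless `cleanup` is false.
  def withDownloaded[A](
      uri: String,
      cleanup: Boolean = true,
      fetch: (String, File) => Unit = curlFetch
  )(f: File => A): A = {
    val target = Files.createTempFile("bundle-", ".tmp").toFile
    try {
      fetch(uri, target)
      f(target)
    } finally {
      if (cleanup) target.delete()
    }
  }
}
```

Taking the fetcher as a parameter keeps the cURL call replaceable, so the function can be exercised without network access.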

Prerequisites:

You can supply your own function in the Main method in place of the default someFunc(f: File).
You can change which event type getKafkaEvents in Consumer returns; all available types are listed in EventTypes.
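
As a rough illustration of these two customization points, the sketch below filters events by type and defines a replacement for someFunc(f: File). Everything here is an assumption made for the example: the `EventType` hierarchy, the `Event` case class, `eventsOfType`, and `fileSize` are hypothetical stand-ins, and the project's real EventTypes and Consumer may look different.

```scala
import java.io.File

object CustomizationSketch {
  // Hypothetical stand-in for the project's EventTypes.
  // Only AvailableForProcessing is named in this README; the other case is a placeholder.
  sealed trait EventType
  case object AvailableForProcessing extends EventType
  case object PlaceholderOtherType extends EventType

  // Hypothetical event shape: a type plus the URI of the bundle to download.
  final case class Event(eventType: EventType, uri: String)

  // Keeps only events of the wanted type, defaulting to AvailableForProcessing.
  def eventsOfType(events: Seq[Event], wanted: EventType = AvailableForProcessing): Seq[Event] =
    events.filter(_.eventType == wanted)

  // One possible replacement for someFunc(f: File): report the file size in bytes.
  def fileSize(f: File): Long = f.length()
}
```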

To run the program:

  • run sbt assembly to build the jar
  • run the jar with parameters (the topic name to read from and the Kafka address as IP:port), for example:
    java -jar hadoop-kafka.jar --topic TopicName --kafkaIpPort localhost:9092
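
For reference, the two command-line parameters above could be read roughly as in the sketch below. This is only an illustration of the flag/value pairing: `ArgsSketch`, `Config`, and `parse` are hypothetical names, and the project may well use an argument-parsing library instead.

```scala
object ArgsSketch {
  // Hypothetical holder for the two parameters shown in the example command.
  final case class Config(topic: String, kafkaIpPort: String)

  // Pairs each "--flag" with the value that follows it and checks that both flags are present.
  def parse(args: Array[String]): Either[String, Config] = {
    val pairs = args
      .grouped(2)
      .collect { case Array(k, v) if k.startsWith("--") => k.drop(2) -> v }
      .toMap
    for {
      topic <- pairs.get("topic").toRight("missing --topic")
      addr  <- pairs.get("kafkaIpPort").toRight("missing --kafkaIpPort")
    } yield Config(topic, addr)
  }
}
```

Returning an `Either` lets the caller print a usage message when a flag is missing, rather than failing later with an exception.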
