Skip to content

dtreb/spark-folder-monitor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark streaming usage to monitor local folder

Info

This is an example that demonstrates usage of:

  1. Spark streaming and embedded Spark instance.
  2. Different spark data analysis approaches.
  3. Commons CLI to parse parameters.

Run

  1. Build project with $ mvn package.
  2. Locate created spark-folder-monitor-xxx-jar-with-dependencies.jar in target folder.
  3. Run $ java -jar spark-folder-monitor-xxx-jar-with-dependencies.jar --help to get information about available parameters.
  4. Run $ java -jar spark-folder-monitor-xxx-jar-with-dependencies.jar to use default parameters (check monitor folder, display 10 items, use 10 seconds interval).
  5. Move some text files to monitored folder.
  6. Check console output. You should see analysis results - top used words, longest line etc.

Feel free to use, comment or collaborate.

Releases

No releases published

Packages

No packages published

Languages