Skip to content

Spark hands-on exercise for the lecture Distributed Data Analytics

Notifications You must be signed in to change notification settings

fawind/spark-examples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark Examples

Spark hands-on exercise for the lecture Distributed Data Analytics.

Task

Usage

  1. Build a fatjar using sbt assembly
  2. Run the main method with the following program arguments:
    • --path <path to folder> - Path to the folder containing the dataset csv files. Optional, defaults to ./TPCH.
    • --paths <fileA,fileB,fileC> - Direct path to the dataset files seperated by comma. Optional, defaults to --path argument.
    • --cores <number of cores> - Number of local cores to use. Optional, defaults to 4.

About

Spark hands-on exercise for the lecture Distributed Data Analytics

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages