Skip to content

Solutions for the hands-on sessions of the course "Distributed Data Analytics" at Hasso-Plattner-Institute using Akka and Spark.

Notifications You must be signed in to change notification settings

WGierke/distributed_data_analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

Distributed Data Analytics

Solutions for the hands-on sessions on Akka and Spark

1. Akka

The task was to crack hashes and find longest substrings. The solution can be obtained using the final jar by executing
java -jar akka-cracka.jar --path path/to/students.csv

2. Spark

The task was to perform Inclusion Dependency Discovery using Spark. The solution can be obtained using the final jar by executing
java -jar fINDer.jar --path path/to/TPCH --cores NUMBER_OF_CORES