Akka hands-on exercise for the lecture Distributed Data Analytics.
- Password Cracking: What are the students’ clear-text passwords?
- Gene Analysis: Which student pairs share the longest gene sub-strings?
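For the gene task, one plausible reading of "longest gene sub-strings" is the classic longest common substring problem, applied to each pair of students. The sketch below is a minimal dynamic-programming version under that assumption. The class and method names (`GeneAnalysis`, `longestCommonSubstring`) are hypothetical and not taken from the exercise code.

```java
final class GeneAnalysis {

    // Returns the longest substring that occurs in both a and b.
    // On ties, the first one found while scanning a from left to right wins.
    public static String longestCommonSubstring(String a, String b) {
        // previous[j] / current[j]: length of the common suffix of
        // a[0..i-1) / a[0..i) and b[0..j).
        int[] previous = new int[b.length() + 1];
        int[] current = new int[b.length() + 1];
        int bestLength = 0;
        int bestEndInA = 0; // exclusive end index of the best match in a

        for (int i = 1; i <= a.length(); i++) {
            for (int j = 1; j <= b.length(); j++) {
                if (a.charAt(i - 1) == b.charAt(j - 1)) {
                    current[j] = previous[j - 1] + 1;
                    if (current[j] > bestLength) {
                        bestLength = current[j];
                        bestEndInA = i;
                    }
                } else {
                    current[j] = 0;
                }
            }
            // Reuse the two rows instead of allocating a full table.
            int[] swap = previous;
            previous = current;
            current = swap;
        }
        return a.substring(bestEndInA - bestLength, bestEndInA);
    }
}
```

Each comparison costs time proportional to the product of the two gene lengths, and every student pair is independent of the others, so the pairs can be spread across workers.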
The input is the file students.csv with the following columns:
- Id: Unique id of the student.
- Name: Name of the student.
- Password Hash: SHA-256 hash of the student's password. Passwords are numeric and seven digits long.
- Gene: Gene sequence of the student.
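Because the passwords are numeric with seven digits, there are only 10,000,000 candidates, so a brute-force search over the SHA-256 hashes is feasible. The following is a minimal sketch of that idea, not the exercise's reference solution. It assumes the hash column is hex-encoded and that passwords may carry leading zeros. The names `PasswordCracker`, `sha256Hex`, and `crack` are hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

final class PasswordCracker {

    // Returns the lowercase hex encoding of the SHA-256 digest of the text (UTF-8 bytes).
    public static String sha256Hex(String text) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            byte[] hash = digest.digest(text.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(hash.length * 2);
            for (byte b : hash) {
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 is not available", e);
        }
    }

    // Tries every candidate in [from, to) and returns the matching
    // zero-padded seven-digit password, or null if none matches.
    // Assumption: targetHash is hex-encoded.
    public static String crack(String targetHash, int from, int to) {
        for (int candidate = from; candidate < to; candidate++) {
            String password = String.format("%07d", candidate);
            if (sha256Hex(password).equalsIgnoreCase(targetHash)) {
                return password;
            }
        }
        return null;
    }
}
```

Calling `crack(hash, 0, 10_000_000)` searches the whole space. Taking the range as parameters means each worker can be handed a disjoint slice of the candidates.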
- Build a fat JAR using gradle shadowJar, or run from inside your IDE.
- Run the main method with the following program arguments:
  --path <path to students.csv>
  - Path to the input csv file. Required.
  --task [passwords|genes|all]
  - Which task to run. Optional, defaults to all.
  --workers <number of local workers>
  - Number of workers to spawn locally. Optional, defaults to 4.
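To make the defaults concrete, here is a small hand-rolled sketch of how these three arguments could be parsed. It is only an illustration: the actual project may use an argument-parsing library instead, and the class `ProgramArguments` and its fields are hypothetical names.

```java
import java.util.HashMap;
import java.util.Map;

final class ProgramArguments {
    public final String path;   // --path, required
    public final String task;   // --task, defaults to "all"
    public final int workers;   // --workers, defaults to 4

    private ProgramArguments(String path, String task, int workers) {
        this.path = path;
        this.task = task;
        this.workers = workers;
    }

    // Parses arguments given as "--name value" pairs.
    public static ProgramArguments parse(String[] args) {
        Map<String, String> options = new HashMap<>();
        for (int i = 0; i < args.length; i += 2) {
            if (!args[i].startsWith("--") || i + 1 >= args.length) {
                throw new IllegalArgumentException("Expected '--name value' but got: " + args[i]);
            }
            options.put(args[i].substring(2), args[i + 1]);
        }

        String path = options.get("path");
        if (path == null) {
            throw new IllegalArgumentException("--path is required");
        }
        String task = options.getOrDefault("task", "all");
        if (!task.equals("passwords") && !task.equals("genes") && !task.equals("all")) {
            throw new IllegalArgumentException("--task must be passwords, genes, or all");
        }
        int workers = Integer.parseInt(options.getOrDefault("workers", "4"));
        if (workers < 1) {
            throw new IllegalArgumentException("--workers must be at least 1");
        }
        return new ProgramArguments(path, task, workers);
    }
}
```

With only `--path students.csv` given, this sketch yields the task `all` and 4 workers, matching the defaults listed above.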