PrefixSpan is a frequent pattern mining algorithm described in Pei et al., Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach.
This repository is adapted/forked from a project did by Tianlong and Jens in a course from Professor Volker Markl's group DIMA at TU Berlin. The project is to implement PrefixSpan algorithm in Apach Flink, tune the cluster, and compare the result against Apache Spark.