Skip to content

antonrud/flink-frequent-patterns

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Mining Frequent Patterns

Homework for DWBI class. TU Berlin, winter term 2018/2019

What it does

This application performs mining of frequent patterns from a dataset of retail transactions. In particular this program is an implementation of ECLAT algorithm (Equivalence Class Transformation) - a depth-first search algorithm based on set intersection.

The tasks are performed within Apache Flink execution environment as batch job and mostly in parallel.

How to use

  1. Download data set from here: https://tubcloud.tu-berlin.de/s/ZtfgnxMCZ5cjJf8 (or mirror)
  2. Place it under src/main/resources
  3. Run main() method in ECLATJob class
  4. Find results in src/main/resources

Built with

Autors

Releases

No releases published

Packages

No packages published

Languages