Python implementation of the Apriori, PCY, Multistage and Multihash algorithms

vaasugambhir/mining-large-datasets

Mining of massive datasets

A Python implementation of the Apriori, PCY, Multistage, and Multihash algorithms.

To run a particular algorithm, cd into its directory and run 'python index.py'. Each index.py runs every pass of that algorithm and prints the result of each pass (e.g., the item index table and the frequent k-itemsets). The given sample dataset requires no more than 3 passes, so the run stops after checking the candidate tripletons (itemsets of size 3).
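As a rough illustration of what a pass-by-pass run looks like, here is a minimal sketch of the Apriori passes, stopping at itemsets of size 3. It is not the repository's index.py: the function name `apriori`, the `SAMPLE_BASKETS` data, and the support threshold of 3 are all assumptions made up for this example.

```python
from collections import Counter
from itertools import combinations

# Hypothetical sample data; the repository ships its own dataset.
SAMPLE_BASKETS = [
    {"milk", "bread", "butter"},
    {"milk", "bread"},
    {"milk", "butter"},
    {"bread", "butter"},
    {"milk", "bread", "butter"},
]


def apriori(baskets, support, max_size=3):
    """Return {k: {itemset: count}} for each k that has frequent itemsets."""
    baskets = [frozenset(b) for b in baskets]

    # Pass 1: count every single item and keep the frequent ones.
    counts = Counter(frozenset([item]) for b in baskets for item in b)
    frequent = {s: c for s, c in counts.items() if c >= support}

    result = {}
    k = 1
    while frequent:
        result[k] = frequent
        if k == max_size:
            break
        k += 1

        prev = set(frequent)                 # frequent (k-1)-itemsets
        frequent_items = set().union(*prev)  # items that can still matter
        counts = Counter()

        # Pass k: count only candidates whose (k-1)-subsets are all frequent.
        for b in baskets:
            usable = sorted(b & frequent_items)
            for combo in combinations(usable, k):
                if all(frozenset(sub) in prev
                       for sub in combinations(combo, k - 1)):
                    counts[frozenset(combo)] += 1

        frequent = {s: c for s, c in counts.items() if c >= support}
    return result


# Print the frequent itemsets found in each pass.
for size, itemsets in apriori(SAMPLE_BASKETS, support=3).items():
    print(f"pass {size}:")
    for itemset, count in sorted(itemsets.items(), key=lambda kv: sorted(kv[0])):
        print("  ", sorted(itemset), count)
```

With this made-up data every single item and every pair is frequent, but the one candidate triple appears in only two baskets, so the third pass finds nothing. PCY, Multistage, and Multihash differ mainly in how they use hash buckets during earlier passes to shrink the set of candidate pairs.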

Reference: Mining of Massive Datasets by Anand Rajaraman and Jeffrey D. Ullman
