GitHub - ibrahimerdem/AdvancedPatternMining: A paper about sequential data mining and experiments of some algorithms.

ADVANCED PATTERN MINING: THE EVALUATION OF SEQUENTIAL PATTERN MINING ALGORITHMS

Abstract

Many algorithms have been introduced for extracting sequential patterns from a sequence database. They differ in techniques such as database scanning and support counting. Consequently, different sequential pattern mining algorithms can show different performance on data sets of different sizes. This paper measures the efficiency of three algorithms: SPADE, PrefixSpan, and CM-SPADE. For this purpose, three real data sets from the UCI Machine Learning Repository are used: 1) MSNBC, 2) Online Retail, 3) DNA Sequence, together with SPMF, an open source data mining tool specialized in pattern mining. Each algorithm has its own advantages and drawbacks: a feature that is advantageous for one type of data set can be disadvantageous for another. The results show that the runtime efficiency of an algorithm depends not only on the characteristics of the data set; the minimum support threshold also has a significant impact on processing time.

Keywords. Frequent Pattern Mining, Sequential Pattern Mining, PrefixSpan, SPADE, CM-SPADE, Efficiency of Algorithms
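
To make the terms "support counting" and "minimum support threshold" from the abstract concrete, here is a minimal sketch in Python. It is not taken from the paper or from SPMF; the function names and the toy database are hypothetical, and each sequence is simplified to a list of single items rather than itemsets. It counts support by brute force, whereas SPADE, PrefixSpan, and CM-SPADE obtain the same counts with far more efficient strategies.

```python
from typing import List, Sequence


def is_subsequence(pattern: Sequence[str], sequence: Sequence[str]) -> bool:
    """Return True if the items of `pattern` occur in `sequence` in order (gaps allowed)."""
    remaining = iter(sequence)
    # `item in remaining` advances the iterator, so order is respected.
    return all(item in remaining for item in pattern)


def support(pattern: Sequence[str], database: List[Sequence[str]]) -> int:
    """Count how many sequences of the database contain `pattern`."""
    return sum(1 for sequence in database if is_subsequence(pattern, sequence))


def is_frequent(pattern: Sequence[str], database: List[Sequence[str]], minsup: float) -> bool:
    """A pattern is frequent if its relative support reaches `minsup` (a fraction in [0, 1])."""
    return support(pattern, database) / len(database) >= minsup


# Hypothetical toy sequence database, for illustration only.
toy_db = [
    ["a", "b", "c"],
    ["a", "c"],
    ["b", "a", "c"],
    ["b", "c"],
]

print(support(["a", "c"], toy_db))           # number of sequences containing a ... c
print(is_frequent(["a", "c"], toy_db, 0.5))  # checked against a 50% threshold
print(is_frequent(["a", "b"], toy_db, 0.5))
```

Lowering `minsup` makes more candidate patterns frequent, which is one reason the threshold affects runtime as much as the data set does.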

Data Source

Preprocessed Data

Tool

SPMF
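
SPMF reads sequence databases as plain text, one sequence per line, with items given as positive integers, `-1` closing each itemset and `-2` closing the sequence. The sketch below shows how a sequence could be written in that format. The helper name `to_spmf_line` and the example data are hypothetical and do not describe the preprocessing actually used for this repository.

```python
from typing import Iterable, List


def to_spmf_line(sequence: Iterable[Iterable[int]]) -> str:
    """Encode one sequence (a list of itemsets of positive integers) as an SPMF input line."""
    parts: List[str] = []
    for itemset in sequence:
        # Items inside an itemset are written in ascending order.
        parts.extend(str(item) for item in sorted(itemset))
        parts.append("-1")  # end of itemset
    parts.append("-2")      # end of sequence
    return " ".join(parts)


# Hypothetical example: three itemsets in one sequence.
example_sequence = [[1], [3, 1, 2], [1, 3]]
print(to_spmf_line(example_sequence))
```

A file built from such lines can then be passed to SPMF as the input of a sequential pattern mining algorithm, together with a minimum support value.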

Latest commit: 08473ad by ibrahimerdem ("correction"), Sep 7, 2020 (16 commits in history)

Name	Last commit message	Last commit date
Data	data upload	Sep 7, 2020
Ceng-552-Data Mining-Advanced Pattern Mining-Sequential Pattern Mining.pdf	upload paper	Sep 7, 2020
README.md	correction	Sep 7, 2020
