Clustering benchmarks

Datasets

This project contains collection of labeled clustering problems that can be found in the literature. Most of datasets were artificially created.

The benchmark includes:

Artificial data

Experiments

This project contains set of clustering methods benchmarks on various dataset. The project is dependent on Clueminer project.

in order to run benchmark compile dependencies into a single JAR file:

mvn assembly:assembly

Consensus experiment

allows running repeated runs of the same algorithm:

./run consensus --dataset "triangle1" --repeat 10

by default k-means algorithm is used.

For available datasets see resources folder.

Name		Name	Last commit message	Last commit date
Latest commit History 129 Commits
src		src
.gitignore		.gitignore
README-old.asc		README-old.asc
README.md		README.md
consensus		consensus
evolve-sc		evolve-sc
nb-configuration.xml		nb-configuration.xml
pom.xml		pom.xml
run		run
updreadme.rb		updreadme.rb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src

src

.gitignore

.gitignore

README-old.asc

README-old.asc

README.md

README.md

consensus

consensus

evolve-sc

evolve-sc

nb-configuration.xml

nb-configuration.xml

pom.xml

pom.xml

run

run

updreadme.rb

updreadme.rb

Repository files navigation

Clustering benchmarks

Datasets

Artificial data

Experiments

Consensus experiment

About

Releases

Packages

Contributors 2

Languages

deric/clustering-benchmark

Folders and files

Latest commit

History

Repository files navigation

Clustering benchmarks

Datasets

Artificial data

Experiments

Consensus experiment

About

Resources

Stars

Watchers

Forks

Languages