TimeSeriesSeedingKMeans

In this repository I've implemented Seeded KMeans, a semi-supervised clustering algorithm, applied to time series with Dynamic Time Warping (DTW) as the distance measure. The reference paper for this code is: Basu, S., Banerjee, A., & Mooney, R. (2002). Semi-supervised clustering by seeding. In Proceedings of the 19th International Conference on Machine Learning (ICML-2002).
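
As a rough illustration of the idea, and not the code shipped in this repository, the sketch below combines a textbook DTW distance with the Seeded KMeans scheme from the paper: the labelled seeds are used only to build the initial centroids, after which every series (seeds included) is reassigned freely. The function names `dtw_distance`, `mean_series` and `seeded_kmeans` are hypothetical, and updating centroids with a point-wise mean of equal-length series is an assumption made to keep the example short.

```python
import math


def dtw_distance(a, b):
    """Textbook dynamic-programming DTW with a squared point-wise cost."""
    n, m = len(a), len(b)
    prev = [math.inf] * (m + 1)
    prev[0] = 0.0  # aligning two empty prefixes costs nothing
    for i in range(1, n + 1):
        curr = [math.inf] * (m + 1)
        for j in range(1, m + 1):
            d = (a[i - 1] - b[j - 1]) ** 2
            # Extend the cheapest of: insertion, deletion, match.
            curr[j] = d + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return math.sqrt(prev[m])


def mean_series(group):
    """Point-wise mean of equal-length series (assumption: no DTW averaging)."""
    return [sum(values) / len(values) for values in zip(*group)]


def seeded_kmeans(series, seeds, max_iter=20):
    """Cluster `series` starting from `seeds`, a dict {series index: label}."""
    labels = sorted(set(seeds.values()))
    # Seeding step: one initial centroid per label, built from its seeds only.
    centroids = {
        lab: mean_series([series[i] for i, l in seeds.items() if l == lab])
        for lab in labels
    }
    assignment = None
    for _ in range(max_iter):
        # Assignment step: nearest centroid under DTW.
        new_assignment = [
            min(labels, key=lambda lab: dtw_distance(s, centroids[lab]))
            for s in series
        ]
        if new_assignment == assignment:
            break  # converged
        assignment = new_assignment
        # Update step: recompute each non-empty cluster's centroid.
        for lab in labels:
            members = [s for s, a in zip(series, assignment) if a == lab]
            if members:
                centroids[lab] = mean_series(members)
    return assignment


toy = [[0, 0, 0], [0, 1, 0], [10, 10, 10], [10, 11, 10]]
print(seeded_kmeans(toy, seeds={0: "a", 2: "b"}))  # ['a', 'a', 'b', 'b']
```

Only two of the four toy series are labelled, yet the unlabelled ones end up in the cluster of their nearest seed.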

Input File

Only .tsv files are accepted as input. Each row represents one time series: the first column holds the label of the series, and the remaining columns hold its points, separated by tabs (\t). Each .tsv file should be placed inside a folder with the same name as the file.
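
For reference, a file in this layout can be read with the standard library alone. The snippet below is a minimal sketch, not the loader used by the repository; `load_tsv` is a hypothetical helper, and keeping the labels as strings is an assumption.

```python
import csv


def load_tsv(path):
    """Read a tab-separated file: first column is the label, the rest are points."""
    labels, series = [], []
    with open(path, newline="") as handle:
        for row in csv.reader(handle, delimiter="\t"):
            if not row:
                continue  # skip blank lines
            labels.append(row[0])  # label kept as a string (assumption)
            series.append([float(value) for value in row[1:]])
    return labels, series
```

Calling `load_tsv` on such a file returns two parallel lists: one with the labels and one with the series as lists of floats.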

Parameters

The algorithm requires only one parameter: the percentage of labels to use as seeds when starting the computation. The percentage can be changed via the trainFeatDataset variable.
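
To make the role of this parameter concrete, here is a small sketch of how a share of the labels could be turned into seeds. It is not taken from the repository: `pick_seeds` is a hypothetical helper, the share is expressed as a fraction between 0 and 1 (how trainFeatDataset is actually scaled is not stated here), and sampling per class with at least one seed each is an assumption made so that every cluster gets an initial centroid.

```python
import random


def pick_seeds(labels, fraction, rng_seed=0):
    """Return {series index: label} for a random share of each class."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    rng = random.Random(rng_seed)  # fixed seed for reproducible runs
    by_class = {}
    for index, label in enumerate(labels):
        by_class.setdefault(label, []).append(index)
    seeds = {}
    for label, indices in by_class.items():
        # Keep at least one seed per class (assumption of this sketch).
        count = max(1, round(fraction * len(indices)))
        for index in rng.sample(indices, count):
            seeds[index] = label
    return seeds


labels = ["a"] * 10 + ["b"] * 10
seeds = pick_seeds(labels, fraction=0.2)
print(len(seeds))  # 4: two seeds for each of the two classes
```

The remaining series are treated as unlabelled during clustering, so their labels can be held back for evaluation.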

Issues

At the moment, the code doesn’t accept time series of different lengths. This will be fixed ASAP.
