自動最佳化 Kmeans (Self‑optimizing Kmeans)

介紹 Introdution

K-means 利用距離公式（本篇採用歐式距離）將 m 維的 n個資料點分群。

這裡提供兩種最佳化分群 K 值的方法（GAP Statistic, Silhouette Coefficient），

自動將最佳 K 值帶入程式，提供最佳分群選擇。

K-means use distantce formula convert m dim, n data point clustered.

We offer two optimal method to search K (cluster value),

auto input Best-K to Kmeans algorithm, and show best cluster result.

用法 Usage

python Optimal_Kmeans.py <optional> -m (gap or sil)

輸出入範例 input and output example

輸入 input

一個 N*M維矩陣的 CSV 檔案。

A N*M dim martrix in CSV file.

輸出 output

1.cluster each dim mean

2.cluster each dim median

3.cluster each dim standard deviation

4.point of this cluster numbers

5.point of this cluster ratio

最佳 K 值選擇 Best-K chosen

Acknowledgement

Gap statistic source code from:

https://medium.com/@pahome.chen/clustering%E6%B1%BA%E5%AE%9A%E5%88%86%E7%BE%A4%E6%95%B8%E7%9A%84%E6%96%B9%E6%B3%95-abedc1d81ccb

Contact

Further information please contact me.

wuyiulin@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
img		img
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Optimal_Kmeans.py		Optimal_Kmeans.py
Readme.md		Readme.md
method.py		method.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

自動最佳化 Kmeans (Self‑optimizing Kmeans)

介紹 Introdution

用法 Usage

輸出入範例 input and output example

輸入 input

輸出 output

最佳 K 值選擇 Best-K chosen

Acknowledgement

Contact

About

Releases

Packages

Languages

License

wuyiulin/OptimalKmeans

Folders and files

Latest commit

History

Repository files navigation

自動最佳化 Kmeans (Self‑optimizing Kmeans)

介紹 Introdution

用法 Usage

輸出入範例 input and output example

輸入 input

輸出 output

最佳 K 值選擇 Best-K chosen

Acknowledgement

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages