Excel table clustering using K-Means in Python (Machine Learning)

Author: João Nuno Carvalho

Date: 2017

License: MIT Open Source License

How to:

Pre-requisites:

Install the free Anaconda for Python 3.6.

Procedure:

1º Start with a table of data in a excel worksheet.

The row will be what you want to cluster, in the end this program creates a new column with the cluster ID at each row, next to the name (second column).

2º Save the excel file as a *.csv file. (Coma Separated Values)

3º At the start menu select the Anaconda prompt and then go to the directory were you have your csv file and code file and start the Jupyter notebooks by making the command “jupyter notebook”. In the file list, double click on the code file to open it.

4º In the program, change the name of the input file, to your CSV file, and change the number of clusters that you want to generate.

5º Execute all the cells. It will generate a new CSV file, that terminates in “K_means”.

6º Open the file in Excel and apply a filter on the new column data to see the elements of the separate clusters.

DataSet

Zoo Data Set (Artificial, 7 classes of animals)

UCI - Machine Learning Repository

Zoo Dataset

See file Excel_table_clustering_code_using_K-Means_in_Python.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
Excel_table_clustering_code_using_K-Means_in_Python.ipynb		Excel_table_clustering_code_using_K-Means_in_Python.ipynb
README.md		README.md
zoo.data.csv		zoo.data.csv
zoo.data.txt		zoo.data.txt
zoo.data.xlsx		zoo.data.xlsx
zoo.data_kmeans_7.csv		zoo.data_kmeans_7.csv
zoo.names.txt		zoo.names.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitattributes

.gitattributes

.gitignore

.gitignore

Excel_table_clustering_code_using_K-Means_in_Python.ipynb

Excel_table_clustering_code_using_K-Means_in_Python.ipynb

README.md

README.md

zoo.data.csv

zoo.data.csv

zoo.data.txt

zoo.data.txt

zoo.data.xlsx

zoo.data.xlsx

zoo.data_kmeans_7.csv

zoo.data_kmeans_7.csv

zoo.names.txt

zoo.names.txt

Repository files navigation

Excel table clustering using K-Means in Python (Machine Learning)

How to:

DataSet

About

Releases

Packages

Languages

joaocarvalhoopen/Excel_table_clustering_using_K-Means_ML

Folders and files

Latest commit

History

Repository files navigation

Excel table clustering using K-Means in Python (Machine Learning)

How to:

DataSet

About

Resources

Stars

Watchers

Forks

Languages