All 12 datasets used in Categorical Data Clustering via Value Order Estimated Distance Metric Learning can be found in the Data folder. Additionally, six supplementary datasets are included in the Supp_data folder.
All code is written in Matlab 2022a. Each section serves the following purpose:
- The
Metricsfolder contains code for evaluating clustering performance. - The
OCL_algfolder details the code of the OCL algorithm.
The script information in the OCL_alg file are as follows:
- The
initialization.mandorder_initial.mfiles initializeQandorder, respectively. - The
OCL_main.mfile is the main component of the OCL algorithm and executes the outer loop. - The
INOCL.mfile implements the inner loop function of the OCL algorithm. - The
Order_choose.mfile and theOrder_reset.mfile are responsible for learning the order and modifying the order of the original dataset, respectively.
Finally, the Execute_Clustering.m file executes the entire OCL algorithm on the provided dataset.
- Ensure that the
OCL_Source_Codefolder has been added to the execution path by right-clicking the folder, selectingAdd to Path, and then left-clickingSelected Folders and Subfolders. - Run the
Execute_Clustering.mfile, and the results will be displayed in the command line window.