HPML-KAIS

Repository for the Journal Paper "Multi-Label Classification with Label Clusters"

How to cite this repository

PRE-PRINTS

RESULTS

Resulting Files

Here you can download the generated partitions, the best selected partitions, and the results of our experiments. If you prefer to generate them yourself from scratch, please check the source code section.

Generated Partitions

10 Folds Cross Validation
All Possible Partitions (Bell Number)
HPML-J
HPML-K
Random 1
Random 2
Random 3
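
As a rough illustration of why the "All Possible Partitions" set is tied to the Bell number, the sketch below counts and enumerates every partition of a small label set. It is not taken from the repository (whose code is in R); the function names `bell_number` and `set_partitions` and the label names are hypothetical.

```python
from typing import Iterator, List, Sequence


def bell_number(n: int) -> int:
    """Number of ways to partition a set of n elements (Bell triangle)."""
    row = [1]
    for _ in range(n):
        # Each new row starts with the last entry of the previous row.
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        row = new_row
    return row[0]


def set_partitions(items: Sequence[str]) -> Iterator[List[List[str]]]:
    """Yield every partition of `items` as a list of clusters."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # Put `first` into each existing cluster in turn...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ...or give it a cluster of its own.
        yield [[first]] + partition


if __name__ == "__main__":
    labels = ["label_a", "label_b", "label_c"]  # hypothetical label names
    for partition in set_partitions(labels):
        print(partition)
    print("Bell number:", bell_number(len(labels)))
```

The count grows very quickly with the number of labels (for example, 10 labels already give 115975 partitions), which is why exhaustive enumeration is only practical for small label spaces.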

Best Selected Partitions

Oracle
Exhaustive: Macro-F1 | Micro-F1
HPML-J: Macro-F1 | Micro-F1 | Silhouette
HPML-K: Macro-F1 | Micro-F1 | Silhouette
Random-1: Macro-F1 | Micro-F1 | Silhouette
Random-2: Macro-F1 | Micro-F1 | Silhouette
Local-Partitions
Global-Partitions
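
The partitions above are selected by Macro-F1 or Micro-F1. For readers unfamiliar with the difference, here is a minimal sketch of the two standard definitions for binary label matrices. It is an illustration only, not the repository's evaluation code; the name `macro_micro_f1` is hypothetical, and treating an undefined F1 (no positives at all) as 0.0 is an assumption.

```python
from typing import List, Tuple


def _f1(tp: int, fp: int, fn: int) -> float:
    """F1 from counts; returns 0.0 when undefined (an assumed convention)."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def macro_micro_f1(
    y_true: List[List[int]], y_pred: List[List[int]]
) -> Tuple[float, float]:
    """Return (macro_f1, micro_f1) for 0/1 matrices of shape instances x labels."""
    n_labels = len(y_true[0])
    per_label = []
    total_tp = total_fp = total_fn = 0
    for j in range(n_labels):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 1)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 0 and p[j] == 1)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 0)
        per_label.append(_f1(tp, fp, fn))
        total_tp, total_fp, total_fn = total_tp + tp, total_fp + fp, total_fn + fn
    macro = sum(per_label) / n_labels  # average of per-label F1 scores
    micro = _f1(total_tp, total_fp, total_fn)  # F1 over pooled counts
    return macro, micro


if __name__ == "__main__":
    y_true = [[1, 0], [1, 0], [1, 1], [0, 1]]
    y_pred = [[1, 0], [1, 0], [1, 0], [0, 0]]
    print(macro_micro_f1(y_true, y_pred))
```

In this toy example the frequent label is predicted perfectly and the rare one is missed entirely, so Macro-F1 (which weights labels equally) is lower than Micro-F1 (which pools all decisions).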

Analysis

Here you will find all the documents (tables, plots, etc.) with analysis of the results.

Performance
Win-Loss-Tie
Method-Pair-Comparison
Statistical Tests
Partitioning

INSTRUCTIONS TO REPRODUCE THE EXPERIMENTS

Environments to run the experiments

Conda

You can run the experiments in a Conda environment. Our environment is named "AmbienteTeste". To use it, first install Conda on your computer or cluster, then create the environment with the following command: conda env create --file AmbienteTeste.yaml. Click here to download the files.

Singularity/Apptainer

You can also run the experiments in a Singularity container. We do not provide a container for these experiments, but you can build one. Here you can find a short tutorial on how to do that for our experiments. Singularity is a better choice than a Conda environment when the whole experiment has to run from a temporary folder (scratch or /dev/shm).

Pay attention to this, because running the Conda environment directly from /home can sometimes damage the hard disks and harm all users. In some situations, copying your Singularity container to the server's temporary folder and running absolutely everything from there is the best solution for everyone. Talk to your system administrator about this before trying to reproduce the experiments.

SOURCE CODE

Our code is fully modular because of the constraints of our servers, mainly the job queue, time limits, and memory. This lets us run many jobs in parallel at different steps of the methodology. In the future, a package that executes the whole flow will be developed and made available to the scientific community.

Each source code repository includes instructions on how to run it. You can also adjust the main script to save the results on your machine or in the cloud using rclone (there are some examples in the R scripts).

Attention: before using rclone, check with your institution's network administrator whether uploading files and folders from the cluster to the cloud is allowed. If you use a university Google account, you need to follow specific steps to configure communication between the Google cloud and the server.

Bell Partitions
Cross-Validation
Local Partitions
Global Partitions
Oracle Partitions
Exhaustive Partitions: Micro-F1 | Macro-F1
Hybrid Partitions

-- Generate Partitions: Jaccard | Kohonen

-- Select Best Partition: Macro-F1 | Micro-F1 | Silhouette | Kohonen Macro-F1 | Kohonen Micro-F1

-- Test Best Partition: Macro-F1 | Micro-F1 | Silhouette

Random Partitions

-- Generate Partitions: Random Partitions Version 1 | Random Partitions Version 2 | Random Partitions Version 3

-- Select Best Partition: Macro-F1 | Micro-F1 | Silhouette | Kohonen Macro-F1 | Kohonen Micro-F1

-- Test Best Partition: Macro-F1 | Micro-F1 | Silhouette

-- Random 3
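
The hybrid partition generators listed above include a Jaccard option. As a hedged illustration of the general idea, the sketch below computes the standard Jaccard index between every pair of label columns, which is the kind of similarity matrix a label-clustering step could start from. It is an assumption about what "Jaccard" refers to here, not the repository's R implementation; `jaccard_similarity`, `label_similarity_matrix`, and the toy data are hypothetical.

```python
from typing import List


def jaccard_similarity(col_a: List[int], col_b: List[int]) -> float:
    """Jaccard index of two binary label columns: |A and B| / |A or B|."""
    both = sum(1 for a, b in zip(col_a, col_b) if a == 1 and b == 1)
    either = sum(1 for a, b in zip(col_a, col_b) if a == 1 or b == 1)
    # Two labels that never occur are treated as dissimilar (assumed convention).
    return both / either if either else 0.0


def label_similarity_matrix(y: List[List[int]]) -> List[List[float]]:
    """Pairwise Jaccard similarity between the label columns of `y`."""
    n_labels = len(y[0])
    columns = [[row[j] for row in y] for j in range(n_labels)]
    return [
        [jaccard_similarity(columns[i], columns[j]) for j in range(n_labels)]
        for i in range(n_labels)
    ]


if __name__ == "__main__":
    # Toy label matrix: 4 instances x 3 labels.
    y = [[1, 1, 0], [1, 1, 0], [1, 0, 1], [0, 0, 1]]
    for row in label_similarity_matrix(y):
        print([round(value, 3) for value in row])
```

Labels that frequently co-occur get a similarity close to 1 and would tend to end up in the same cluster; labels that never co-occur get 0.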

Acknowledgment

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001.
This study was financed in part by the Conselho Nacional de Desenvolvimento Científico e Tecnológico - Brasil (CNPQ) - Process number 200371/2022-3.
The authors also thank the Brazilian research agency FAPESP for financial support.

Contact

elainececiliagatto@gmail.com

Links
