Skip to content

Latest commit

 

History

History
126 lines (67 loc) · 8.66 KB

README.md

File metadata and controls

126 lines (67 loc) · 8.66 KB

HPML-KAIS

Repository for the Journal Paper "Multi-Label Classification with Label Clusters"

How to cite this repository

PRE-PRINTS

| 1.º Pre-Print | 2.º Pre-Print |

RESULTS

Resulting Files

Here you can download the generated partitions, best selected partitions and the results from our experiments. If you want to get these for your experiments since the beginning, then please check the source code section.

Generated Partitions

Best Selected Partitions

Analysis

Here you will find all the documents (tables, plots, etc.) with analysis of the results.

INSTRUCTIONS TO REPRODUCE THE EXPERIMENTS

Enviroments to run experiments

Conda

You can run this experiment in a Conda Environment. The name of our conda environment is "AmbienteTeste". To be able to use this env, you must first install conda in your computer or cluster and then install the env using the following command: conda env create --file AmbienteTeste.yaml. Click here to download the files.

Singularity/AppTainer

You can also run this experiment in a singularity container. We do not provide a singularity container for this experiment, but you can build one. Here you can find a little tutorial about how to do that for our experiments. Using singularity is better than conda environments when you have to execute all the experiment in a tmp (scratch or dev/shm) folder.

Pay attention to this because sometimes using the conda environment directly from the /home can destroy hard disks and harm all users. In some situations, copying your singularity container to the server's temp folder and running absolutely everything from there is the best solution for everyone. Talk to the administrator about this before trying to reproduce the experiments.

SOURCE CODE

Our code is completely modular because of our servers - mainly job queue, time, and memory. In this way, we can run many jobs in parallel in different steps of the methodology. In the future, a package that executes all the flow will be developed and available for the scientific community.

In each source code, you will find instructions about how to run the code. You also can make adjustments in the main script to save the results in your machine or in your cloud using rclone (there are some examples in the R scripts).

Attention: before using rclone, check with your institution's network administrator if it is possible to upload files and folders from the cluster to the cloud. In case of using google accounts for universities, you need to follow specific steps to configure communication between google cloud and server.

-- Generate Partitions: Jaccard | Kohonen

-- Select Best Partition: Macro-F1 | Micro-F1 | Silhouette | Kohonen Macro-F1 | Kohonen Micro-F1

-- Test Best Partition: Macro-F1 | Micro-F1 | Silhouette

  • Random Partitions

-- Generate Partitions: Random Partitions Version 1 | Random Partitions Version 2 | Random Partitions Version 3

-- Select Best Partition: Macro-F1 | Micro-F1 | Silhouette | Kohonen Macro-F1 | Kohonen Micro-F1

-- Test Best Partition: Macro-F1 | Micro-F1 | Silhouette

-- Random 3

Acknowledgment

  • This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001.
  • This study was financed in part by the Conselho Nacional de Desenvolvimento Científico e Tecnológico - Brasil (CNPQ) - Process number 200371/2022-3.
  • The authors also thank the Brazilian research agencies FAPESP financial support.

Contact

elainececiliagatto@gmail.com

Links

| Site | Post-Graduate Program in Computer Science | Computer Department | Biomal | CNPQ | Ku Leuven | Embarcados | Read Prensa | Linkedin Company | Linkedin Profile | Instagram | Facebook | Twitter | Twitch | Youtube |