Skip to content

Personal implementation of the paper "A two-stage ensemble method for the detection of class-label noise"

License

Notifications You must be signed in to change notification settings

jpedrocm/noise-detection-ensemble

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

92 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project

python status license

This is the final project for the Multiple Classifiers System's class.

Description

The goal of this project is to implement the proposed method of the paper "A two-stage ensemble method for the detection of class-label noise" and reproduce the experiments in some of the paper datasets, which can be found in the UCI Machine Learning repository. The experimented datasets were: Blood, Breast, Chess, Heart, Ionosfere, Liver, Parkinsons, Sonar and Spambase.

Getting Started

Requirements

Installing

  • Clone this repository into your machine
  • Download and install all the requirements listed above in the given order
  • Download the listed datasets in .txt format
  • Place all these files inside the data/ folder
  • Change their file types to .csv
  • Change their filenames according to the names in the ConfigHelper function get_datasets

Reproducing

  • Enter into the project main folder in your local repository
  • Run this first command to generate all metrics
python main.py
  • Run this second command to aggregate these metrics and generate the final error table and graphics
python aggregate.py

Project Structure

.            
├── data                                  # Datasets files
├── results                               # Results files
├── src                                   # Source code files
|   ├── aggregate.py
|   ├── main.py
|   ├── majority_filtering.py
|   ├── noise_detection_ensemble.py
|   ├── config_helper.py
|   ├── data_helper.py
|   ├── io_helper.py
|   └── metrics_helper.py 
├── LICENSE.md
└── README.md

Author

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

About

Personal implementation of the paper "A two-stage ensemble method for the detection of class-label noise"

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages