This project was submitted to the Reproducibility Challenge @ NeurIPS 2019, the most recent edition of the challenge in which members of the machine learning community dive deep into cutting-edge research by re-implementing parts of a published paper. The project is based on the paper Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss, with a focus on algorithm re-implementation and baseline improvement. As part of this work, we propose a general scheme for improving the baselines using learning rate step decay and a triangular learning rate policy.
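The exact schedule is not spelled out here, so the following is a minimal Python sketch of one plausible way to combine the two ideas: a triangular (cyclical) learning rate in the sense of Smith (2017) for the early epochs, followed by conventional step decay. All numeric values (rate bounds, cycle length, phase boundary, milestones) are illustrative assumptions, not the tuned settings from our experiments.

```python
def combined_lr(epoch, batches_done, lo=0.01, hi=0.1,
                cycle_len=4000, tri_epochs=100,
                milestones=(120, 160), gamma=0.1):
    """Triangular policy for the first `tri_epochs` epochs, then step decay.

    All numeric defaults are illustrative assumptions; they are not the
    tuned values used in the experiments reported here.
    """
    if epoch < tri_epochs:
        # Triangular policy: the rate ramps linearly from `lo` up to `hi`
        # at mid-cycle and back down, repeating every `cycle_len` batches.
        pos = (batches_done % cycle_len) / cycle_len   # position in [0, 1)
        return lo + (hi - lo) * (1.0 - abs(2.0 * pos - 1.0))
    # Step decay: multiply by `gamma` once for each milestone already passed.
    drops = sum(epoch >= m for m in milestones)
    return hi * gamma ** drops
```

In a PyTorch training loop this value would be written into `optimizer.param_groups[i]["lr"]` before each step; the two pieces are also available individually as `torch.optim.lr_scheduler.CyclicLR` and `MultiStepLR`.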
Most state-of-the-art classifiers assume a relatively balanced class distribution and equal misclassification costs, so training on imbalanced data typically yields disappointing results. Although much previous work has proposed strategies to tackle this issue, these techniques usually come with drawbacks of their own, and the gains remain limited. Cao et al. introduced two new techniques, the label-distribution-aware margin loss (LDAM) and deferred re-weighting (DRW), which are claimed to achieve better performance than existing techniques. In this work, we reproduced the baseline experiments reported in the authors' paper on the IMDB and CIFAR-10 benchmarks. We performed extensive hyper-parameter tuning on these models and outperformed the originally reported results. We also propose a general scheme for baseline improvement using learning rate step decay and a triangular policy. Based on the improved baselines, we studied how different techniques affect performance when learning from imbalanced data, including class-balanced re-weighting, class-balanced re-sampling, and borderline-SMOTE.
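For reference, the core technique being reproduced is the LDAM loss, which assigns each class j a margin proportional to n_j^{-1/4} (n_j being the class's training sample count), so rare classes receive larger margins. Below is a minimal PyTorch sketch following the definition in Cao et al.; the maximum margin `max_m` and logit scale `s` are assumed hyper-parameter values, not settings taken from this report.

```python
import numpy as np
import torch
import torch.nn.functional as F

class LDAMLoss(torch.nn.Module):
    """Label-distribution-aware margin (LDAM) loss of Cao et al.

    Class j receives a margin proportional to n_j^{-1/4}, so rare classes
    get larger margins. `max_m` (largest margin after rescaling) and the
    logit scale `s` are assumed values, not settings from this report.
    """

    def __init__(self, cls_num_list, max_m=0.5, s=30.0, weight=None):
        super().__init__()
        n = np.asarray(cls_num_list, dtype=np.float64)
        m = 1.0 / np.sqrt(np.sqrt(n))      # margins proportional to n_j^{-1/4}
        m = m * (max_m / m.max())          # rescale so the largest margin is max_m
        self.register_buffer("m_list", torch.tensor(m, dtype=torch.float32))
        self.s = s                         # logit scaling factor
        self.weight = weight               # optional per-class weights (e.g. for DRW)

    def forward(self, logits, target):
        # Subtract the class-dependent margin from the true-class logit only.
        batch_m = self.m_list[target].unsqueeze(1)              # (batch, 1)
        true_class = F.one_hot(target, logits.size(1)).bool()   # (batch, C)
        shifted = torch.where(true_class, logits - batch_m, logits)
        return F.cross_entropy(self.s * shifted, target, weight=self.weight)
```

Here `cls_num_list` holds the per-class sample counts of the imbalanced training set; under DRW the `weight` argument stays `None` during the first training stage and switches to class-balanced weights later.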
- Imbalanced CIFAR-10 Experiment Results
- Parameter Settings and Results for IMDB Experiments
- ERM: batch size = 128, max features = 8000, epochs = 2
- Re-sampling: batch size = 128, max features = 10000, epochs = 4
- Re-weighting: batch size = 128, max features = 10000, epochs = 5
- SMOTE: batch size = 128, max features = 10000, epochs = 3

All four runs use the Adam optimizer with its default settings.
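The "max features" setting suggests the standard Keras IMDB pipeline, where it caps the vocabulary size. Below is a minimal Keras sketch of the ERM configuration under that assumption; the architecture (embedding + LSTM) and the sequence length are assumptions, and the imbalancing of the training split used in the experiments is omitted.

```python
from tensorflow import keras

MAX_FEATURES = 8000   # vocabulary cap for the ERM baseline (settings above)
MAXLEN = 400          # sequence length: an assumption, not given in the report

# Standard Keras IMDB pipeline: reviews arrive as integer word indices.
(x_train, y_train), (x_test, y_test) = keras.datasets.imdb.load_data(
    num_words=MAX_FEATURES)
x_train = keras.preprocessing.sequence.pad_sequences(x_train, maxlen=MAXLEN)
x_test = keras.preprocessing.sequence.pad_sequences(x_test, maxlen=MAXLEN)

# Hypothetical architecture; only the optimizer, batch size, vocabulary
# size, and epoch count come from the settings listed above.
model = keras.Sequential([
    keras.layers.Embedding(MAX_FEATURES, 64),
    keras.layers.LSTM(64),
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam",   # Adam with default settings, per the report
              loss="binary_crossentropy", metrics=["accuracy"])
model.fit(x_train, y_train, batch_size=128, epochs=2,
          validation_data=(x_test, y_test))
```

The re-sampling, re-weighting, and SMOTE runs would reuse this setup with `MAX_FEATURES = 10000` and the epoch counts listed above, plus the corresponding data-level or loss-level intervention.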