Skip to content

Nivel: Mid-senior (Hackathon). Dado un dataset imbalanceado se genera diferentes modelos predictivos para seleccionar el que maximice el f1-score macro. Deep Learning.

Notifications You must be signed in to change notification settings

adions025/imbalanced_dataset-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deep Learning - Imbalanced Dataset

Random under-sampling approach and also a comparison of different optimizers: Adam, SGD, Adadelta, Adagrad, and various learning rates.

Keywords: Deep learning, Imbalanced Dataset, Resampling

Resampling

A widely adopted technique for dealing with highly unbalanced datasets is called resampling. It consists of removing samples from the majority class (under-sampling) and / or adding more examples from the minority class (over-sampling).

image info

Despite the advantage of balancing classes, these techniques also have their weaknesses (there is no free lunch). The simplest implementation of over-sampling is to duplicate random records from the minority class, which can cause overfitting. In under-sampling, the simplest technique involves removing random records from the majority class, which can cause loss of information.

Dirs and files

Author

About

Nivel: Mid-senior (Hackathon). Dado un dataset imbalanceado se genera diferentes modelos predictivos para seleccionar el que maximice el f1-score macro. Deep Learning.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published