-
-
Notifications
You must be signed in to change notification settings - Fork 261
Open
Labels
RoadmapLarger, self-contained pieces of work.Larger, self-contained pieces of work.
Description
Imbalanced datasets, where the classes have very different occurrence rates, can show up in large data sets.
There are many strategies for dealing with imbalanced data. http://contrib.scikit-learn.org/imbalanced-learn/stable/api.html implements a set, some of which could be scaled to large datasets with dask.
Metadata
Metadata
Assignees
Labels
RoadmapLarger, self-contained pieces of work.Larger, self-contained pieces of work.