Skip to content

Unbalanced Data Streaming #445

Answered by raphaelsty
Haoyu-R asked this question in Q&A
Jan 19, 2021 · 2 comments · 4 replies
Discussion options

You must be logged in to vote

Hello,

The model should be influenced if a class is over-represented in the stream.

I don't know if this is exactly what you are looking for, but there is an imblearn module.

For exemple the HardSamplingClassifier class allows you to store the n latest observations that are the most difficult to predict and re-train the model on these data with some probability.

https://riverml.xyz/latest/api/imblearn/HardSamplingClassifier/

Also take a look at imblearn.RandomSampler, imblearn.RandomOverSampler and imblearn.RandomUnderSampler which may help you to get the desired data distribution.

The drift module may also help you to detect any trend updates.

Raphaël

Replies: 2 comments 4 replies

Comment options

You must be logged in to vote
4 replies
@MaxHalford
Comment options

@Haoyu-R
Comment options

@Haoyu-R
Comment options

@MaxHalford
Comment options

Answer selected by MaxHalford
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants