
Add monitor() method for monitoring model performance in production #179

Closed
pplonski opened this issue Sep 10, 2020 · 3 comments

The AutoML API should be extended with a monitor() method:

  • the monitor() should track the model performance on new data
  • it should check the prediction distribution on new data and compare it with the distribution from training (out-of-fold predictions)
  • it should detect outliers in new data
  • it should detect data drift in new data

I propose to have the following arguments in monitor():

  • X (new test data)
  • y (new test data targets)
  • y_predicted (predictions from the AutoML)

The monitor() should return a report about incidents in the new data, for example a list of warnings with explanations of what the problem was.
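
A rough sketch of what such a monitor() could look like (purely illustrative, the method was not implemented in this form; the KS test, the 5% thresholds, and the self._oof_predictions / self._train_min / self._train_max attributes are assumptions, not part of the package):

from scipy import stats

def monitor(self, X, y, y_predicted):
    # Hypothetical sketch of the proposed monitor() method.
    warnings = []

    # Compare the distribution of new predictions with the out-of-fold
    # predictions stored at training time (assumed attribute).
    ks_stat, p_value = stats.ks_2samp(self._oof_predictions, y_predicted)
    if p_value < 0.05:
        warnings.append(
            f"Prediction distribution shifted (KS={ks_stat:.3f}, p={p_value:.4f})"
        )

    # Naive outlier / drift check: share of rows with feature values outside
    # the range observed during training (assumed attributes).
    out_of_range = ((X < self._train_min) | (X > self._train_max)).any(axis=1).mean()
    if out_of_range > 0.05:
        warnings.append(
            f"{out_of_range:.1%} of new rows have out-of-range feature values"
        )

    return warnings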

@pplonski pplonski added enhancement New feature or request help wanted Extra attention is needed labels Sep 10, 2020
@pplonski pplonski added this to the 0.9.0 milestone Feb 22, 2021

pplonski commented Feb 22, 2021

There will be a new method, need_retrain(). It will take new data as input and invoke two new methods:

  • is_drift() to check changes in new data
  • performance_decrease() to check the performance on new data

Example pseudo-code:

def need_retrain(self, X, y):
    return self.is_drift(X, y) or self.performance_decrease(X, y)

Maybe there should also be a summary Markdown file created with the reasons why the model needs to be retrained.
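
A minimal sketch of how these two helpers might look (hypothetical; the KS test, the attribute names self._X_train, self._best_model_score, self._eval_metric, and the thresholds are assumptions):

from scipy import stats
import numpy as np

def is_drift(self, X, y, p_threshold=0.05):
    # Hypothetical: per-feature two-sample KS test against the training data
    # kept in an assumed self._X_train attribute.
    for column in X.columns:
        _, p_value = stats.ks_2samp(self._X_train[column], X[column])
        if p_value < p_threshold:
            return True
    return False

def performance_decrease(self, X, y, decrease=0.1):
    # Hypothetical: relative change of the metric on new data versus the score
    # recorded at training time (assumed self._best_model_score attribute).
    new_score = self._eval_metric(y, self.predict(X))
    change = np.abs((self._best_model_score - new_score) / self._best_model_score)
    return change > decrease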

@pplonski pplonski added this to To do in mljar-supervised Feb 23, 2021
@pplonski pplonski moved this from To do to In progress in mljar-supervised Feb 23, 2021
@pplonski pplonski moved this from In progress to Done in mljar-supervised Feb 24, 2021
@pplonski pplonski moved this from Done to In progress in mljar-supervised Feb 24, 2021
pplonski commented:

closed by mistake

pplonski commented:

OK, at the beginning I wanted to make this feature super sensitive to any input data changes, but in the end I settled on a simple approach of just monitoring performance. I hope it will be enough. If there is a change in the data, then the performance of the AutoML predictions will decrease.

A new method has been added:

def need_retrain(self, X, y, sample_weight=None, decrease=0.1):
    """Decides about model retraining based on new data.

    Arguments:
        X (numpy.ndarray or pandas.DataFrame):
            New data.

        y (numpy.ndarray or pandas.Series):
            True labels for X.

        sample_weight (numpy.ndarray or pandas.Series):
            Sample weights.

        decrease (float): The ratio of change in performance used as a threshold for the retraining decision.
            By default, it is set to `0.1`, which means that if the performance of AutoML decreases by 10%
            on new data, then there is a need to retrain. This value should be set depending on your project needs.
            Sometimes 10% is enough, but for some projects it can be even lower than 1%.

    Returns:
        boolean: True if there is a need to retrain the AutoML.
    """

It works as follows:

  • a user calls need_retrain() with new X and y data
  • the metric score is computed on the new data
  • the performance of the best model is restored from its params.json file
  • if there is a decrease, we check whether it is larger than the decrease parameter

The change is computed as follows:

change = np.abs((old_score - new_score) / old_score)
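
A short usage sketch (assuming the package's usual from supervised import AutoML import path; the data variables and the retraining step are illustrative):

from supervised import AutoML

automl = AutoML()
automl.fit(X_train, y_train)

# ... later, with freshly collected, labeled production data ...
if automl.need_retrain(X_new, y_new, decrease=0.1):
    # Example: logloss went from 0.30 at training time to 0.36 on new data,
    # so change = |(0.30 - 0.36) / 0.30| = 0.2 > 0.1 and retraining is triggered.
    automl = AutoML()
    automl.fit(X_updated, y_updated)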

@pplonski pplonski moved this from In progress to Done in mljar-supervised Feb 24, 2021
@pplonski pplonski self-assigned this Feb 24, 2021