Sklearn autologging: Fix behavior when a child and its parent are both patched #3582

dbczumar · 2020-10-23T06:38:41Z

What changes are proposed in this pull request?

Fixes #3574 by addressing the following two issues:

In gorilla 0.3.0, applying a patch that stores a hit to a parent class (e.g., CountVectorizer) forces get_original_attribute of its children to refer to the parent class attribute, even if we patch the child later on. I.e.

This ordering is okay

gorilla.Patch(<TfidfVectorizer>)
gorilla.Patch(<CountVectorizer>)

but this ordering is not

gorilla.Patch(<CountVectorizer>)
gorilla.Patch(<TfidfVectorizer>)

This appears to be caused by the following line that prevents the original attribute from being overwritten once it's set: https://github.com/christophercrouzet/gorilla/blob/0c895b34311b6546bde35854e24862c001279add/gorilla.py#L280. As a result, setting the attribute on the parent, which sets the original attribute on the child if the child does not already define it, means that the child is always stuck with the parent's original attribute. This seems to be fixed in the master branch of gorilla but has not been released. Either we can pull in the copy from master or pull in a copy from 0.3.0 that removes this problematic hasattr check.

In fit_mlflow and fit_predict, we perform get_original_attribute(self, func_name). In the case where we're in a super class function (e.g., CountVectorizer.fit_transform()), self still refers to the subclass (e.g., TfidfVectorizer). As a result, calling get_original_attribute in the patched super class method fetches invokes the patched subclass method, which creates the infinite recursion problem observed in the issue description. I've filed Sklearn autologging: Fix behavior when a child and its parent are both patched #3582 to address this problem by fetching the original function attribute by reference to a class rather than by reference to self. This PR still needs docs & tests, and it needs to incorporate a fix for (1) before it's mergeable.

How is this patch tested?

Unit tests

Release Notes

Fixes an infinite recursion bug in scikit-learn autologging caused by invoking a fit() method on a model class that includes a call to super.fit() as part of its implementation (see #3574).

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

Interface

area/uiux: Front-end, user experience, JavaScript, plotting
area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
area/windows: Windows support

Language

language/r: R APIs and clients
language/java: Java APIs and clients
language/new: Proposals for new client languages

Integrations

integrations/azure: Azure and Azure ML integrations
integrations/sagemaker: SageMaker integrations
integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

harupy · 2020-10-23T06:48:13Z

@dbczumar Is this a fix for #3574 ?

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

This reverts commit 5a84bba. Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

This reverts commit b4fcfa5. Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

dbczumar · 2020-10-26T20:01:59Z

mlflow/utils/gorilla.py

+            prev_patch = getattr(patch.destination, _ACTIVE_PATCH, None)
+            if not hasattr(patch.destination, original_name) or (
+                prev_patch
+                and prev_patch.destination != patch.destination
+                and issubclass(patch.destination, prev_patch.destination)
+            ):
+                setattr(patch.destination, original_name, target)
+
+    setattr(patch.destination, patch.name, patch.obj)
+    setattr(patch.destination, _ACTIVE_PATCH, patch)


@harupy These are the critical portions of the change to address problem (1) from #3574 (comment).

We now retain a reference to the previous patch that was applied to a class or one of its super classes, and, if the patch was applied to a super class, we overwrite it with the patch that's being applied to this class directly. See https://github.com/christophercrouzet/gorilla/blob/0045a7f4b5c46bda208dbce9e628f97bc9e551e0/gorilla.py#L295-L300 for the original implementation.

mlflow/utils/gorilla.py

dbczumar · 2020-10-26T20:11:51Z

mlflow/sklearn/__init__.py

        should_start_run = mlflow.active_run() is None
        if should_start_run:
            try_mlflow_log(mlflow.start_run)

        _log_pretraining_metadata(self, *args, **kwargs)

-        original_fit = gorilla.get_original_attribute(self, func_name)
+        original_fit = gorilla.get_original_attribute(clazz, func_name)


This (together line 914) is the critical change to address problem (2) from #3574 (comment).

Assume that the old line was being invoked from a patched fit call on a class called Parent and that self refers to an instance of class Child which inherits from Parent. Calling get_original_attribute() on self would yield the Child.fit() function, rather than Parent.fit(), leading to recursive behavior.

Now, instead of fetching the attribute from self, we fetch it from clazz (which would refer to Parent in the above example).

tests/sklearn/test_sklearn_autolog.py

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

harupy

LGTM 👍

dbczumar requested a review from harupy October 23, 2020 06:38

dbczumar mentioned this pull request Oct 23, 2020

sklearn autolog and TfidfVectorizer do not work together #3574

Closed

dbczumar added 15 commits October 26, 2020 14:57

Clazz

e140de1

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Inline gorilla

f9ba80c

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Some docs

d52cd50

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Swap gorilla

2faafd5

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Add test

949db08

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Test tweak

1dd66b8

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Doc

ee231da

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

whitespace

2deed56

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Format with black

99f827e

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Overwrite option

c730b6d

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Doc

e69792c

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Revert "Doc"

bbcc799

This reverts commit 5a84bba. Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Revert "Overwrite option"

0584602

This reverts commit b4fcfa5. Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Safety

ec6398e

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

Black format

8b7bc42

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

dbczumar force-pushed the clazz_fix branch from 0e8a93d to 8b7bc42 Compare October 26, 2020 19:58

dbczumar commented Oct 26, 2020

View reviewed changes

dbczumar changed the title ~~[WIP] Sklearn autologging: Fix behavior when a child and its parent are both patched~~ Sklearn autologging: Fix behavior when a child and its parent are both patched Oct 26, 2020

github-actions bot added area/tracking Tracking service, tracking client APIs, autologging rn/bug-fix Mention under Bug Fixes in Changelogs. labels Oct 26, 2020

dbczumar commented Oct 26, 2020

View reviewed changes

mlflow/utils/gorilla.py Show resolved Hide resolved

dbczumar commented Oct 26, 2020

View reviewed changes

harupy reviewed Oct 26, 2020

View reviewed changes

tests/sklearn/test_sklearn_autolog.py Show resolved Hide resolved

harupy reviewed Oct 26, 2020

View reviewed changes

tests/sklearn/test_sklearn_autolog.py Outdated Show resolved Hide resolved

Lint, address comment

81f11bb

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

dbczumar force-pushed the clazz_fix branch from f93ff49 to 81f11bb Compare October 27, 2020 05:54

Fix conflicts

daa732b

Signed-off-by: Corey Zumar <corey.zumar@databricks.com>

harupy approved these changes Oct 27, 2020

View reviewed changes

dbczumar merged commit 44b152a into mlflow:master Oct 27, 2020

dbczumar mentioned this pull request Oct 27, 2020

Scikit-learn autologging: Exclude feature extraction / selection estimator #3600

Merged

27 tasks

harupy mentioned this pull request Dec 17, 2020

Add autologging safety utils to statsmodels & Fast.ai #3859

Merged

27 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sklearn autologging: Fix behavior when a child and its parent are both patched #3582

Sklearn autologging: Fix behavior when a child and its parent are both patched #3582

dbczumar commented Oct 23, 2020 •

edited

Loading

harupy commented Oct 23, 2020

dbczumar Oct 26, 2020 •

edited

Loading

dbczumar Oct 26, 2020

harupy left a comment

Sklearn autologging: Fix behavior when a child and its parent are both patched #3582

Sklearn autologging: Fix behavior when a child and its parent are both patched #3582

Conversation

dbczumar commented Oct 23, 2020 • edited Loading

What changes are proposed in this pull request?

How is this patch tested?

Release Notes

Is this a user-facing change?

What component(s), interfaces, languages, and integrations does this PR affect?

How should the PR be classified in the release notes? Choose one:

harupy commented Oct 23, 2020

dbczumar Oct 26, 2020 • edited Loading

Choose a reason for hiding this comment

dbczumar Oct 26, 2020

Choose a reason for hiding this comment

harupy left a comment

Choose a reason for hiding this comment

dbczumar commented Oct 23, 2020 •

edited

Loading

dbczumar Oct 26, 2020 •

edited

Loading