
Treeshap hypothesis tests #4671

Merged: 8 commits, Apr 13, 2022
Conversation

@RAMitchell (Contributor)

Increased test coverage for TreeExplainer, greatly expanding the set of model types tested. The new tests take around 4.8s on my machine.

Fixes #4352

New bugs found:
#4663
dmlc/treelite#375
#4670

@RAMitchell RAMitchell requested review from a team as code owners March 30, 2022 15:29
@github-actions bot added the CUDA/C++ and Cython / Python labels Mar 30, 2022
@RAMitchell (Contributor, Author)

Looks like xgboost segfaulted. Can't reproduce this locally and it doesn't seem strongly related to any of these changes.

cuml/test/explainer/test_gpu_treeshap.py::test_xgb_toy_categorical [0]	train-error:0.00000
[c8d0704a3bfc:3027 :0:3561] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x5581e5d67ff8)
==== backtrace (tid:   3561) ====
 0  /opt/conda/envs/rapids/lib/python3.8/site-packages/ucp/_libs/../../../../libucs.so.0(ucs_handle_error+0x155) [0x7eff71a3f3f5]
 1  /opt/conda/envs/rapids/lib/python3.8/site-packages/ucp/_libs/../../../../libucs.so.0(+0x2d791) [0x7eff71a3f791]
 2  /opt/conda/envs/rapids/lib/python3.8/site-packages/ucp/_libs/../../../../libucs.so.0(+0x2d962) [0x7eff71a3f962]
 3  /usr/lib64/libc.so.6(+0x36400) [0x7f00b1fb4400]
 4  /opt/conda/envs/rapids/lib/python3.8/site-packages/cuml/common/../../../../libnccl.so.2(+0x5563e) [0x7effbd70d63e]
 5  /opt/conda/envs/rapids/lib/python3.8/site-packages/cuml/common/../../../../libnccl.so.2(+0x445b9) [0x7effbd6fc5b9]
 6  /usr/lib64/libpthread.so.0(+0x7ea5) [0x7f00b2c64ea5]
 7  /usr/lib64/libc.so.6(clone+0x6d) [0x7f00b207cb0d]
=================================
Fatal Python error: Segmentation fault

Thread 0x00007efecefd5700 (most recent call first):
  File "/opt/conda/envs/rapids/lib/python3.8/threading.py", line 306 in wait
  File "/opt/conda/envs/rapids/lib/python3.8/threading.py", line 558 in wait
  File "/opt/conda/envs/rapids/lib/python3.8/site-packages/tqdm/_monitor.py", line 60 in run
  File "/opt/conda/envs/rapids/lib/python3.8/threading.py", line 932 in _bootstrap_inner
  File "/opt/conda/envs/rapids/lib/python3.8/threading.py", line 890 in _bootstrap

Thread 0x00007f00b308e740 (most recent call first):
  File "/opt/conda/envs/rapids/lib/python3.8/site-packages/xgboost/core.py", line 1423 in __del__
  File "/opt/conda/envs/rapids/lib/python3.8/site-packages/xgboost/training.py", line 188 in train
  File "/workspace/python/cuml/test/explainer/test_gpu_treeshap.py", line 347 in test_xgb_toy_categorical

@dantegd (Member)

dantegd commented Apr 4, 2022

@RAMitchell I've seen that in another PR or nightly job, so I'm sure it's unrelated to this PR's changes, but it does look like XGB segfaulting. I'll dig through the log.

@RAMitchell (Contributor, Author)

I tried to reproduce the CI failure locally by running the test in isolation, but unfortunately couldn't reproduce it.

import numpy as np
import pandas as pd
import xgboost as xgb
from cuml.experimental.explainer.tree_shap import TreeExplainer


def test_xgb_toy_categorical():
    X = pd.DataFrame({'dummy': np.zeros(5, dtype=np.float32),
                      'x': np.array([0, 1, 2, 3, 4], dtype=np.int32)})
    y = np.array([0, 0, 1, 1, 1], dtype=np.float32)
    X['x'] = X['x'].astype("category")
    dtrain = xgb.DMatrix(X, y, enable_categorical=True)
    params = {"tree_method": "gpu_hist", "eval_metric": "error",
              "objective": "binary:logistic", "max_depth": 2,
              "min_child_weight": 0, "lambda": 0}
    xgb_model = xgb.train(params, dtrain, num_boost_round=1,
                          evals=[(dtrain, 'train')])
    explainer = TreeExplainer(model=xgb_model)
    out = explainer.shap_values(X)

    ref_out = xgb_model.predict(dtrain, pred_contribs=True)
    np.testing.assert_almost_equal(out, ref_out[:, :-1], decimal=5)
    np.testing.assert_almost_equal(explainer.expected_value, ref_out[0, -1],
                                   decimal=5)


# Run the test repeatedly to try to trigger the intermittent segfault
for i in range(1000):
    test_xgb_toy_categorical()

@trivialfis (Member)

trivialfis commented Apr 5, 2022

Have you tried address sanitizer?

@RAMitchell RAMitchell changed the base branch from branch-22.04 to branch-22.06 April 6, 2022 15:38
Comment on lines +585 to +587
n_targets = draw(st.integers(2, 5))
else:
n_targets = 1
Contributor


So n_targets means n_classes in the context of classification? Let's just use n_classes for this purpose, since it's confusing otherwise. (I was wondering if we were using an unreleased feature of XGBoost)

@RAMitchell (Contributor, Author) commented Apr 8, 2022

These tests will support multi-output regression. I am using n_targets as a more generic term.

@RAMitchell (Contributor, Author)

I am wondering if the xgboost CI failure is somehow due to nccl. I've seen this occur in different tests calling xgboost, so the failure doesn't seem tied to any particular test, only to xgboost being used. It seems to occur more often in this PR than elsewhere in CI, maybe because I increased the frequency of xgboost tests.

@RAMitchell (Contributor, Author)

Took @trivialfis's advice and tried compiling cuml with AddressSanitizer. It turns out we were deleting a derived object through a base class pointer that had no virtual destructor, so the derived class destructor was never called. Hopefully this resolves the issue.

@codecov-commenter

Codecov Report

❗ No coverage uploaded for pull request base (branch-22.06@dfc1eae).
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-22.06    #4671   +/-   ##
===============================================
  Coverage                ?   84.04%           
===============================================
  Files                   ?      252           
  Lines                   ?    20348           
  Branches                ?        0           
===============================================
  Hits                    ?    17102           
  Misses                  ?     3246           
  Partials                ?        0           
Flag Coverage Δ
dask 44.97% <0.00%> (?)
non-dask 77.34% <0.00%> (?)

Flags with carried forward coverage won't be shown.



@trivialfis (Member)

Seems to be working?

@dantegd added the tests, improvement, and non-breaking labels Apr 13, 2022
@dantegd dantegd added this to PR-WIP in v22.06 Release via automation Apr 13, 2022
v22.06 Release automation moved this from PR-WIP to PR-Reviewer approved Apr 13, 2022
@dantegd (Member)

dantegd commented Apr 13, 2022

@gpucibot merge

@rapids-bot rapids-bot bot merged commit b3967cf into rapidsai:branch-22.06 Apr 13, 2022
v22.06 Release automation moved this from PR-Reviewer approved to Done Apr 13, 2022
rapids-bot bot pushed a commit that referenced this pull request May 11, 2022
Stacked on #4671.

- Remove an extra redundant class in the Python layer.
- Simplify the interface between C++ and Python using variants.
- Fix #4670 by allowing double precision data
- Document TreeExplainer
- Add interventional SHAP method
- Add Shapley interactions and Taylor interactions
- Promote from experimental
- Support sklearn estimator types from xgb/lgbm (i.e. no need to convert to a booster before using TreeExplainer)

Authors:
  - Rory Mitchell (https://github.com/RAMitchell)

Approvers:
  - Philip Hyunsu Cho (https://github.com/hcho3)
  - Dante Gama Dessavre (https://github.com/dantegd)

URL: #4697
Labels
CUDA/C++ · Cython / Python (Cython or Python issue) · improvement (Improvement / enhancement to an existing function) · non-breaking (Non-breaking change) · tests (Unit testing for project)
Projects
No open projects
Development

Successfully merging this pull request may close these issues.

[BUG] RF trained with float64 segfault in conversion to Treelite object
5 participants