Ph add errors #8

jphall663 · 2019-03-08T21:22:40Z

New plan off attack on model debugging ...

Sensitivity Analysis:

Random attacks (very important actually, even though they drive me insane)
PDP:
-- w/ out of range values
-- w/ missing values
-- w/ ICE at deciles of yhat
-- w/ residual of PD
-- ALWAYS w/ histogram so you can see where data supports predictions
Adversarial examples:
-- What can change high residual points to low residual points (and vice versa)
-- What can change incorrectly classified points to correctly classified point (and vice versa)
-- record non-nonsensical features and combinations that cause problems as non-robust
-- train model on wrongly classified points (high-noise training set) and look for important features; score on low-noise training data and conduct activation analysis to find non-robust features

https://github.com/jphall663/interpretable_machine_learning_with_python/blob/ph_add_errors/debugging_sens_analysis_redux.ipynb

Residual Analysis:

Always report more than one error metric
Analyze w.r.t. trusted benchmark model (similarity matrix?)
Plots of residual vs. prediction
Plots of residual vs. target and top-k important inputs, by level if categorical
Explanation of residual: DT surrogate (trained on right/wrong), LIME (trained on right/wrong), Shap, LOCO of residual (global and local)
Use explanation of residuals: DT surrogate, LIME, Shap, LOCO of residual (global and local) to automatically build tests/assertions/adversaries.
Plots of Shap/LOCO vs. residual

https://github.com/jphall663/interpretable_machine_learning_with_python/blob/ph_add_errors/debugging_resid_analysis_redux.ipynb

For each bullet above, try to discuss in terms of:

accuracy
disparate impact
privacy/security

Potentially remove: https://github.com/jphall663/interpretable_machine_learning_with_python/blob/ph_add_errors/pdp_dt_surr_res_shap_loss.ipynb

Self-Healing

Use discovered assertions in training to prevent error states
Use adversarial examples to find non-robust (non-nonsensical? use explanation to decide) features or interactions that are correlated to the target and remove, or prevent these interactions from being modeled.
Boosting of surrogate prediction model from the underlying oracle model to the point where they are as accurate as the oracle ("Dagger").
Different models on robust and non-robust features and give lower weight to non-robust features model in ensemble.

…notebook

…achine_learning_with_python into ph_add_errors

…table_machine_learning_with_python into ph_add_errors

jphall663 · 2019-08-22T16:16:37Z

The advanced sensitivity and residual analysis notebook were merged into master in an extremely silly way, i.e. by downloading them from the branch associated with this PR and then checking out master and copying the files there. Sorry!

Good drafts for both are now done, so closing.

jphall663 added 30 commits December 30, 2018 17:04

VERY first draft of model debugging

f9067ac

housekeeping on this branch

82a2fac

add notebook just for shap loss for shap/shap#380

83352c2

shap on logloss working! intermediate work on larger model debugging …

8d2459c

…notebook

more progress on debugging

db19ae0

more formatting

2fec79e

more formatting

9de34ea

some intermediate commit necessary for merge

1d7044f

update to master

d85c07c

add disclaimer

e1c5828

new takes on model debugging

7a228a7

Merge branch 'master' of https://github.com/jphall663/interpretable_m…

fa26213

…achine_learning_with_python into ph_add_errors

add aquarium instructions

0db32ae

reset dia.ipynb back to master

57bae35

Merge branch 'ph_add_errors' of https://github.com/jphall663/interpre…

3422c6a

…table_machine_learning_with_python into ph_add_errors

more incremental progress on model debugging: PDP

31174c1

first draft of new PDP/ICE for model debugging

4268de0

first steps toward adversarial examples

fa88be0

first steps toward adversaries

c968047

more on adversaries

258c4d0

add random attack

83f524a

some text for model debugging with sensitivity analysis

6c0424b

more sensitivity analysis

b6464e1

more extensibility for adversary generating/summarizing functions

c305146

more analysis of results

e1b5390

proofing entire debugging notebook

fa9cef0

more steps on residual analysis

819022f

add residual plots and disparate error analysis

f9e6e16

rerun logloss shap

47f5636

add shap vs. resid stuff

b1c69ea

jphall663 added 7 commits July 24, 2019 20:41

some testing/comments

89ada2e

minor improvements to figures

0efb999

more annotation

e8bf445

fix merge

5b603bd

more annotation, up to section 8

610b68b

more annotation for section 8.1

04b64a5

good draft of sensitivity analysis

17248e1

jphall663 closed this Aug 22, 2019

jphall663 deleted the ph_add_errors branch January 2, 2020 21:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ph add errors #8

Ph add errors #8

jphall663 commented Mar 8, 2019 •

edited

jphall663 commented Aug 22, 2019

Ph add errors #8

Ph add errors #8

Conversation

jphall663 commented Mar 8, 2019 • edited

jphall663 commented Aug 22, 2019

jphall663 commented Mar 8, 2019 •

edited