
Evaluation metrics calculated with model_performance() differ from those computed with caret::confusionMatrix() #250

Closed
FrieseWoudloper opened this issue Jun 22, 2020 · 7 comments
Labels
feature 💡 New feature or enhancement request R 🐳 Related to R

Comments

@FrieseWoudloper

I'd like to compare a decision tree with a random forest, so I first trained a decision tree and calculated some evaluation metrics on a test set using caret::confusionMatrix(). I did the same using the DALEX package. Although the trees and predictions are identical, the metrics (precision, recall, F1) calculated by model_performance() differ from those calculated with caret::confusionMatrix(). Why is this? Am I doing something wrong?
See: https://rpubs.com/friesewoudloper/630778
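For anyone reproducing this without the RPubs document, here is a minimal illustration (made-up 0/1 labels, not the data from the linked notebook) of how the same predictions yield different precision and recall depending on which class is treated as positive:

```r
library(caret)

# Toy 0/1 labels (illustrative only, not from the linked RPubs document)
truth <- factor(c(0, 0, 1, 1, 1, 0, 1, 1), levels = c(0, 1))
pred  <- factor(c(0, 1, 1, 1, 0, 0, 1, 1), levels = c(0, 1))

# caret defaults to the first factor level ("0") as the positive class
cm0 <- confusionMatrix(pred, truth)
# Explicitly treat "1" as positive instead (DALEX's convention)
cm1 <- confusionMatrix(pred, truth, positive = "1")

cm0$byClass[c("Precision", "Recall", "F1")]
cm1$byClass[c("Precision", "Recall", "F1")]
```

With these labels the two calls report different precision and recall, even though the predictions are identical, because the positive class flips between them.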

@hbaniecki
Member

hbaniecki commented Jun 22, 2020

Hi,
I believe caret treats the first factor level (here 0) as the positive class by default, while DALEX uses 1 as the positive class.
https://stackoverflow.com/questions/38263137/set-positive-class-to-1-in-r
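To see this effect without either package, here is a base-R sketch (made-up labels, same idea as above) computing precision and recall for each choice of positive class:

```r
# Illustrative 0/1 labels (not the issue reporter's data)
truth <- c(0, 0, 1, 1, 1, 0, 1, 1)
pred  <- c(0, 1, 1, 1, 0, 0, 1, 1)

# Precision and recall with respect to a chosen positive class `pos`
precision <- function(pos) sum(pred == pos & truth == pos) / sum(pred == pos)
recall    <- function(pos) sum(pred == pos & truth == pos) / sum(truth == pos)

precision(1); recall(1)  # 1 as positive (DALEX convention): 0.8, 0.8
precision(0); recall(0)  # 0 as positive (caret's default, first level): ~0.667
```

Same predictions, different metrics: only the definition of "positive" changed.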

@hbaniecki hbaniecki added the R 🐳 Related to R label Jun 22, 2020
@pbiecek
Member

pbiecek commented Jun 22, 2020

@maksymiuks can we take care of this in the default predict function?

@maksymiuks
Member

It is, but the solution would require modifying the input model (adding an attribute in the explain() function)

@hbaniecki
Member

Why not in model_performance?

@pbiecek
Member

pbiecek commented Jun 22, 2020

IMHO, either the explainer or the predict function should know which label is positive

@hbaniecki hbaniecki added feature 💡 New feature or enhancement request long term 📆 TODO long term labels Jul 22, 2020
@maksymiuks maksymiuks self-assigned this Aug 3, 2020
@maksymiuks
Member

Let's focus on it after the DALEX 2.0.0 release.

@maksymiuks maksymiuks added short term ⏰ TODO short term and removed long term 📆 TODO long term labels Aug 27, 2020
@maksymiuks maksymiuks added this to the DALEX v2.0.0 milestone Aug 27, 2020
@pbiecek pbiecek removed this from the DALEX v2.0.0 milestone Aug 29, 2020
@pbiecek pbiecek removed the short term ⏰ TODO short term label Aug 29, 2020
maksymiuks added a commit that referenced this issue Aug 29, 2020
pbiecek pushed a commit that referenced this issue Aug 29, 2020
* Positive class support

* Update README.md

* Remove #250 content

* typo fix

* News added

* Added tests for new warnings

* Typo fix

* Update NEWS.md
pbiecek pushed a commit that referenced this issue Nov 15, 2020
* yhat changed

* typo fix

* Ver upgrade

* Extend test, fix documentation

* add external tests

* Update NEWS.md

* Parameter name change

* New param name

* add predict column for multiclass

* Update misc_yhat.R

* Change name of the parameter
@maksymiuks
Member

Solved in #353
