
[BUG] argument metric_missing=0 is ignored when points for missing cat is calculated in scorecard table #226

Closed
peterpanmj opened this issue Dec 18, 2022 · 1 comment
Labels: bug

@peterpanmj (Contributor) commented:

description

Scorecard._fit ignores the metric_missing=0 argument when building the scorecard table.

example

import numpy as np
import pandas as pd

data = pd.DataFrame(
    data={
        'target': np.hstack(
            (np.random.choice([0, 1], 100, p=[0.1, 0.9]),
             np.random.choice([0, 1], 100, p=[0.9, 0.1]))),
        'var': [np.nan] * 100 + ['A'] * 100
    }
)
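The binning_process and scaling_method_params referenced below are not defined in the report; a minimal sketch of the assumed setup (a single-variable BinningProcess and a 0-100 min_max scaling range):

from optbinning import BinningProcess, Scorecard
from sklearn.linear_model import LogisticRegression

# Assumed setup, not part of the original report: bin the single column 'var'
# and scale the resulting scorecard points onto a 0-100 range.
binning_process = BinningProcess(variable_names=["var"])
scaling_method_params = {"min": 0, "max": 100}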

scorecard3 = Scorecard(binning_process=binning_process,
                       estimator=LogisticRegression(),
                       scaling_method="min_max",
                       scaling_method_params=scaling_method_params
                      ).fit(data, data.target, metric_missing=0, metric_special=0)

print(scorecard3.table(style='detailed'))

current behaviour

The points assigned to the missing bin in scorecard3.table(style='detailed') are some positive number, but they should be zero: there are only two bins, so with min_max scaling one bin should get 100 points and the other 0.
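For reference, min_max scaling maps the per-bin contributions affinely onto the configured [min, max] range, so with a single variable and only two bins the extreme bins must land exactly on the endpoints; a rough illustration of that arithmetic (hypothetical WoE values, not the optbinning implementation):

import numpy as np

# Illustrative arithmetic only, not the optbinning implementation: with
# metric_missing=0 the missing bin's contribution is forced to 0, and
# min/max rescaling of two distinct values always yields the endpoints.
woe = np.array([0.0, 2.2])   # missing bin, 'A' bin (hypothetical values)
lo, hi = 0, 100              # scaling_method_params = {"min": 0, "max": 100}
points = lo + (woe - woe.min()) / (woe.max() - woe.min()) * (hi - lo)
print(points)                # [  0. 100.]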

expected results

Simply configuring metric_special='empirical' gives the correct results, even though there are no special cases in the data or in the binning_process:

scorecard1 = Scorecard(binning_process=binning_process,
                       estimator=LogisticRegression(),
                       scaling_method="min_max",
                       scaling_method_params=scaling_method_params
                      ).fit(data, data.target, metric_missing=0, metric_special='empirical')

print(scorecard1.table(style='detailed'))

I have a fix for this; it is actually quite simple. The source code just ignores the metric_missing argument whenever metric_special != 'empirical'. However, I found no docs about where and how to add new tests in this project. Can anyone give me some info?
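The shape of the bug being described is roughly the following (hypothetical sketch with made-up names, not the actual optbinning source):

# Hypothetical sketch of the reported bug shape, not the actual optbinning
# source: the metric_missing substitution only happens on the
# metric_special == 'empirical' path, so any other metric_special value
# causes metric_missing to be silently ignored.
def missing_bin_metric(metric_special, metric_missing, empirical_value):
    if metric_special == "empirical":
        # this path honors metric_missing, hence the workaround above
        return metric_missing if metric_missing != "empirical" else empirical_value
    # buggy path: metric_missing is never consulted here
    return empirical_value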

@guillermo-navas-palencia (Owner) commented:

Thank you @peterpanmj. I commented on the pull request.

@guillermo-navas-palencia guillermo-navas-palencia added the bug Something isn't working label Dec 18, 2022
@guillermo-navas-palencia guillermo-navas-palencia added this to the v0.17.3 milestone Dec 18, 2022
guillermo-navas-palencia added a commit that referenced this issue Jan 15, 2023
fix metric_missing=0 is ignored in Scorecard._fit #226