UndefinedMetricWarnings while running classification/main_train.py on SEN12MS #52

Open

suryagutta opened this issue Mar 21, 2021 · 3 comments

@suryagutta (Collaborator):
Getting the following warnings. We need to investigate whether they affect the results; if they do, we need to fix them.

```
/home/taeil/anaconda3/envs/hptest/lib/python3.7/site-packages/sklearn/metrics/_classification.py:1493: UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 in labels with no true nor predicted samples. Use zero_division parameter to control this behavior.
  average, "true nor predicted", 'F-score is', len(true_sum)
/home/taeil/anaconda3/envs/hptest/lib/python3.7/site-packages/sklearn/metrics/_classification.py:1493: UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 in labels with no true nor predicted samples. Use zero_division parameter to control this behavior.
  average, "true nor predicted", 'F-score is', len(true_sum)
/home/taeil/anaconda3/envs/hptest/lib/python3.7/site-packages/sklearn/metrics/_classification.py:1245: UndefinedMetricWarning: Precision is ill-defined and being set to 0.0 in labels with no predicted samples. Use zero_division parameter to control this behavior.
  _warn_prf(average, modifier, msg_start, len(result))
/home/taeil/anaconda3/envs/hptest/lib/python3.7/site-packages/sklearn/metrics/_classification.py:1245: UndefinedMetricWarning: Recall is ill-defined and being set to 0.0 in labels with no true samples. Use zero_division parameter to control this behavior.
  _warn_prf(average, modifier, msg_start, len(result))
Validation microPrec: 0.540000 microF1: 0.540000 sampleF1: 0.540000 microF2: 0.540000 sampleF2: 0.540000
```

@suryagutta (Collaborator, Author) commented Mar 21, 2021:

> Precision is ill-defined and being set to 0.0 in labels with no predicted samples. Use zero_division parameter to control this behavior. Recall is ill-defined and being set to 0.0 in labels with no true samples. Use zero_division parameter to control this behavior.

This was done intentionally to raise a warning, based on the discussion in scikit-learn/scikit-learn#14876.
The code is in https://github.com/scikit-learn/scikit-learn/blob/main/sklearn/metrics/_classification.py
The corresponding pull request that was merged: scikit-learn/scikit-learn#14900

Summary from the sklearn/metrics/_classification.py code:
When ``true positive + false positive == 0``, precision is undefined. When ``true positive + false negative == 0``, recall is undefined. In such cases, by default the metric will be set to 0, as will f-score, and ``UndefinedMetricWarning`` will be raised. This behavior can be modified with ``zero_division``.
Code:

```python
# Divide, and on zero-division, set scores and/or warn according to
# zero_division:
precision = _prf_divide(tp_sum, pred_sum, 'precision',
                        'predicted', average, warn_for, zero_division)
recall = _prf_divide(tp_sum, true_sum, 'recall',
                     'true', average, warn_for, zero_division)

# warn for f-score only if zero_division is warn, it is in warn_for
# and BOTH prec and rec are ill-defined
if zero_division == "warn" and ("f-score",) == warn_for:
    if (pred_sum[true_sum == 0] == 0).any():
        _warn_prf(
            average, "true nor predicted", 'F-score is', len(true_sum)
        )
```
Basically, the default behavior is to set the metric to zero and show a warning. If we want, we can hide the warning using the zero_division flag. I don't think we need to change the behavior at present: it's only a warning, and if we hide it, we might miss important information in the future.
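
For illustration, a minimal sketch (toy multiclass labels, not from SEN12MS) of when the warning fires and what each zero_division setting does:

```python
from sklearn.metrics import precision_score

# Class 2 is never predicted, so its precision is 0/0 (undefined).
y_true = [0, 1, 2, 2]
y_pred = [0, 0, 1, 1]

# Default zero_division="warn": the undefined term counts as 0.0 and
# an UndefinedMetricWarning is emitted.
print(precision_score(y_true, y_pred, average="macro"))  # ~0.167, warns

# zero_division=0 gives the same score without the warning;
# zero_division=1 counts undefined terms as 1.0 instead.
print(precision_score(y_true, y_pred, average="macro", zero_division=0))  # ~0.167
print(precision_score(y_true, y_pred, average="macro", zero_division=1))  # 0.5
```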

@taeil (Collaborator) commented Mar 21, 2021:

> Basically, the default behavior is to set the metric to zero and show a warning. […]

One idea is to remove the classes that do not have samples; I'm not sure how complicated that would be.
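
A rough sketch of that idea, assuming multilabel indicator arrays as in our setup (the helper name is hypothetical, and this is untested against our metrics.py):

```python
import numpy as np
from sklearn.metrics import f1_score

def f1_ignoring_absent_labels(y_true, y_pred, average="micro"):
    """Drop label columns with no true and no predicted samples before
    scoring, so the F-score is well-defined for every remaining label."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    keep = (y_true.sum(axis=0) + y_pred.sum(axis=0)) > 0
    return f1_score(y_true[:, keep], y_pred[:, keep], average=average)
```

Note that dropping columns changes the set of labels being averaged over, so macro-style scores would no longer be comparable across runs with different class coverage.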


@suryagutta (Collaborator, Author):
The code is in https://github.com/Berkeley-Data/SEN12MS/blob/master/classification/metrics.py, which calls the sklearn.metrics functions for the different metrics. Those functions can take a zero_division parameter.
zero_division sets the value to return when there is a zero division: "warn", 0, or 1 (default="warn"). If set to "warn", it acts as 0, but warnings are also raised.
