Fix performance issue with sample weights in model.fit() #17357
Conversation
Ensure that handle_partial_sample_weights receives a list-like instead of a tensor.
Pending another reply from @fchollet.
LGTM, thanks
Unfortunately I'm not able to merge, as I'm seeing a lot of test failures:
Can you take a look?
@nershman Would you please take a look at the test failures?
Hi @nershman Can you please check @haifeng-jin's comments and keep us posted? Thank you!
Hi @nershman Any update on this PR? Please. Thank you!
Hi, I have been so busy with work recently, sorry. I have some notes on this and I'll look deeper into it this weekend.
Modify the partial weights check instead of changing the shape of sample_weight, which causes issues downstream.
Wrapping the weights was causing issues further down in the function. I just added a case to the partial sample check at the beginning instead.
Tests pass on my machine now. (data_adapter_test, training_test, metrics_correctness_test, temporal_sample_weights_correctness_test)
Simplified the logic a bit. The first check will always return true because even (None,) != None.
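The point about `(None,) != None` can be verified directly. A minimal sketch (variable names are illustrative, mirroring the checks discussed in this PR):

```python
sample_weights = (None,)

# A non-empty tuple never compares equal to None, so a guard like
# `sample_weights != None` is True for every tuple input and adds nothing.
assert (sample_weights != None) is True

# The meaningful checks are element-wise over the list/tuple:
any_sample_weight = any(w is not None for w in sample_weights)
partial_sample_weight = any_sample_weight and any(w is None for w in sample_weights)

assert any_sample_weight is False
assert partial_sample_weight is False
```

This is why the redundant equality check can be dropped in favor of the element-wise tests alone.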
I am approving this PR to see if the internal tests pass.
Is it reasonable to try to have NumPy process the weights directly if possible, and in doing so give any non-finite weight the same treatment as if `sample_weights` is None:

```
if sample_weights is None:
    any_sample_weight = False
    partial_sample_weight = False
else:
    try:
        sample_weights_isfinite = np.isfinite(sample_weights)
        any_sample_weight = np.any(sample_weights_isfinite)
        if any_sample_weight:
            partial_sample_weight = not np.all(sample_weights_isfinite)
            if partial_sample_weight:
                new_sample_weights = np.nan_to_num(
                    sample_weights, nan=1.0, posinf=1.0, neginf=1.0
                )
                return new_sample_weights, any_sample_weight, partial_sample_weight
    except TypeError:
        if not isinstance(sample_weights, (list, tuple)):
            any_sample_weight = True
            partial_sample_weight = False
        else:
            any_sample_weight = any(w is not None for w in sample_weights)
            partial_sample_weight = any_sample_weight and any(
                w is None for w in sample_weights
            )
if not any_sample_weight:
    return None, any_sample_weight, partial_sample_weight
if not partial_sample_weight:
    return sample_weights, any_sample_weight, partial_sample_weight
```
Imported from GitHub PR #17357

I previously had a PR open for this, but I guess it got automatically closed when I reverted my commits... Previous PR: #16177

@gbaned @fchollet Since the way DataAdapter works is not clear to me, I went back to `training_utils.handle_partial_sample_weights`. The function is being passed a tensor when it should be passed a list. I think we can simply add a typecheck: if a tensor is passed, we wrap it in a list. This will fix the slowdown and also make sure the function checks that sample_weights correspond to inputs and outputs, instead of checking every single sample in the tensor. i.e.

```
if not isinstance(sample_weights, (list, tuple)):
    sample_weights = (sample_weights,)
```

And this will work fine: when the `[sample_weights]` workaround is used in `model.fit()`, this is exactly what it does; it causes a tuple of one tensor to be passed to the function instead of just a tensor. How does that sound?

Copybara import of the project:

--
083b213 by Sherman <sma232@gmail.com>:

Update training_utils.py

Ensure that handle_partial_sample_weights receives a list-like instead of a tensor.

--
82130f7 by Sherman <sma232@gmail.com>:

Update training_utils.py

Modify the partial weights check instead of changing the shape of sample_weight, which causes issues downstream.

--
a2d5ea9 by Sherman <sma232@gmail.com>:

Update training_utils.py

Simplified the logic a bit. The first check will always return true because even (None,) != None.

--

Merging this change closes #17357

FUTURE_COPYBARA_INTEGRATE_REVIEW=#17357 from nershman:master a2d5ea9
PiperOrigin-RevId: 522355426
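The typecheck described above can be sketched as follows. The helper name is hypothetical (the real change lives inside `handle_partial_sample_weights`), and a NumPy array stands in for a tensor:

```python
import numpy as np

def normalize_sample_weights(sample_weights):
    # Hypothetical helper illustrating the fix: a bare array/tensor is
    # wrapped in a one-element tuple so downstream checks iterate over
    # per-output entries rather than over every sample in the tensor.
    if not isinstance(sample_weights, (list, tuple)):
        sample_weights = (sample_weights,)
    return sample_weights

weights = np.ones(100_000)  # one weight per sample, single-output model
wrapped = normalize_sample_weights(weights)

# Element-wise checks now touch 1 item instead of 100,000 samples,
# which is the source of the reported slowdown.
assert len(wrapped) == 1
assert all(w is not None for w in wrapped)
```

This mirrors what the `[sample_weights]` workaround in `model.fit()` achieves by hand.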
@chuckatkins I think you should make a separate bug report for this; I'm not familiar with what you're trying to fix. But my concern with using NumPy would be creating issues with eager execution.