Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix 'Invalid Value Encountered' error during AutomLSearch #1346

Merged
merged 3 commits into from
Oct 26, 2020

Conversation

bchen1116
Copy link
Contributor

@bchen1116 bchen1116 commented Oct 26, 2020

Fix #1343

This is a highVarianceDataCheck error, rather than a LightGBM Error

Old docs here and new docs here

Note: The LightGBM warning messages are being addressed in this pr

@bchen1116 bchen1116 self-assigned this Oct 26, 2020
@codecov
Copy link

codecov bot commented Oct 26, 2020

Codecov Report

Merging #1346 into main will increase coverage by 0.01%.
The diff coverage is 100.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##             main    #1346      +/-   ##
==========================================
+ Coverage   99.95%   99.95%   +0.01%     
==========================================
  Files         213      213              
  Lines       13606    13609       +3     
==========================================
+ Hits        13599    13602       +3     
  Misses          7        7              
Impacted Files Coverage Δ
evalml/data_checks/high_variance_cv_data_check.py 100.00% <100.00%> (ø)
...a_checks_tests/test_high_variance_cv_data_check.py 100.00% <100.00%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update fe73ad5...9361e75. Read the comment docs.

@bchen1116 bchen1116 marked this pull request as ready for review October 26, 2020 16:33
Copy link
Contributor

@jeremyliweishih jeremyliweishih left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@@ -37,7 +37,10 @@ def validate(self, pipeline_name, cv_scores):
cv_scores = pd.Series(cv_scores)

messages = []
high_variance_cv = abs(cv_scores.std() / cv_scores.mean()) > self.threshold
if cv_scores.mean() == 0 and cv_scores.std() == 0:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would this work if we only check if the mean is equal to 0?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mm yeah I think so. Since these are scores, if the mean is 0, the std is also 0. Thanks for pointing that out!

@bchen1116 bchen1116 merged commit ba5863e into main Oct 26, 2020
@dsherry dsherry mentioned this pull request Oct 29, 2020
@freddyaboulton freddyaboulton deleted the bc_1343_value branch May 13, 2022 15:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

LightGBM "invalid value encountered" warning during fit
2 participants