
FIX overwriting metadata when both verified and unverified reported values #1186

Merged
merged 2 commits into main on Nov 15, 2022

Conversation

Wauplin
Contributor

@Wauplin Wauplin commented Nov 14, 2022

Should fix #1185.

With this PR, two eval results are considered to describe the same object if and only if all attributes except the metric value itself are equal (see `is_equal_except_value`). Otherwise, the value is not overwritten.
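As a rough sketch of that rule (the dataclass below is illustrative and much smaller than the real `EvalResult` in `repocard_data.py`), the comparison could look like:

```python
import dataclasses


@dataclasses.dataclass
class EvalResult:
    # Hypothetical, simplified eval result: only a few of the
    # real fields are shown here.
    task_type: str
    dataset_type: str
    metric_type: str
    metric_value: float
    verified: bool = False


def is_equal_except_value(a: EvalResult, b: EvalResult) -> bool:
    # Two eval results describe the same object iff every attribute
    # other than the metric value matches -- including `verified`,
    # which is what fixes #1185.
    for field in dataclasses.fields(a):
        if field.name == "metric_value":
            continue
        if getattr(a, field.name) != getattr(b, field.name):
            return False
    return True
```

With this check, a verified result never collapses into a self-reported one just because the task, dataset, and metric type happen to match.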

cc @lewtun for review

@HuggingFaceDocBuilder

HuggingFaceDocBuilder commented Nov 14, 2022

The documentation is not available anymore as the PR was closed or merged.

Member

@lewtun lewtun left a comment


Thanks a lot for the quick fix on this bug @Wauplin 🔥 !

I've tested the PR against my own model and can confirm it works as expected: https://huggingface.co/autoevaluate/binary-classification/discussions/58/files

The code itself also LGTM!

verified: true
---

This is a test model card.


Small suggestion to help community developers understand what this is about:

Suggested change
This is a test model card.
This is a test model card containing one self-reported metric and one "verified" metric in the format produced by Hugging Face's [model evaluation service](https://huggingface.co/spaces/autoevaluate/model-evaluator).
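For context, a hedged sketch of what such a card's metadata might look like in the `model-index` format, with one self-reported and one verified result for the same metric (the dataset and values here are made up for illustration):

```yaml
model-index:
- name: binary-classification
  results:
  - task:
      type: text-classification
    dataset:
      name: glue
      type: glue
    metrics:
    - type: accuracy
      value: 0.86        # self-reported by the model author
      verified: false
    - type: accuracy
      value: 0.8512      # produced by the model evaluation service
      verified: true
```

The bug in #1185 was that `metadata_update()` could merge these two entries, stamping `verified: true` onto the self-reported value.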

@nateraw
Contributor

nateraw commented Nov 14, 2022

Seeing failing test here:

FAILED ../tests/test_repocard.py::RepocardMetadataUpdateTest::test_update_existing_result_without_overwrite - AssertionError: Regex pattern did not match.
 Regex: "You passed a new value for the existing metric 'name: Accuracy, type: accuracy'. Set `overwrite=True` to overwrite existing metrics."
 Input: "You passed a new value for the existing metric 'name: Accuracy, type:  accuracy'. Set `overwrite=True` to overwrite existing metrics."
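The mismatch is the double space after `type:` in the produced message versus the single space in the test's pattern; a minimal reproduction (the strings mimic the failure, not the library code):

```python
import re

expected = r"type: accuracy"   # pattern the test asserts against (one space)
produced = "type:  accuracy"   # message actually emitted (two spaces)

# The literal single space in the pattern cannot match the double
# space in the message, so the search finds nothing.
match = re.search(expected, produced)

# A whitespace-tolerant pattern would accept either spelling.
tolerant = re.search(r"type:\s+accuracy", produced)
```

Either the emitted message or the test's expected regex needs its spacing fixed so the two agree.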

@codecov

codecov bot commented Nov 15, 2022

Codecov Report

Base: 84.00% // Head: 84.03% // Increases project coverage by +0.02% 🎉

Coverage data is based on head (706c629) compared to base (711f688).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1186      +/-   ##
==========================================
+ Coverage   84.00%   84.03%   +0.02%     
==========================================
  Files          44       44              
  Lines        4321     4327       +6     
==========================================
+ Hits         3630     3636       +6     
  Misses        691      691              
Impacted Files Coverage Δ
src/huggingface_hub/repocard.py 95.69% <100.00%> (-0.03%) ⬇️
src/huggingface_hub/repocard_data.py 98.49% <100.00%> (+0.08%) ⬆️


@Wauplin Wauplin merged commit 131fd35 into main Nov 15, 2022
@Wauplin Wauplin deleted the 1185-fix-metadata-update-insert-verified-true branch November 15, 2022 08:49
@julien-c
Member

pinging @coyotte508 and @allendorf to make sure this is also the behavior that makes sense considering the server-side implem of verified eval results

(i'm a bit fuzzy on this subject myself)

@lewtun
Member

lewtun commented Nov 15, 2022

> pinging @coyotte508 and @allendorf to make sure this is also the behavior that makes sense considering the server-side implem of verified eval results
>
> (i'm a bit fuzzy on this subject myself)

I think this will become clearer once I refactor AutoTrain to include the new model card API in https://github.com/huggingface/autotrain-backend/pull/823 :)

Wauplin pushed a commit that referenced this pull request Jan 25, 2024
* Disable tqdm progress bar if no TTY attached

When dockerized applications write to STDOUT/STDERR, the applications
can block due to logging back pressure (see
https://docs.docker.com/config/containers/logging/configure/#configure-the-delivery-mode-of-log-messages-from-container-to-log-driver).

HuggingFace's TGI container is one such example (see huggingface/text-generation-inference#1186).

Setting tqdm's `disable=None` will disable the progress bar if no tty is
attached and help to resolve TGI's issue #1186.

References:
    huggingface/text-generation-inference#1186 (comment)
    huggingface/text-generation-inference#1186 (comment)

* Disable tqdm progress bar if no TTY attached in lfs.py
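tqdm's `disable=None` means "auto": the bar is shown only when the attached stream is an interactive TTY. A stdlib-only sketch of the equivalent check (the function name here is illustrative, not tqdm's internal API):

```python
import io
import sys


def bar_disabled(stream=sys.stderr) -> bool:
    # Mirrors tqdm's disable=None behavior: suppress the progress bar
    # whenever the output stream is not an interactive TTY, e.g. when
    # a container's STDOUT/STDERR is piped to a Docker log driver.
    isatty = getattr(stream, "isatty", None)
    return not (callable(isatty) and stream.isatty())


if __name__ == "__main__":
    # An in-memory buffer is never a TTY, so the bar would be disabled.
    print(bar_disabled(io.StringIO()))
```

The actual change simply passes `disable=None` to the `tqdm(...)` calls in `huggingface_hub`, including the one in `lfs.py`, so containerized downloads no longer flood the log driver with progress-bar redraws.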
Development

Successfully merging this pull request may close these issues.

metadata_update() function inserts verified=True in self-reported evaluations
5 participants