separate model card git push from the rest #13514

elishowk · 2021-09-10T13:54:55Z

What does this PR do?

After model card metadata contents validation was deployed to the Hub, we need to ensure transformer's trainer git push are not blocked because of an invalid README.mld yaml.
as discussed with @julien-c @Pierrci @sgugger and @LysandreJik the first step to match Hub's model card validation system is to avoid failing a whole git push after training, for the only reason that README.md metadata is not valid.
therefore, I tried in this PR to git push the training result independently from the modelcard update, so that the modelcard update failing does not fail the rest, keeping only logging for README.Md push failures.
Relates to display git push warnings huggingface_hub#326

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

sgugger

Thanks for fixing this!

src/transformers/trainer.py

sgugger · 2021-09-10T15:49:50Z

src/transformers/trainer.py

+            self.create_model_card(model_name=model_name, **kwargs)
+            try:
+                self.repo.push_to_hub(commit_message="update model card README.md")
+            except Exception as exc:


Not sure whether it's best to catch the error and log it, or just let it be raised naturally. I don't have a strong opinion on this so let's see what others think!

Yes me neither IDK, it's an ergonomy issue because at this stage, the training has been pushed, so the question is what should we do with a faulty Readme ?

And without forgetting fixing model card metadata generation like with this issue #13528

@LysandreJik, any thoughts?

If this happens mid-training, I would advocate for a very visible warning that the model card is incorrect, rather than erroring out. However, if it's possible to generate the model card before the training/evaluation starts (I understand some values will be invalid such as evaluation results) and identify a potential failure there, then we could error out.

Definitely fine to just log the error though.

PS: Shouldn't the exception caught be a bit more specific than Exception?

About the warnings on git push : git push warnings are now logged as a logger.warning cf. https://github.com/huggingface/huggingface_hub/pull/326/files
As explained the docs https://huggingface.co/transformers/main_classes/trainer.html?highlight=trainer#logging, the only possibility (AFAIK) where a user could not see the warning is if her/his script using transformers sets the logLevel to error only. Elsewhere, the user gets the same warning as huggingface-cli users. Am i wrong ?

I think logging at the error level is fine. Can you remove the PR from draft and mrge it?

Done, please note that I mixed (may be I'm wring) a commit to update to the docs, because huggingface/model_card is gonna be deprecated soon I think

If not its place here, I can remove it

I resolved the conflict by adding blocking=blocking. Is it okay for you ?

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

julien-c · 2021-09-11T07:40:13Z

For tracking purposes, do we have another issue for datasets: - null @elishowk? (which will still fail with this, AFAICT)

elishowk · 2021-09-11T11:16:26Z

For tracking purposes, do we have another issue for datasets: - null @elishowk? (which will still fail with this, AFAICT)

Yep : #13528

elishowk · 2021-09-14T16:08:32Z

🥳 first PR on transformers, thanks for your help you all !

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

separate model card git push from the rest

adb38ca

elishowk added model card Related to pretrained model cards work in progress trainer labels Sep 10, 2021

elishowk requested review from LysandreJik and sgugger September 10, 2021 13:54

elishowk self-assigned this Sep 10, 2021

sgugger approved these changes Sep 10, 2021

View reviewed changes

Update src/transformers/trainer.py

73ce112

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

elishowk mentioned this pull request Sep 11, 2021

Trainer's create_model_card creates an invalid yaml metadata datasets: - null #13528

Closed

ramirezalexis approved these changes Sep 11, 2021

View reviewed changes

elias@showk.me added 2 commits September 14, 2021 16:47

catch only EnvironmentError in push_to_hub

9c20f81

point documentation on model card schema to the hub's

5255d1c

elishowk marked this pull request as ready for review September 14, 2021 15:40

Merge branch 'master' into fix-trainer-modelcard

3eabf1d

elishowk requested a review from sgugger September 14, 2021 16:02

LysandreJik approved these changes Sep 14, 2021

View reviewed changes

elishowk merged commit 054b601 into huggingface:master Sep 14, 2021

elishowk deleted the fix-trainer-modelcard branch September 14, 2021 16:07

Albertobegue pushed a commit to Albertobegue/transformers that referenced this pull request Jan 13, 2022

separate model card git push from the rest (huggingface#13514)

3aee58b

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Albertobegue pushed a commit to Albertobegue/transformers that referenced this pull request Jan 27, 2022

separate model card git push from the rest (huggingface#13514)

754c655

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

separate model card git push from the rest #13514

separate model card git push from the rest #13514

elishowk commented Sep 10, 2021 •

edited

sgugger left a comment

sgugger Sep 10, 2021

elishowk Sep 11, 2021

elishowk Sep 11, 2021

sgugger Sep 13, 2021

LysandreJik Sep 13, 2021

elishowk Sep 14, 2021 •

edited

sgugger Sep 14, 2021

elishowk Sep 14, 2021 •

edited

elishowk Sep 14, 2021 •

edited

julien-c commented Sep 11, 2021

elishowk commented Sep 11, 2021

elishowk commented Sep 14, 2021

separate model card git push from the rest #13514

separate model card git push from the rest #13514

Conversation

elishowk commented Sep 10, 2021 • edited

What does this PR do?

Before submitting

sgugger left a comment

Choose a reason for hiding this comment

sgugger Sep 10, 2021

Choose a reason for hiding this comment

elishowk Sep 11, 2021

Choose a reason for hiding this comment

elishowk Sep 11, 2021

Choose a reason for hiding this comment

sgugger Sep 13, 2021

Choose a reason for hiding this comment

LysandreJik Sep 13, 2021

Choose a reason for hiding this comment

elishowk Sep 14, 2021 • edited

Choose a reason for hiding this comment

sgugger Sep 14, 2021

Choose a reason for hiding this comment

elishowk Sep 14, 2021 • edited

Choose a reason for hiding this comment

elishowk Sep 14, 2021 • edited

Choose a reason for hiding this comment

julien-c commented Sep 11, 2021

elishowk commented Sep 11, 2021

elishowk commented Sep 14, 2021

elishowk commented Sep 10, 2021 •

edited

elishowk Sep 14, 2021 •

edited

elishowk Sep 14, 2021 •

edited

elishowk Sep 14, 2021 •

edited