Set number of retries per error code #2758
CI error is unrelated, see #2759
For I opened an issue: #2765
Also: the metrics include For the 4 datasets, we reached
See https://huggingface.slack.com/archives/C04L6P8KNQ5/p1714480880576699?thread_ts=1714033939.429809&cid=C04L6P8KNQ5 (internal)
Errors like `CreateCommitError` should always be retried, because they correspond to the Hub being down, which should always be a temporary situation. I set the retry limit to 30 in that case (instead of 3). I also set the limit to 30 for other error codes: `HfHubError`, `LockedDatasetTimeoutError`, `PreviousStepStillProcessingError`.
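
For illustration, the per-error-code limit described above could look roughly like the sketch below. It is not the code from this PR: `DEFAULT_MAX_RETRIES`, `MAX_RETRIES_BY_ERROR_CODE`, `max_retries_for` and `should_retry` are hypothetical names, and only the four error codes and the values 3 and 30 come from the description.

```python
# Hypothetical sketch: these names are illustrative, not the project's actual code.
DEFAULT_MAX_RETRIES = 3

# Error codes listed in the PR description get the higher limit of 30.
MAX_RETRIES_BY_ERROR_CODE = {
    "CreateCommitError": 30,
    "HfHubError": 30,
    "LockedDatasetTimeoutError": 30,
    "PreviousStepStillProcessingError": 30,
}


def max_retries_for(error_code: str) -> int:
    """Return the retry limit for an error code, falling back to the default."""
    return MAX_RETRIES_BY_ERROR_CODE.get(error_code, DEFAULT_MAX_RETRIES)


def should_retry(error_code: str, failed_runs: int) -> bool:
    """Retry while the number of failed runs is below the limit for this code."""
    return failed_runs < max_retries_for(error_code)
```

With a lookup table like this, raising or lowering the limit for one error code is a one-line change, and any code not listed keeps the default of 3.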