How to clean up before re-running a test #984
Comments
Follow-up, I deleted the cache folders related to that test in
Oh, actually there is a `--rerun` flag to force a rerun even when results exist. Can you please try it?
Hmm, it doesn't seem to work with this command.
On 2023-10-24 16:56, Arjun Suresh wrote:
> Oh, actually there is a `--rerun` flag to force rerun even when results exist. Can you please try it?
This should be the command to get a valid result. If the estimated
Ah yes, I think this is what I was missing. It is now running and looks busy, as expected.
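For anyone scripting repeated runs, here is a minimal sketch of appending the `--rerun` flag mentioned above to an existing command. The helper name `with_rerun` and the placeholder command are assumptions for illustration only; substitute the exact command from the README you followed. Note that, as reported earlier in this thread, `--rerun` alone did not seem to help with the original command.

```python
import shlex


def with_rerun(command: str, flag: str = "--rerun") -> list[str]:
    """Split a shell-style command and append `flag` once if it is missing."""
    args = shlex.split(command)
    if flag not in args:
        args.append(flag)
    return args


# Placeholder command (an assumption, not taken from this thread):
# replace it with the command you actually ran.
base_command = "cm run script --tags=PLACEHOLDER"

# Print the command with the flag appended; copy it into a shell,
# or pass the list form to subprocess.run if you prefer.
print(shlex.join(with_rerun(base_command)))
```

The helper leaves the command unchanged if the flag is already present, so it is safe to apply more than once.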
Hi,
I have followed the doc to the point where I can run the first example of the BERT test with CUDA:
https://github.com/mlcommons/ck/blob/master/docs/mlperf/inference/bert/README_nvidia.md
It seems to work: I see the GPU is busy for a while (probably 30+ minutes).
But when it completes and I re-run the exact same command, it finishes in about 40 seconds and all the results are marked as INVALID.
So my question is whether we need to run some kind of cleanup command between runs, or whether I am encountering a bug somewhere.
I have attached a small portion of the output. I see an error in there, but I don't know if it's fatal.
bert-log.txt
Thanks!
Julien