New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Local development: make triage and cleanup cron more reliable (not in production) #197
Comments
Do you see any files in if you see files, next thing to try is go to |
No. When you go to locahost:9000/upload-testcase do you see the testcase you uploaded? What happens when you click on it? That same testcase isn't showing up when you go to localhost:9000/testcsases? |
I saw lots of files in Because my bots run on localhost, so it should be not related to port forwarding. I will try triage cron. However, they didn't appear for many hours. I guess it won't help.
Right, I can see my uploaded testcase in /upload-testcase and the bots have already verified they crash. Some of them are labeled "Confirmed" and some of them are labeled "Duplicated". |
oops, they appear after triage cron manually! |
@kcwu - how long was your run_server running. did you see any stack that can tell how triage cron died ? can you start up a fresh run_server instance with --bootstrap, and retry just to see if triage cron can go down again ? |
My run_sever has run for more than 20 hours (before #177). The console log is longer than my terminal scrollback history (about 2.5 hours), so I don't have stack. Is the server log written to disk somewhere? From existing console log, there is only one "GET /triage" request in past 2.5 hours, which is my manual one. In other words, the cron stopped for some reason. I guess #177 make the cron down. |
More info, there are no "cron-service" messages in server's console log in past 2.5 hours. So it's not only triage cron down, it's cron fully stopped. After server restarted, I can see "cron-service" messages in sever's console log again. |
can you leave the server running for another 20 hours with fuzzing ongoing. lets see if triage goes down for you. you can check triage cron running in run_server log on console. |
Closing for now since #177 is unsupported usecase for local. If you can still reproduce it, we will reopen. |
This reproducible.
BTW, similar error happened for /cleanup as well. That is,
|
What caused the 503 ? Any explanation can help to fix. |
@kcwu @mhlakhani - this should be fixed now. please comment back if you see any similar issues. locally, crons should continue to work now, even in case of error. |
thanks @inferno-chromium! Will let you know here if I see it again. |
I am running local server and local bot.
I add a fuzzing job, upload a custom build. The bot runs and it generates few crashes and these cases appear in "Testcases" page. But it hadn't found certain one crashing testcase I found by other means.
So I uploaded the said crashing testcase (under "Upload Testcase" page)
It is verified by the bot that the testcase indeed crash.
This crashing testcase didn't appear in "Testcases" page. Is this expected behavior (uploaded case won't add into testcase collections) ?
I guess yes, so I continue next steps.
I uploaded a new custom build (under "Jobs" page, change file of existing jobs, then save). Just minor change which doesn't matter.
Add more bots and hope they will find the said crashing case by themselves faster.
The bots picked up the new build and did find the crashing testcase (*).
But I still don't see the crashing test case on "Testcases" page.
(*)
I know bots found the crash because of logs in the fuzzing log inside local/storage/local_gcs/test-fuzz-logs-bucket/objects/
The new testcase is stack-overflow, which is different "crash type" to existing groups (which are timeout, assert, integer-overflow, etc.). So it's not hidden due to grouping.
The text was updated successfully, but these errors were encountered: