GCB BigQuery notifier is crashing #6580
Comments
s/not crashing/crashing/? |
yes, fixed typo. |
I restarted everything and it seems to be working now. Not sure why. |
This is still happening (or, happening again): I don't know why or what's going on w/ this notifier. But I suspect the ultimate answer is that we should stop using that one and write our own notifier in C++ following the pattern we used for our logs and alerts notifiers. To be clear, I think what we probably want is:
|
I found the following golang stack trace in the logs:
|
FWIW, the script is deploying the "latest" version of the docker image: maybe try a previous version?
|
FYI, the GCB folks are looking into the crash of this BQ notifier. The crash comes from this line: `startTime, err := parsePBTime(step.Timing.StartTime)`. The current guess is that not all build steps have timing info, we adding some |
Good news: @prabenzom submitted a few fixes to the BigQuery notifier and fixed the panic crashes. Yay! Thank you very much!!! Bad news: The BQ notifier is still dropping a lot of updates. The reason seems to be that non-SUCCESS build alerts are often missing
So it looks like our BQ data is missing info for all or most failed builds. @prabenzom suggested inserting some default timestamp if none is present, which seems fine to me (if it results in the BQ inserts succeeding). Not sure if/when they'll have time to do this. |
The new issue filed for the BQ notifier dropping failed statuses is GoogleCloudPlatform/cloud-build-notifiers#115 |
@prabenzom fixed the issue in GoogleCloudPlatform/cloud-build-notifiers@f246eff#diff-243269b0805de328ce233361b6a6e94d971e1d18e7727433c832fd42e4c19f28 Yay!! Our logs now look clean, and 2500 unack'd pubsub messages are now being processed: So I think this issue is fixed, and I'm hopeful that our BQ data and dashboards will now be accurate. |
The BigQuery notifier job (https://github.com/googleapis/google-cloud-cpp/blob/main/ci/cloudbuild/notifiers/bigquery/README.md) is crashing on startup and not working. This is a community notifier that we run to write GCB notifications to a BigQuery table. It was working originally; now it's failing. I'm not sure why. We didn't write it, and it's written in Go, so I can't debug it quickly.
I'm disabling this notifier until we can get it working again. It may be easier to just rewrite this thing using the C++ functions framework.