-
Notifications
You must be signed in to change notification settings - Fork 18.1k
x/telemetry/internal/upload: TestRun_Concurrent failures #67737
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Found new dashboard test flakes for:
2024-05-22 23:36 x_telemetry-go1.21-darwin-amd64_13 telemetry@39ace7a4 release-branch.go1.21@9488a444 x/telemetry/internal/upload.TestRun_Concurrent (log)
2024-05-22 23:36 x_telemetry-go1.22-darwin-amd64_11 telemetry@39ace7a4 release-branch.go1.22@74565750 x/telemetry/internal/upload.TestRun_Concurrent (log)
2024-05-22 23:36 x_telemetry-go1.22-darwin-amd64_14 telemetry@39ace7a4 release-branch.go1.22@74565750 x/telemetry/internal/upload.TestRun_Concurrent (log)
2024-05-22 23:36 x_telemetry-go1.22-windows-amd64 telemetry@39ace7a4 release-branch.go1.22@74565750 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-06-03 22:45 x_telemetry-gotip-linux-arm telemetry@f2b69109 go@535f81e7 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-06-21 19:41 x_telemetry-gotip-darwin-amd64_11 telemetry@a740542b go@e8ee1dc4 x/telemetry/internal/upload.TestRun_Concurrent (log)
2024-06-21 19:41 x_telemetry-gotip-darwin-arm64_11 telemetry@a740542b go@e8ee1dc4 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-06-24 21:43 x_telemetry-go1.21-darwin-amd64-nocgo telemetry@210d439e release-branch.go1.21@dfb00670 x/telemetry/internal/upload.TestRun_Concurrent (log)
2024-06-24 21:43 x_telemetry-go1.21-darwin-amd64_14 telemetry@210d439e release-branch.go1.21@dfb00670 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Looks like a real race. I will take a look. |
Not a race: looks like a timezone / day boundary bug 🤦. |
Change https://go.dev/cl/596316 mentions this issue: |
Found new dashboard test flakes for:
2024-07-01 17:54 x_telemetry-go1.21-windows-arm64 telemetry@4e29c787 release-branch.go1.21@12e9b968 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-07-03 14:31 x_telemetry-gotip-windows-arm64 telemetry@5dc3c48f go@7d19d508 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-07-11 21:24 x_telemetry-go1.23-windows-arm64 telemetry@9d76c3d5 release-branch.go1.23@7dff7439 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Ah, I real race. Looking. |
Change https://go.dev/cl/598036 mentions this issue: |
With x/tools/cmd/stress, I was able to reproduce this failure easily. The CL above fixes the original locking logic to serve its intended purpose. But many more races exist around deleting counter files:
The fix above may reduce the flake rate to undetectable levels, but uploading is not concurrency safe. We need lock around the entire process of collecting, parsing, and aggregating reports, not just the final upload step, or give up on making uploading concurrency safe and rely on the Go command's existing guards. |
The upload locking logic left a brief race that could lead to duplicate uploads, as the existence of an uploaded report was checked before locking, not after. Fix this. As described in golang/go#67737, more races exist that could lead to broken or partial uploads, but this should prevent overcounting uploads, as was originally intended. This closes golang/go#65970, as it is the last thing I feel comfortable doing relative to the upload process. More refactoring will have to wait until 1.24. For golang/go#67737 Fixes golang/go#65970 Change-Id: Iadd58402bbe3fb32b4daf479d8d800eaef47c370 Reviewed-on: https://go-review.googlesource.com/c/telemetry/+/598036 LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com> Reviewed-by: Hyang-Ah Hana Kim <hyangah@gmail.com> Auto-Submit: Robert Findley <rfindley@google.com>
Found new dashboard test flakes for:
2024-07-12 20:54 x_telemetry-go1.21-linux-ppc64_power10 telemetry@1f628dd3 release-branch.go1.21@e073febe x/telemetry/internal/upload.TestRun_Concurrent (log)
2024-07-12 20:54 x_telemetry-go1.21-linux-ppc64_power8 telemetry@1f628dd3 release-branch.go1.21@e073febe x/telemetry/internal/upload.TestRun_Concurrent (log)
2024-07-12 20:54 x_telemetry-go1.22-linux-arm telemetry@1f628dd3 release-branch.go1.22@4e548f2c x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-07-19 15:28 x_telemetry-go1.22-linux-riscv64 telemetry@4b97e47a release-branch.go1.22@4c50f916 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-07-19 15:28 x_telemetry-go1.22-linux-loong64 telemetry@4b97e47a release-branch.go1.22@4c50f916 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-08-15 15:06 x_telemetry-go1.23-netbsd-arm64 telemetry@0693e624 release-branch.go1.23@76346b35 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-10-02 14:22 x_telemetry-gotip-linux-arm telemetry@338fe243 go@ce60f703 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2024-11-07 21:31 x_telemetry-gotip-netbsd-arm telemetry@d4f9d0e4 go@2b334342 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Found new dashboard test flakes for:
2025-05-07 14:13 x_telemetry-go1.23-linux-ppc64le_power9 telemetry@d42286b9 release-branch.go1.23@f77084d1 x/telemetry/internal/upload.TestRun_Concurrent (log)
|
Issue created automatically to collect these failures.
Example (log):
— watchflakes
The text was updated successfully, but these errors were encountered: