roachtest: schemachange/random-load failed [revert stuck forever] #122659
It looks like these client-side goroutines were stuck waiting for a response:
This is the main thing that's suspicious from the CRDB logs:
That's logged at 18:23 on n2, which is when the operations on the cluster stopped. It happens again at 18:50 and 20:25, which I think is the schemachanger retrying the revert but failing in the same way. @fqazi do you think we should make "type with ID 1112 does not exist" a non-retriable error?
roachtest.schemachange/random-load failed with artifacts on master @ 5d02bd9ff6b2bccecf6d43fc6cd647167b91f782:
Parameters:
Same failure on other branches
Previously, when rolling back a type descriptor schema change, if the descriptor had already been dropped we would keep retrying the schema change. This happened because of a regression in which the internal structured error was replaced with a user-facing pgerror-based error. To address this, this patch properly handles the UndefinedObject pgcode and avoids retrying during the rollback of a typedesc schema change.

Fixes: cockroachdb#122958, cockroachdb#122659

Release note (bug fix): TYPEDESC SCHEMA CHANGE jobs could end up retrying forever if the descriptor targeted by them was already dropped.
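The idea behind the fix can be sketched roughly as follows. This is not the actual patch and does not use CockroachDB's own error packages. `codedError` and `isPermanentRollbackError` are hypothetical names invented for illustration, standing in for a user-facing error that carries a SQLSTATE code. The only fact assumed from outside this issue is that PostgreSQL's `undefined_object` code is `42704`.

```go
package main

import (
	"errors"
	"fmt"
)

// codeUndefinedObject is the PostgreSQL SQLSTATE for undefined_object.
const codeUndefinedObject = "42704"

// codedError is a hypothetical stand-in for a user-facing error that
// carries a SQLSTATE code. It is not a CockroachDB type.
type codedError struct {
	code string
	msg  string
}

func (e *codedError) Error() string { return e.msg }

// isPermanentRollbackError reports whether a rollback should stop retrying.
// A missing descriptor cannot reappear, so retrying the revert is pointless.
func isPermanentRollbackError(err error) bool {
	var ce *codedError
	if errors.As(err, &ce) {
		return ce.code == codeUndefinedObject
	}
	return false
}

func main() {
	// The error seen in the logs, wrapped as it might be by a caller.
	missing := fmt.Errorf("reverting: %w", &codedError{
		code: codeUndefinedObject,
		msg:  "type with ID 1112 does not exist",
	})
	transient := errors.New("connection reset")

	fmt.Println(isPermanentRollbackError(missing))   // true
	fmt.Println(isPermanentRollbackError(transient)) // false
}
```

Matching on the error code rather than on the message text means the check does not depend on the exact wording of the message. It still works when the error has been wrapped on the way up.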
roachtest.schemachange/random-load failed with artifacts on master @ 2d67111f0db7bec9bfed537542c60f37f8340f69:
Parameters:
ROACHTEST_arch=amd64
ROACHTEST_cloud=gce
ROACHTEST_coverageBuild=false
ROACHTEST_cpu=4
ROACHTEST_encrypted=false
ROACHTEST_fs=ext4
ROACHTEST_localSSD=true
ROACHTEST_metamorphicBuild=false
ROACHTEST_ssd=0
Help
See: roachtest README
See: How To Investigate (internal)
See: Grafana
Jira issue: CRDB-38016