-
Notifications
You must be signed in to change notification settings - Fork 3.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
20.1 release blockers list #45599
Comments
Added kv: idempotency failure across implicit commit breaks refreshing a parallel commit request #46341 |
Added sql: rollback to savepoint broken in mixed-version (19.2/20.1) clusters #46372 |
Added importccl,gcjob: failed and canceled imports don't seem to clean up their data #46684 |
Added kvserver: v20.1.0-beta.3: received ... results, limit was ... #46652 |
I am mitigating #46652 by turning it into a sentry-reported error instead of a crash (https://github.com/cockroachdb/cockroach/pull/46720/files). This may not be enough to remove it from the release blocker list, though. |
@andreimatei looks like you didn't actually add that issue to the list? Done now, though. |
Added the following three blockers:
|
Adding #46715 (PR to fix bug where table/index GC could be delayed indefinitely) |
Added sql telemetry fixes as a "soft" blocker. |
Added Admin UI/Observability blockers. |
@dhartunian none of the diagnostics-related ones are on there.. Most importantly #46331. Are we not planning to finish those? |
Added statement diagnostics bundle download to Admin UI release blockers as per @RaduBerinde's comment above. |
Adding #46792 which is a test-only change that fixes a flake. |
Adding #46818. |
Adding two more regressions on our end. Both have PRs already up for review.
|
Adding #46752 to the list. |
I've added "sql: recent regression in stopper quiescence time #47011 [knz]". Depending on what ends up being the problem, it might turn out to not be a release blocker, but also perhaps it will be. |
|
Added sql: WITH HASH hash column is not null but formula can return NULL #47055 |
Added "kv: invalid Raft truncation decision panic in cli unittests #43605", which I think can cause panics out in the wild. |
… On Mon, Apr 6, 2020 at 11:36 AM irfan sharif ***@***.***> wrote:
Added "kv: invalid Raft truncation decision panic in cli unittests #43605
<#43605>", which I think
can cause panics out in the wild.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#45599 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ANA4FVFYM7752C6NBGQTB6LRLHZIPANCNFSM4K73PXKA>
.
|
I think so, I'm typing it up today. I was just going to downgrade the panic. |
Checking off: kvserver: v20.1.0-beta.3: received ... results, limit was ... #46652 [tbg] @nvanbenschoten will give this bug another look to see if he can spot a bug but either way we don't want to block the release on it. Previous code inspection plus randomized testing was not able to reproduce it. Not satisfying but better than not releasing. |
We are all clear on the release blockers list; starting rc1 |
Added the need to backport #47165 as a release blocker (it doesn't need to go into rc1 though). |
Added: system.namespace unreadable from SQL from 19.2 nodes in mixed-version state #47167 This issue causes problems for 19.2 nodes in a mixed-version cluster. The nodes will not be able to read from |
adding #47156 - not important for RC. |
#47167 is a hard blocker -- it probably doesn't block a beta, but it does block release, so that means any sha we start qualifying right now would be considered a beta, not an RC. |
cc @miretskiy re above w.r.t. #47167 and RC.1 |
Adding #47219 - which is an old bug responsible for a new assertion failing. |
@nvanbenschoten is 47219 rc1 blocker as well? |
Adding #44453. |
I don't know exactly how that's being defined. It is a hard release blocker, so it should block the final release. But I don't think it should stop any intermediate release. Whether this means we should consider a currently qualifying release as a beta instead of an RC is up for discussion (#45599 (comment)) |
Also, adding "kv: avoid excessively wide range tombstones during Raft snapshot reception" deemed unsafe #44048 (comment) as a hard release blocker. We need to back that change out of the release, as it's too risky to do anything more with it at this point in the release cycle and it hypothetically risks replica inconsistencies. |
Adding: sql: upgrade from 19.x to 20.x eliminates all computed columns #47263 |
Added sql: hard scan limit removed incorrectly #47283. ETA for a fix is today |
Adding #47324 as a possible blocker. |
Adding the need to backport #47365 (sql: ignore soft limits on scan nodes for distsql planning) as a release blocker (not RC blocker though). |
Adding the need to backport #47350. I think we could technically release without this backport as it only affects Pebble, but we'd have to recommend against using Pebble and it would set back our rollout plans for Pebble. These fixes should only affect Pebble. |
Adding #47425 for release blocker - not important for RC. |
Adding #47187 as a hard release blocker. |
Added "kv: txn recovery false positive #47337 [andrei]". It's a bug causing transactions to be wrongfully committed, so it's quite bad I think. |
Adding #47471 as a soft release blocker. It's an old bug, but it's worse now that we perform significantly more ranged intent resolution. The fix is targetted and should be up today. |
Added #47406 which is fixed by cockroachdb/pebble#629. There is a subtle incompatibility between RocksDB and Pebble bloom filters which was causing the failures in #47406. While we've only seen problems in a test which switches back and forth between RocksDB and Pebble, I'm very anxious this could be causing other rare problems as Pebble is used to create sstables used for ingestion (e.g. during rebalancing, import, and restore) even with RocksDB is the storage engine in use. |
considering 20.1 released. |
PSA: The 20.1 release branch is now cut. The lucky winner SHA is 1225203.
As we entering an important stabilization period, do not backport anything into the release-20.1 branch unless it is part of the release blockers list. Please also add a comment if it blocks the beta releases.
Possible blockers for the 20.1 release.
General
AppDev
Bulk IO
nodelocal upload
needs to require admin user #46222 [@dt]ERROR: invalid previous backups (a new full backup may be required if a table has been created, dropped or truncated): no backup covers time [1586054134.368267700,0,1586054324.310442218,0) for range [/Table/2/1,/Table/2/2) (or backups out of order)
#47050Admin UI (Observability)
latency-parse
class name in non-latency settings ui: refactor usage oflatency-parse
class name in non-latency settings #46445 (poorly worded but this is a regression)Server / UI
version not initialized
sql: crash when connecting to a running cluster withversion not initialized
#46410 [ajwerner]KV
TxnCoordMeta
refactor broke retry/savepoint logic kv:TxnCoordMeta
refactor broke sqlalchemy retry/savepoint logic #45477SQL Schema
alter primary key roachtest failure roachtest: alterpk-tpcc failed #45812High priority bug, but not a release blocker per [yahor & jordan]sql: WITH HASH hash column is not null but formula can return NULL sql: WITH HASH hash column is not null but formula can return NULL #47055Removing from blockers list per @ajwerner (feature gated by flag)SQL Execution
SQL Planning
Storage
Performance Regressions
The text was updated successfully, but these errors were encountered: