sqlbase: change default jobs ttl back to default (25h) #45767
Conversation
LGTM
Reviewable status:
complete! 0 of 0 LGTMs obtained (waiting on @ajwerner, @andreimatei, and @pbardea)
I'm on board. Do we have a test that creates a boatload of jobs that update themselves a ton? Might be nice.
Reviewable status:
complete! 0 of 0 LGTMs obtained (waiting on @ajwerner and @pbardea)
We changed the default TTL on the jobs table to 10min in reaction to issues where misbehaved jobs destabilized a cluster by causing the jobs range to become oversized due to MVCC garbage.

However, this low TTL breaks incremental backups. Previously this was not a significant issue for most users, as BACKUP was typically run only on user tables, but increasingly users want to back up the metadata in their system tables too, and 20.1 will showcase full-cluster backups that include system tables automatically.

This reverts the default TTL override for the jobs table, meaning clusters created on 20.1 will inherit the 25h TTL. It also means that if a user changes the default TTL to account for a different BACKUP cadence, they don't need to also update a second TTL for the jobs table.

Release note (general change): on new clusters, the internal system.jobs table now uses the default zone config and TTL (25h).
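For context, the TTL in question is part of the table's replication zone configuration. A sketch of how an operator could inspect or override it, using CockroachDB's `CONFIGURE ZONE` statements (the 90000-second value shown here corresponds to the 25h default this PR reverts to):

```sql
-- Inspect the current zone configuration for the jobs table.
SHOW ZONE CONFIGURATION FOR TABLE system.jobs;

-- Set an explicit GC TTL on system.jobs (90000s = 25h, the cluster default).
ALTER TABLE system.jobs CONFIGURE ZONE USING gc.ttlseconds = 90000;

-- Or remove the table-level override so system.jobs inherits the default
-- zone config, which is the effect this change gives new clusters.
ALTER TABLE system.jobs CONFIGURE ZONE DISCARD;
```

With the override gone, adjusting the cluster-wide default TTL (e.g. for a longer incremental BACKUP cadence) covers the jobs table automatically, with no second knob to remember.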
We don't -- any job that updated a ton has usually been fixed. I think it's usually a misbehaving job that gets us into trouble, and that isn't something we can really test for. We can test an over-full range, but that's independent of jobs, right?
LGTM
Reviewable status:
complete! 0 of 0 LGTMs obtained (waiting on @pbardea)
bors r+
Build succeeded