You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This follows on from #5050. Whereas #5050 makes it safe to attach a tenant to a new pageserver without detaching the old one, this epic will make a tenant migration seamless from postgres's point of view.
For failover (i.e. unplanned migration), only most recent LSNs become unavailable for reads. They become available again in a time bounded by the time taken for the new pageserver to download the writes since the remote_consistent_lsn on the old pageserver.
For planned migrations, postgres experiences no loss of read availability whatsoever.
The content you are editing has changed. Please copy your edits and refresh the page.
Before we can enable generation usage and these, we will need to have a way to restrict consumption_metrics.rs from a pageserver with previous generation number.
@jcsp do you have an issue somewhere on re-attach and maybe validate modifications? As far as I understood from the RFC, we should return mode in /re-attach response, but generation number there should also become optional for the secondaries then.
And it is also unclear what should we do about /validate? If it is called in the middle of migration by old primary, when new primary is already AttachedMulti, but compute not yet switched, then pageserver will invalidate tenant, what will happen then with not yet switched compute?
Including attachment states and/or secondary tenants in /re-attach responses is not essential for correctness here -- that's more of an optimization to enable tenants omitted in /re-attach to smoothly transition to secondary rather than detaching.
Motivation
This follows on from #5050. Whereas #5050 makes it safe to attach a tenant to a new pageserver without detaching the old one, this epic will make a tenant migration seamless from postgres's point of view.
See RFC: #5029
This ticket is the pageserver side of the work: control plane changes are described elsewhere:
DoD
For failover (i.e. unplanned migration), only most recent LSNs become unavailable for reads. They become available again in a time bounded by the time taken for the new pageserver to download the writes since the remote_consistent_lsn on the old pageserver.
For planned migrations, postgres experiences no loss of read availability whatsoever.
Tasks
The text was updated successfully, but these errors were encountered: