[discuss] Support for rolling upgrades #41795

tylersmalley · 2019-07-23T16:39:36Z

Much like with Elasticsearch, users should be able to perform a rolling upgrade of Kibana. This will be limited because Kibana does not have a true cluster state, but we should be able to greatly improve the experience.

When Kibana is upgrading, we perform any pending migrations as described here. This creates a new index, and upon completion of the migrations points the .kibana alias to the new index. At this point, it's important that the previous versions of Kibana not make any mutations to this index.

Here is the change I am proposing:

On startup, the Kibana server will resolve the index underlying the alias and read from and write to that index rather than the alias. The first step of the migration process is to put this index into a read-only state, preventing writes. In the UI, we can poll and notify the user in two scenarios: the index this instance is using no longer matches the index underlying the alias, or the index is read-only. This allows a newer version of Kibana to be stood up while the existing instance remains functional in a read-only state. We can extend the health check to cover these conditions to assist with automation on a load balancer.
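A minimal sketch of the two checks described above, not actual Kibana code. It assumes the usual shape of an Elasticsearch `GET /_alias/.kibana` response (concrete index name mapped to its aliases) and that "read-only" is detected through a write block such as the `index.blocks.write` setting. The function names, the status values, and the `.kibana_1`/`.kibana_2` index names are hypothetical.

```typescript
// Assumed shape of `GET /_alias/<alias>`: concrete index name -> its aliases.
type AliasResponse = Record<string, { aliases: Record<string, unknown> }>;

// Hypothetical status values a health check or UI poll could report.
type UpgradeStatus = "ok" | "index_read_only" | "alias_moved";

// Resolve the single concrete index behind `alias`; throw if there is not exactly one.
function resolveConcreteIndex(response: AliasResponse, alias: string): string {
  const matches = Object.entries(response)
    .filter(([, info]) => alias in info.aliases)
    .map(([index]) => index);
  const [only, ...rest] = matches;
  if (only === undefined || rest.length > 0) {
    throw new Error(`expected exactly one index behind ${alias}, found ${matches.length}`);
  }
  return only;
}

// Compare the index pinned at startup with the alias's current target,
// then check whether writes to the pinned index are blocked.
function checkUpgradeStatus(
  pinnedIndex: string,
  currentAliasTarget: string,
  writeBlocked: boolean
): UpgradeStatus {
  if (pinnedIndex !== currentAliasTarget) return "alias_moved";
  if (writeBlocked) return "index_read_only";
  return "ok";
}

// Usage: the instance pinned `.kibana_1` at startup; a newer instance has
// since put a write block on it but has not moved the alias yet.
const pinned = resolveConcreteIndex({ ".kibana_1": { aliases: { ".kibana": {} } } }, ".kibana");
const status = checkUpgradeStatus(pinned, ".kibana_1", true);
```

How a load balancer should treat each status (for example, whether a read-only instance still receives traffic) is left open here, as it is in the proposal.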

Down the road, when we have a cluster state, it should be possible to re-route requests to the most recent version of Kibana.


elasticmachine · 2019-07-23T16:39:38Z

Pinging @elastic/kibana-platform

elasticmachine · 2019-07-23T16:39:39Z

Pinging @elastic/kibana-operations

epixa · 2019-07-23T18:33:31Z

Security permissions are another critical area that would need to be addressed to support this. Today, when Kibana starts up it pushes all the necessary privileges into Elasticsearch in order to support that exact version of Kibana, which can cause two problems in a rolling upgrade scenario:

An older instance might behave unexpectedly if its underlying permission model is wiped out.
An older instance restarting after a newer instance is brought up will override the newer instance's permission model.

cc @elastic/kibana-security

kobelb · 2019-07-24T15:34:02Z

The way it works presently is that the last version of Kibana to start up wins, and essentially locks out all other versions from being able to authorize users. Are we only concerned with supporting "rolling upgrades", or should we be concerned with supporting potential roll-backs as well?
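To make the "last to start up wins" behaviour concrete, here is a toy model, not the real security plugin. `PrivilegeStore` is a hypothetical stand-in for the privileges Kibana pushes into Elasticsearch on startup, and the version strings are made up for illustration.

```typescript
// Toy stand-in for the privileges stored in Elasticsearch: a single slot
// that every Kibana startup overwrites with its own version's model.
class PrivilegeStore {
  private registeredBy: string | null = null;

  // Models an instance pushing its privileges on startup (last writer wins).
  registerOnStartup(kibanaVersion: string): void {
    this.registeredBy = kibanaVersion;
  }

  // In this model an instance can authorize users only while the stored
  // privileges are the ones its own version registered.
  canAuthorize(kibanaVersion: string): boolean {
    return this.registeredBy === kibanaVersion;
  }
}

const store = new PrivilegeStore();
store.registerOnStartup("7.2.0"); // old instance starts
store.registerOnStartup("7.3.0"); // new instance starts during the rolling upgrade
const oldLockedOut = !store.canAuthorize("7.2.0"); // old instance loses its model

store.registerOnStartup("7.2.0"); // old instance restarts (or a roll-back happens)
const newLockedOut = !store.canAuthorize("7.3.0"); // new instance is overridden
```

Both failure modes listed earlier in the thread show up in this model: the older instance is locked out as soon as the newer one starts, and an older instance restarting overrides the newer one.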

LeeDr · 2019-12-03T16:42:20Z

When the new version of Kibana starts up, it should go through the migration process (writing directly to the new index and not changing the alias yet).

If something fails, abort and log a detailed message. The existing version keeps running.

If the migration succeeds, it compares the timestamp of the most recent change to that index (this might be something new we need?) to the timestamp when the migration started.

If there were no changes to the old index since the migration started, we know we can swap the alias to point to the new migrated index and shut down the old Kibana version (and change the proxy redirect on Cloud).
If there were changes to the old index since the migration started, the new version should log a message about how long the migration took, and try again. If new writes keep happening on the old index after some number of attempts, at some point we need to switch it to read-only (maybe as a manual administrator action?).
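The retry loop described in this comment could be sketched as follows. This is an illustration under assumptions, not an implementation: every hook in `MigrationHooks` is hypothetical and injected, the "timestamp of the most recent change" is something Kibana might not have today (as noted above), and the hooks are synchronous only to keep the example short.

```typescript
type Outcome = "swapped" | "aborted" | "needs_read_only";

// Hypothetical hooks; a real implementation would call Elasticsearch asynchronously.
interface MigrationHooks {
  now(): number;                 // current time in ms
  migrate(): boolean;            // write migrated docs to the new index; false on failure
  lastWriteToOldIndex(): number; // ms timestamp of the latest change to the old index
  swapAlias(): void;             // point the alias at the new index
  log(message: string): void;
}

function attemptRollingMigration(hooks: MigrationHooks, maxAttempts: number): Outcome {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const startedAt = hooks.now();
    if (!hooks.migrate()) {
      // Failure: abort and leave the alias alone, so the existing version keeps running.
      hooks.log(`migration attempt ${attempt} failed, aborting`);
      return "aborted";
    }
    // A write at or after the start time counts as a change (conservative).
    if (hooks.lastWriteToOldIndex() < startedAt) {
      hooks.swapAlias();
      return "swapped";
    }
    const tookMs = hooks.now() - startedAt;
    hooks.log(`old index changed during attempt ${attempt} (took ${tookMs} ms), retrying`);
  }
  // Writes kept arriving: the old index has to be made read-only first.
  return "needs_read_only";
}
```

One gap in this sketch is that a write can still land between the timestamp check and the alias swap. Closing that window is what the read-only step from the original proposal is for.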

rudolf · 2020-12-14T16:14:09Z

From #52202:

Note: Rolling upgrades introduce significant complexity for plugins and risk of bugs. We assume that as long as the downtime window is predictable, downtime as such is not a problem for our users. Since this allows us to have a dramatically simpler system we won't aim to implement rolling upgrades unless this assumption is proven wrong.

tylersmalley added the discuss, Team:Core, and Team:Operations labels Jul 23, 2019

tylersmalley removed the Team:Core label Jul 23, 2019

tylersmalley mentioned this issue Mar 22, 2020

Kibana Helm Charts #59842

Closed

1 task

tylersmalley mentioned this issue May 13, 2020

Rolling upgrade from 6.5.4 to 6.6.1 do not work well #32341

Closed

rudolf closed this as completed Dec 14, 2020
