Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When a Pod is deleted outside of ECK (K8s node maintenance, upgrades, etc.) cold tier data isn't maintained on the node #6444

Closed
SpencerLN opened this issue Feb 22, 2023 · 2 comments
Labels
>enhancement Enhancement of existing functionality

Comments

@SpencerLN
Copy link

Bug Report

What did you do?

Our ECK clusters are running on GKE, which frequently automatically applies updates or performs other maintenance, resulting in K8s nodes being replaced, pods being moved, etc. When this occurs for a Hot/Warm tier ES node this process is quick and the data is maintained since the data is on network-attached storage.

For Cold tier ES nodes, the shards are immediately re-allocated to other nodes and the data is re-downloaded from the backing snapshot repository instead of reloaded from the network-attached storage where the data still exists.

What did you expect to see?

Since the data is still on the PV, and the PV is re-attached to the new pod, I expect that the ES cluster should use the local data instead of restoring it from the snapshot repository.

What did you see instead? Under which circumstances?

The cluster immediately reallocated the cold tier shards to other cold tier nodes and they download the data from the snapshot repository, even after the ES node containing the data rejoins the cluster.

This behavior does not occur when ECK recreates/restarts the ES node as part of a rolling upgrade.

Environment

  • ECK version:

    2.6.1

  • Kubernetes information:

    insert any information about your Kubernetes environment that could help us:

    • Cloud: GKE - 1.22

It isn't clear to me whether this is a bug in ECK or in Elasticsearch. Either way, it doesn't seem to be the expected/desirable behavior.

elastic/elasticsearch#93114

@botelastic botelastic bot added the triage label Feb 22, 2023
@pebrc
Copy link
Collaborator

pebrc commented Mar 2, 2023

#6478 should address this if I understand the problem correctly

@pebrc pebrc added the >enhancement Enhancement of existing functionality label Mar 2, 2023
@botelastic botelastic bot removed the triage label Mar 2, 2023
@pebrc
Copy link
Collaborator

pebrc commented Mar 26, 2023

Closing this as we have #6544 which should solve the issue.

@pebrc pebrc closed this as completed Mar 26, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
>enhancement Enhancement of existing functionality
Projects
None yet
Development

No branches or pull requests

2 participants