layer: unimplemented support for evicting wanted deleted layers #6928
This comment was marked as outdated.
The problem is simpler: evicting a wanted_deleted layer always hangs.
koivunej added a commit that referenced this issue on Feb 29, 2024
…rics (#6131)

Because of bugs, evictions could hang and pause the disk usage eviction task. One such bug is known and fixed (#6928). Guard each layer eviction with a modest timeout, treating timed-out evictions as failures, to be conservative.

In addition, add logging and metrics recording on each eviction iteration:
- log collection completed with duration and number of layers
- per tenant collection time is observed in a new histogram
- per tenant layer count is observed in a new histogram
- record metric for collected, selected and evicted layer counts
- log if eviction takes more than 10s
- log eviction completion with eviction duration

Additionally, remove dead code for which no dead code warnings appeared in an earlier PR.

Follow-up to: #6060.
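The timeout guard described in the commit message can be illustrated with a small, std-only sketch. This is not the code from the commit: the names `evict_with_timeout` and `EvictionOutcome` are hypothetical, and a helper thread with a channel stands in for whatever async machinery the real eviction task uses. It only shows the idea of bounding each eviction and counting a timed-out one as a failure rather than waiting forever.

```rust
use std::sync::mpsc;
use std::thread;
use std::time::Duration;

/// Outcome of one guarded eviction attempt (hypothetical type for illustration).
#[derive(Debug, PartialEq)]
pub enum EvictionOutcome {
    Evicted,
    Failed,
    /// The eviction did not finish in time; the caller treats this as a failure.
    TimedOut,
}

/// Runs `evict` on a helper thread and waits at most `timeout` for its result.
/// `evict` is an assumed stand-in for a single layer eviction and returns
/// `true` when the layer was evicted.
pub fn evict_with_timeout<F>(evict: F, timeout: Duration) -> EvictionOutcome
where
    F: FnOnce() -> bool + Send + 'static,
{
    let (tx, rx) = mpsc::channel();
    thread::spawn(move || {
        // If the waiter already gave up, the receiver is gone; ignore the send error.
        let _ = tx.send(evict());
    });

    match rx.recv_timeout(timeout) {
        Ok(true) => EvictionOutcome::Evicted,
        Ok(false) => EvictionOutcome::Failed,
        Err(mpsc::RecvTimeoutError::Timeout) => EvictionOutcome::TimedOut,
        // The helper thread ended without sending, for example because `evict` panicked.
        Err(mpsc::RecvTimeoutError::Disconnected) => EvictionOutcome::Failed,
    }
}
```

With this shape, a caller iterating over candidate layers can keep going after a `TimedOut` result, so a single hanging eviction no longer stalls the whole iteration. Note that in this sketch the timed-out work keeps running on its helper thread; it is only the waiter that stops waiting.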
As seen on 2024-02-27, evictions started hanging, which caused deletions to hang as well; the disk usage based eviction task got stuck, and manual intervention was required.
The initial suspicion was wrong and only uncovered one obviously missing metric update, in #6931.
Later it turned out that the cause of the hangs was the inability of wanted deleted layers to communicate anything to Layer::evict_and_wait.
Slack channel: #temp-2024-02-27-stuck-disk-usage-based-eviction