Thanos compactor should retry failed bucket operations instead of retrying the whole compaction #6111

@alexqyle

Description

Is your proposal related to a problem?

Not all bucket clients that Thanos uses retry failed operations internally. Currently, when a bucket operation fails, Thanos retries the whole compaction iteration. This is not ideal when only a single bucket operation fails during compaction: for a relatively big compaction, a lot of time is wasted compacting everything again because of one failed bucket operation.

The following are retriable bucket failures in the Thanos compactor:

Describe the solution you'd like

We could add a bucket client wrapper on top of each bucket client implementation, with configurable retry logic that retries a failed bucket operation on the underlying bucket client.
