Skip to content

HBASE-29800 WAL logs are unprotected during first full backup#7717

Open
DieterDP-ng wants to merge 1 commit intoapache:masterfrom
DieterDP-ng:HBASE-29800
Open

HBASE-29800 WAL logs are unprotected during first full backup#7717
DieterDP-ng wants to merge 1 commit intoapache:masterfrom
DieterDP-ng:HBASE-29800

Conversation

@DieterDP-ng
Copy link
Contributor

The BackupLogCleaner prevents WAL files that are needed for future backups from being deleted. In the case where a backup root has a single running backup, there was a small timeframe where relevant files were unprotected because only completed backups were taken into consideration. This commit fixes this.

The old mechanism relied on the "backup start code", which is a timestamp that denotes (per backup root) the lowest (earliest) log-roll timestamp that occurred for the backup. Because this concept had no added value, but is complex to reason about, it is removed. Usages are replaced with equal behavior based on timestamps stored in the backup info. (The backup start codes were calculated in the same way, just stored separately.)

Note that the backup start code calculation suffers from HBASE-29628 (log-roll timestamps of decommissioned region servers are not cleaned up, causing the start code to be lower than it should be). That problem is still present in this commit.

The BackupLogCleaner prevents WAL files that are needed for future backups
from being deleted. In the case where a backup root has a single running
backup, there was a small timeframe where relevant files were unprotected
because only completed backups were taken into consideration. This commit
fixes this.

The old mechanism relied on the "backup start code", which is a timestamp
that denotes (per backup root) the lowest (earliest) log-roll timestamp that
occurred for the backup. Because this concept had no added value, but is
complex to reason about, it is removed. Usages are replaced with equal
behavior based on timestamps stored in the backup info. (The backup start
codes were calculated in the same way, just stored separately.)

Note that the backup start code calculation suffers from HBASE-29628
(log-roll timestamps of decommissioned region servers are not cleaned up,
causing the start code to be lower than it should be). That problem is
still present in this commit.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant