Fix mt-backfill data flush #2043

shanson7 · 2022-11-08T13:30:35Z

We have noticed for a while that the mt-backfill tool never seems to flush all of the data it has. It turns out, this is due to a mix of using system time and chunk spans.

AggMetric.lastWrite uses the system time when the data point was ingested, not the timestamp of the datapoint. This is probably the right thing to do to avoid prematurely flushing when backfilling.

However, Aggregator won't GC a chunk until lastWriteTime+agg.span <= chunkMinTs. For rollups with longer spans (say 4h) that means the backfill tool would need to run for 4 hours before it would flush the tail datapoints.

This change is to add a simple ForceGC function so the backfill tool can do flush everything on shutdown. I figured this was the safest way to fix this without changing the behavior of the core components.

shanson7 added 4 commits November 8, 2022 13:18

Fix typo

a75d2ea

Add test for GC

23b975a

Add forceGC function

a0567d9

Flush chunks on shutdown

5bfecdb

robert-milan approved these changes Dec 26, 2022

View reviewed changes

robert-milan merged commit 304f5e6 into grafana:master Dec 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix mt-backfill data flush #2043

Fix mt-backfill data flush #2043

shanson7 commented Nov 8, 2022

Fix mt-backfill data flush #2043

Fix mt-backfill data flush #2043

Conversation

shanson7 commented Nov 8, 2022