Skip to content

Conversation

pracucci
Copy link
Contributor

What this PR does:
In past few days we experienced TSDB WAL corruption twice. Given the root cause investigation (and possible fix) is a separate issue, I've also realised we're not mapping all TSDB metrics so in this PR I'm fixing this gap.

Which issue(s) this PR fixes:
N/A

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@@ -1858,12 +1858,12 @@ func TestIngester_flushing(t *testing.T) {
i.FlushHandler(httptest.NewRecorder(), httptest.NewRequest("POST", "/flush", nil))

// Flush handler only triggers compactions, but doesn't wait for them to finish. Let's wait for a moment, and then verify.
test.Poll(t, 1*time.Second, true, func() interface{} {
test.Poll(t, 5*time.Second, uint64(0), func() interface{} {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This test was flaky again on my local machine, so I had:

  1. Increase polling timeout from 1s to 5s
  2. Compared on the actual number, so we can see what's the value of NumSeries() when it fails

CHANGELOG.md Outdated
@@ -89,6 +89,11 @@
* [ENHANCEMENT] Blocks storage: reduced number of bucket listing operations to list block content (applies to newly created blocks only). #3363
* [ENHANCEMENT] Ruler: Include the tenant ID on the notifier logs. #3372
* [ENHANCEMENT] Blocks storage Compactor: Added `-compactor.enabled-tenants` and `-compactor.disabled-tenants` to explicitly enable or disable compaction of specific tenants. #3385
* [ENHANCEMENT] Blocks storage ingester: exported more TSDB-related metrics. #3412
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👋 The v1.5 release just started and you will need to rebase to the latest master and move any CHANGELOG entries into the unreleased section. Thanks!

Signed-off-by: Marco Pracucci <marco@pracucci.com>
@pracucci pracucci force-pushed the expose-all-tsdb-metrics branch from e86a923 to a786ac0 Compare October 28, 2020 17:15
@pracucci pracucci merged commit 73257d5 into cortexproject:master Oct 29, 2020
@pracucci pracucci deleted the expose-all-tsdb-metrics branch October 29, 2020 09:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants