disable index dedupe when rf > 1 and current or upcoming index type is boltdb-shipper #2206

sandeepsukhani · 2020-06-10T13:56:15Z

What this PR does / why we need it:
We use dedupe caches for chunks and seriesIDs to determine whether a chunk or a seriesID has already been written to the store. boltdb-shipper only uploads boltdb files periodically to make the index available to all the other services. If the ingester that first wrote a chunk or seriesID goes down, and the other ingesters skipped writing it because of deduplication, query responses would be missing some logs for as long as that ingester is down. This would be a problem even during rollouts.

This PR disables index deduplication and avoids using the write dedupe cache (which dedupes seriesIDs) when the replication factor is > 1 and the current or upcoming index type is boltdb-shipper.
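
The failure mode described above can be reproduced with a toy model. This is only a sketch under simplifying assumptions: the `ingester` and `cluster` types, the shared dedupe cache map, and the `queryable` check are hypothetical stand-ins, not Loki or Cortex code.

```go
package main

import "fmt"

// ingester is a toy stand-in for an ingester using boltdb-shipper: index
// entries land in a local file first and are only shipped periodically.
type ingester struct {
	up    bool
	local map[string]bool // index entries written locally, not yet uploaded
}

// cluster models replicas that share one write dedupe cache.
type cluster struct {
	ingesters     []*ingester
	dedupeCache   map[string]bool // shared cache of seriesIDs already written
	uploaded      map[string]bool // index entries already shipped to the shared store
	disableDedupe bool
}

func newCluster(n int, disableDedupe bool) *cluster {
	c := &cluster{
		dedupeCache:   map[string]bool{},
		uploaded:      map[string]bool{},
		disableDedupe: disableDedupe,
	}
	for i := 0; i < n; i++ {
		c.ingesters = append(c.ingesters, &ingester{up: true, local: map[string]bool{}})
	}
	return c
}

// write replicates one seriesID to every ingester (replication factor = n).
func (c *cluster) write(seriesID string) {
	for _, ing := range c.ingesters {
		if !c.disableDedupe && c.dedupeCache[seriesID] {
			continue // another replica already wrote it, so this one skips
		}
		ing.local[seriesID] = true
		c.dedupeCache[seriesID] = true
	}
}

// queryable reports whether the series can be found: either its index entry
// was uploaded, or a live ingester still holds it locally.
func (c *cluster) queryable(seriesID string) bool {
	if c.uploaded[seriesID] {
		return true
	}
	for _, ing := range c.ingesters {
		if ing.up && ing.local[seriesID] {
			return true
		}
	}
	return false
}

func main() {
	for _, disable := range []bool{false, true} {
		c := newCluster(3, disable)
		c.write("series-1")
		c.ingesters[0].up = false // the first writer goes down before any upload
		fmt.Printf("dedupe disabled=%v queryable=%v\n", disable, c.queryable("series-1"))
	}
}
```

With deduplication on, only the first ingester holds the entry, so the series disappears from the model as soon as that ingester is down. With deduplication off, every replica holds it.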

Checklist

Documentation added
Tests updated

codecov-commenter · 2020-06-10T14:08:14Z

Codecov Report

Merging #2206 into master will increase coverage by 0.03%.
The diff coverage is 61.53%.

@@            Coverage Diff             @@
##           master    #2206      +/-   ##
==========================================
+ Coverage   61.62%   61.65%   +0.03%     
==========================================
  Files         160      160              
  Lines       13577    13586       +9     
==========================================
+ Hits         8367     8377      +10     
+ Misses       4588     4586       -2     
- Partials      622      623       +1

| Impacted Files | Coverage Δ | |
|---|---|---|
| pkg/loki/modules.go | `11.52% <61.53%> (+2.38%)` | ⬆️ |
| pkg/querier/queryrange/downstreamer.go | `95.87% <0.00%> (-2.07%)` | ⬇️ |
| pkg/canary/comparator/comparator.go | `80.24% <0.00%> (+2.41%)` | ⬆️ |

slim-bean · 2020-06-10T14:44:07Z

I was hoping we wouldn't need this if we could update Cortex to fix the underlying issue?

sandeepsukhani · 2020-06-10T15:19:16Z

I am not sure which issue we are talking about here. This is based on a change in Cortex that I did which helped with this.

sandeepsukhani · 2020-06-10T16:12:48Z

Another note: this problem is specific to boltdb-shipper, because all the other stores have strong consistency while boltdb-shipper doesn't.

slim-bean · 2020-06-26T12:21:25Z

I will look at this closer, maybe it's the naming that confuses me.

We don't want to disable write de-duplication of chunks, we still want the chunks to be written to the cache.

Instead we just want to make sure when a chunk is already in the cache we still write the index entry.

This does get more complicated, however, if the filesystem is not shared. I will look closer at this.

sandeepsukhani · 2020-06-27T10:51:44Z

@slim-bean I have changed the flag in Cortex to allow disabling of just index deduplication. If enabled, it would still write just the index even if the chunk is already written.
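
A minimal sketch of the flag semantics described in this comment, assuming a simplified write path. The `store` type, its counters, and the `put` method are hypothetical and only illustrate the behavior; they are not the Cortex chunk store API.

```go
package main

import "fmt"

// store is a hypothetical, simplified write path with a chunk dedupe cache.
type store struct {
	disableIndexDeduplication bool
	chunkCache                map[string]bool // chunk IDs known to be written already
	chunkWrites               int             // number of chunk writes performed
	indexWrites               int             // number of index writes performed
}

// put writes a chunk and its index entry. Chunks are always deduplicated via
// the cache; the index write is skipped for a cached chunk only when index
// deduplication is left enabled.
func (s *store) put(chunkID string) {
	seen := s.chunkCache[chunkID]
	if !seen {
		s.chunkWrites++
		s.chunkCache[chunkID] = true
	}
	if !seen || s.disableIndexDeduplication {
		s.indexWrites++
	}
}

func main() {
	for _, disable := range []bool{false, true} {
		s := &store{disableIndexDeduplication: disable, chunkCache: map[string]bool{}}
		s.put("chunk-a")
		s.put("chunk-a") // same chunk arriving from a second replica
		fmt.Printf("disableIndexDeduplication=%v chunkWrites=%d indexWrites=%d\n",
			disable, s.chunkWrites, s.indexWrites)
	}
}
```

In this model the chunk is written once in both cases, while the index entry is written twice only when index deduplication is disabled.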

sandeepsukhani · 2020-07-24T11:07:18Z

@slim-bean let us merge this PR which has a helper function to check whether current or upcoming index store is of type boltdb-shipper. This would help with doing changes to #2166 that we discussed yesterday.

slim-bean

LGTM!

pull-request-size bot added the size/M label Jun 10, 2020

sandeepsukhani force-pushed the disable-write-dedupe-boltdb-shipper branch from 1fc5b02 to 66d68de Compare June 27, 2020 10:05

sandeepsukhani requested a review from slim-bean June 27, 2020 10:51

sandeepsukhani changed the title ~~disable write dedupe when rf > 1 and current or upcoming index type is boltdb-shipper~~ disable index dedupe when rf > 1 and current or upcoming index type is boltdb-shipper Jun 27, 2020

sandeepsukhani added 2 commits July 24, 2020 16:34

disable write dedupe when rf > 1 and current or upcoming index type is boltdb-shipper

62e7299

use DisableIndexDeduplication which disables deduplication of just the index and still dedupes chunks

1ed418b

sandeepsukhani force-pushed the disable-write-dedupe-boltdb-shipper branch from 66d68de to 1ed418b Compare July 24, 2020 11:05

slim-bean approved these changes Jul 24, 2020

update comments

d3aa114

sandeepsukhani merged commit 807193d into grafana:master Jul 27, 2020
