
Add compactor HTTP API for uploading TSDB blocks #1694

Merged: 82 commits into main from feat/backfill on Jun 17, 2022

Conversation

@aknuds1 (Contributor) commented on Apr 13, 2022:

What this PR does

Add compactor HTTP API for uploading TSDB blocks.
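To make the flow concrete, here is a minimal client-side sketch of the session-based upload discussed in this PR (start the session, upload the block's files, finish). The endpoint paths, the X-Scope-OrgID header, and the hard-coded file names are illustrative assumptions only; they are not the routes defined by this PR.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
	"os"
	"path/filepath"
)

// uploadBlock sketches the three-step block upload session: start, per-file upload, finish.
// baseURL points at the component exposing the API; all paths/headers are assumptions.
func uploadBlock(baseURL, tenant, blockID, blockDir string) error {
	post := func(urlPath string, body io.Reader) error {
		req, err := http.NewRequest(http.MethodPost, baseURL+urlPath, body)
		if err != nil {
			return err
		}
		req.Header.Set("X-Scope-OrgID", tenant) // multi-tenancy header; an assumption here
		resp, err := http.DefaultClient.Do(req)
		if err != nil {
			return err
		}
		defer resp.Body.Close()
		if resp.StatusCode/100 != 2 {
			return fmt.Errorf("POST %s: unexpected status %s", urlPath, resp.Status)
		}
		return nil
	}

	// 1. Start the upload session by sending the block's meta.json.
	meta, err := os.Open(filepath.Join(blockDir, "meta.json"))
	if err != nil {
		return err
	}
	defer meta.Close()
	if err := post("/api/v1/upload/block/"+blockID+"/start", meta); err != nil {
		return err
	}

	// 2. Upload each block file; a real client would walk the block directory.
	for _, name := range []string{"index", "chunks/000001"} {
		f, err := os.Open(filepath.Join(blockDir, name))
		if err != nil {
			return err
		}
		err = post("/api/v1/upload/block/"+blockID+"/files?path="+url.QueryEscape(name), f)
		f.Close()
		if err != nil {
			return err
		}
	}

	// 3. Finish the session; only then is the block considered complete.
	return post("/api/v1/upload/block/"+blockID+"/finish", nil)
}
```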

TODOs

  • Add HTTP endpoints in cloud gateway
  • Add HTTP endpoints in GEM gateway
  • Incorporate @pracucci's feedback
  • Add tests
  • Add validation when creating a block upload session
  • Validate block metadata
  • Validate block files(?) (@colega suggestion)
  • Make sure that when starting a backfill, file lengths are sent with file index
  • Validate that block time range is within retention period (@aldernero working on this)
  • Validate minTime/maxTime in meta.json (@aldernero working on this)
  • Validate block ID on backfill start
  • Test output from sanitizeMeta
  • Mark user-uploaded blocks, for debugging and security(?) (this is in place through thanos.source property in meta.json)
  • Make sure that a backfill can be restarted, in case it got interrupted
  • Add/fix tests

Which issue(s) this PR fixes or relates to

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@pracucci (Collaborator) left a comment:

Thanks @aknuds1 for working on this! I did a very high level review, focusing on API design. Could you take a look at my comments, please?

Review threads (resolved): pkg/api/api.go (3), pkg/distributor/distributor.go (5), pkg/mimirpb/mimir.proto, cmd/mimirtool/main.go
@aknuds1 (Contributor, Author) commented on Apr 15, 2022:

Thanks for the review @pracucci! I will incorporate your feedback when I can start refining the PR. I'm concentrating on implementing the endpoint for finishing backfills now.

@pracucci (Collaborator) commented:

What happens when an upload is aborted? If the client interrupts the upload and will never recover it, the partial files will be stored in uploads/ forever. We may think of doing a cleanup in the compactor cleanup (not in this PR, but we need to design it - can you add it to the design doc, please?).

@aknuds1 (Contributor, Author) commented on Apr 15, 2022:

> What happens when an upload is aborted? If the client interrupts the upload and will never recover it, the partial files will be stored in uploads/ forever. We may think of doing a cleanup in the compactor cleanup (not in this PR, but we need to design it - can you add it to the design doc, please?).

@pracucci I have thought about the same issue, and was planning to return to it after implementing the basic scenario ("happy path"). Good thinking about letting the compactor cleanup handle it; I'll note it in the design doc.

Review thread (resolved): pkg/ingester/ingester.go
@aknuds1 force-pushed the feat/backfill branch 2 times, most recently from 46fa91c to 817b5a1, on April 25, 2022 at 11:37
@pstibrany (Member) commented on Apr 25, 2022:

What is the rationale behind using the distributor, gRPC, and ingester to upload blocks to the storage?

I would suggest using a component which already has access to the blocks storage. Personally I would suggest the compactor or the purger (which also handles the tenant deletion API, although "purger" doesn't exactly suggest "block uploads" to me). This component would not need to forward the messages over gRPC to yet another component, but could handle the API calls directly.

@aknuds1 (Contributor, Author) commented on Apr 25, 2022:

> What is the rationale behind using the distributor, gRPC, and ingester to upload blocks to the storage?
>
> I would suggest using a component which already has access to the blocks storage. Personally I would suggest the compactor or the purger (which also handles the tenant deletion API). This component would not need to forward the messages over gRPC to yet another component, but could handle the API calls directly.

@pstibrany It's because I figured the distributor would be the logical component for receiving the backfill requests. Would there be any (technical) drawbacks to putting the HTTP endpoints in the compactor or purger?

It seems a bit weird to me, from a logical PoV, to put backfill endpoints in the compactor or purger components; it's basically a type of ingestion, isn't it?

WDYT @pracucci?

@pstibrany (Member) commented on Apr 25, 2022:

> It's because I figured the distributor would be the logical component for receiving the backfill requests.

Distributor doesn't currently talk to blocks storage in any way.

> Would there be any drawbacks to putting the HTTP endpoints in the compactor or purger?

I don't think so, other than it seems "weird". The compactor already handles a lot of block maintenance work (cleanup, block index), but doesn't currently expose any API. "Purger" doesn't exactly mean "backfilling" in my mind. If we overlook these minor issues, I think both are a better choice than the distributor.

We may also end up with a completely separate "backfill" module, and only people interested in having the backfill API can run it. (Or a "blocks-api" module, and then we can also provide a blocks-reading API in the future.)

@aknuds1 (Contributor, Author) commented on Apr 25, 2022:

> > It's because I figured the distributor would be the logical component for receiving the backfill requests.
>
> Distributor doesn't currently talk to blocks storage in any way.

@pstibrany It forwards ingestion requests to the ingester though. My thinking was that backfill is another form of ingestion. I'm not sure what's technically best. I'm open to moving the functionality to another component. I think the compactor sounds the least weird of the two, logically.

> We may also end up with a completely separate "backfill" module, and only people interested in having the backfill API can run it. (Or a "blocks-api" module, and then we can also provide a blocks-reading API in the future.)

That could also make sense, although maybe overkill for this PR?

@pstibrany (Member) commented:

> I'm not sure what's technically best. I'm open to moving the functionality to another component. I think the compactor sounds the least weird of the two, logically.

I think the best option is one where we can leverage the HTTP protocol, so that the client can stream the data through some Mimir component directly to the storage, without any intermediate local files. That rules out the distributor (no access to blocks storage). Ingesters could handle the HTTP API calls directly (without going through distributors), but I don't think they should be wasting their CPU cycles on this, or on block verification.

> > We may also end up with a completely separate "backfill" module, and only people interested in having the backfill API can run it. (Or a "blocks-api" module, and then we can also provide a blocks-reading API in the future.)
>
> That could also make sense, although maybe overkill for this PR?

Perhaps, but I don't think the cost of having more modules is that high. Compactors have the advantage that they already have a local disk (we will need that to download the full block and validate it before successfully finishing the upload).

@aknuds1 (Contributor, Author) commented on Apr 25, 2022:

@pstibrany how about we move the endpoints to the compactor module then?

@pstibrany (Member) commented:

> @pstibrany how about we move the endpoints to the compactor module then?

Sounds good to me. It allows us to get rid of the gRPC middle step.

We can keep the backfill implementation separate from the compactor and only pass the bucket configuration to the HTTP handlers. That will make it simple to use from a different module in the future.

@aknuds1 (Contributor, Author) commented on Apr 25, 2022:

> > @pstibrany how about we move the endpoints to the compactor module then?
>
> Sounds good to me. It allows us to get rid of the gRPC middle step.
>
> We can keep the backfill implementation separate from the compactor and only pass the bucket configuration to the HTTP handlers. That will make it simple to use from a different module in the future.

@pstibrany Sounds good! I'll just verify tomorrow with @pracucci that he agrees.

@pracucci (Collaborator) commented:

We're working to simplify the Mimir architecture and operations. From this perspective, it's easier for users to understand that there are only 2 ingress components in Mimir:

  • Write path: distributor
  • Read path: query-frontend

Does the backfill API write or read? It writes. Then I would keep the API exposed by the distributor. Then, internally, we can route requests to other components. I'm fine routing to the compactor instead of the ingester (I think it makes sense), but I would keep the API exposed by the distributor. This also allows for some traffic sharding (e.g. shuffle-sharding).

For the same reason, I would avoid adding any new component for backfilling.

About the purger: the existence of this component is a legacy of the chunks storage. I believe it shouldn't even exist in Mimir anymore (and its features should be merged into other already-existing components), but that's a separate issue to discuss.

@aknuds1 (Contributor, Author) commented on Apr 26, 2022:

Thanks for chiming in @pracucci! I'll keep the API as is then, while considering moving the block storage logic to the compactor.

@pracucci (Collaborator) commented:

> Does the backfill API write or read? It writes. Then I would keep the API exposed by the distributor.

I would like to hear @09jvilla's opinion too. Exposing the API directly from the compactor is easier from an implementation perspective. My main take is about simplifying the architecture for the end user, but that doesn't necessarily simplify the implementation for us.

@pstibrany (Member) commented:

> Does the backfill API write or read? It writes. Then I would keep the API exposed by the distributor. Then, internally, we can route requests to other components. I'm fine routing to the compactor instead of the ingester (I think it makes sense), but I would keep the API exposed by the distributor.

If it is the distributor that exposes the API, and we don't want distributors doing the upload to the storage directly (which would be an option too, but I don't think we should do that), then the distributor needs to reroute the request. We use gRPC for this rerouting. gRPC introduces extra complication and forces us to reimplement streaming of the body from the HTTP request via gRPC. If we didn't introduce gRPC to the mix, we could simply pass the io.Reader from the HTTP request directly to the object client's Upload method. In other words, rerouting from the distributor to another component via gRPC seems like an unnecessary complication, which is why I disagree with using the distributor to expose the API.
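To illustrate that streaming point, here is a minimal sketch of a handler that passes the request body straight to the object storage client. It assumes the Thanos objstore Bucket interface and gorilla/mux routing; the type, route layout, and staging layout are illustrative assumptions, not the code from this PR.

```go
package blockupload

import (
	"net/http"
	"path"

	"github.com/gorilla/mux"
	"github.com/thanos-io/objstore"
)

// blockUploadHandler holds only a bucket client, following the idea of passing
// just the bucket configuration to the HTTP handlers rather than a whole component.
type blockUploadHandler struct {
	bucket objstore.Bucket
}

// uploadFile streams an uploaded block file straight from the HTTP request body
// into object storage. Because r.Body is an io.Reader, it can be handed directly
// to Bucket.Upload with no intermediate local file and no gRPC hop.
func (h *blockUploadHandler) uploadFile(w http.ResponseWriter, r *http.Request) {
	blockID := mux.Vars(r)["block"]       // block ULID from a route like /upload/block/{block}/files
	filePath := r.URL.Query().Get("path") // e.g. "index" or "chunks/000001"
	if filePath == "" {
		http.Error(w, "missing path parameter", http.StatusBadRequest)
		return
	}

	dst := path.Join(blockID, filePath) // destination object name; layout is illustrative
	if err := h.bucket.Upload(r.Context(), dst, r.Body); err != nil {
		http.Error(w, err.Error(), http.StatusBadGateway)
		return
	}
	w.WriteHeader(http.StatusOK)
}
```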

Another thing to consider: the component handling the upload will need to perform validation of the block during the finish step. This means downloading the block locally and performing various checks on the index and chunks, to make sure we don't let a wrong block in. I don't think ingesters should be doing this validation; it's IO- and CPU-intensive, and ingesters have better things to do. Similarly, distributors would be a poor choice. I think the compactor makes sense for this.

tl;dr: I'm still in favor of directly exposing the upload HTTP API from the compactor, without going through the distributor first.

@09jvilla what is your take?

@aknuds1 (Contributor, Author) commented on Apr 26, 2022:

@pstibrany @pracucci I implemented Andy's/Peter's proposal to upload blocks directly to the destination, instead of going via a staging directory ("uploads"). I ensure that the block is considered unfinished while uploading by only writing meta.json when completing the block upload session.

A great benefit of this is that I don't have to fork Thanos to implement a bucket move method, which would actually be a lot of work due to all the different bucket backends.
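A minimal sketch of that "meta.json last" ordering, assuming a Thanos objstore bucket; the function name and exact object layout are illustrative, not the code from this PR.

```go
package blockupload

import (
	"bytes"
	"context"
	"path"

	"github.com/thanos-io/objstore"
)

// finishUpload sketches the ordering described above: all other block files are
// uploaded first, and writing meta.json is the final "commit" step. Until meta.json
// exists, readers treat the block as incomplete and skip it, so no staging directory
// or bucket-level move is needed.
func finishUpload(ctx context.Context, bkt objstore.Bucket, blockID string, meta []byte) error {
	return bkt.Upload(ctx, path.Join(blockID, "meta.json"), bytes.NewReader(meta))
}
```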

@09jvilla (Contributor) commented on Apr 28, 2022:

Warning - long thread below. I've put in a TLDR at the bottom if you're in a hurry.

  1. If we expose it via the distributor, the argument is that it simplifies the interface for the user. They only need to think about the distributor as the single ingress component for the system (regardless of whether those metrics are realtime or historic). However, does it add complexity to what the user has to configure upfront? In other words, does that extra grpc connection and request forwarding that has to happen add some extra configuration burden for the user? If it does, that might mitigate the benefits of the distributor as the single interface. If it doesn't, just ignore this point.

  2. In general I would rather we take on the complexity for our users, so us doing a little more implementation work upfront to save our users some cognitive load is worth it. This would again argue for having it exposed via the distributor. However, if this is going to make issues harder for users to debug (because of the extra distributor-to-compactor hop), then it may add just as many problems as it solves.

  3. It's hard to say what the implications would be for the ease-of-use effort. In monolithic mode, this doesn't really matter that much, right? Both the compactor and the distributor are deployed in the same process (sure, it changes the API route, but the base address is the same). Depending on how you do a miniservices implementation (e.g., put both the distributor and compactor on target=write), this might also be true.

  4. Let's say we do downsampling in the compactor at some point. That probably would be configured via an HTTP API on the compactor, right? We could then frame it to the user as:

  • distributor --> ingress for write path (i.e. 'live data')
  • query-frontend --> ingress for read path
  • compactor --> handles historic-data functions (custom retention, downsampling, deletion of data, and import of old data)

So you can keep it relatively simple for the user while still having this live on the compactor.

TLDR: In most cases, this should be a one-time action the user is taking. ("As a user, I need to spin up a new cluster and want to pull in my historic data.") There are some folks like Adobe who need to periodically import historic data, but I think this is the less common case.

Because it's not a super frequent action for the user to take, I'm not that worried that it will add a lot of user confusion (plus I tried to make some arguments in points 1-4 that maybe the confusion isn't as bad as we think).

Given that, I would vote in favor of what is simpler for us to maintain in the long term, which, if I'm reading this thread correctly, would be putting it on the compactor.

@aknuds1 (Contributor, Author) commented on Apr 28, 2022:

Thanks @09jvilla.

> Warning - long thread below. I've put in a TLDR at the bottom if you're in a hurry.
>
> 1. If we expose it via the distributor, the argument is that it simplifies the interface for the user. They only need to think about the distributor as the single ingress component for the system (regardless of whether those metrics are realtime or historic). However, does it add complexity to what the user has to configure upfront? In other words, does that extra grpc connection and request forwarding that has to happen add some extra configuration burden for the user? If it does, that might mitigate the benefits of the distributor as the single interface. If it doesn't, just ignore this point.

I'm not aware of any extra configuration required, at least none has been introduced so far in the PR.

> 2. In general I would rather we take on the complexity for our users, so us doing a little more implementation work upfront to save our users some cognitive load is worth it. This would again argue for having it exposed via the distributor. However, if this is going to make issues harder for users to debug (because of the extra distributor-to-compactor hop), then it may add just as many problems as it solves.

I'd need input from the others (Marco, Peter, others?) on how much extra maintenance effort the (gRPC) hop from distributor to ingester might mean in practice.

@pstibrany (Member) commented:

From the user's point of view, there is no extra complexity due to using gRPC. The extra complexity is only in our code.

We already have APIs exposed by different components (distributor, query-frontend, alertmanager, ruler), so adding the compactor to the mix with a backfilling API (and perhaps also the tenant deletion API, which currently lives in the "purger" component) doesn't seem to make the situation much worse.

@aknuds1 changed the title from "Add compactor HTTP API for backfilling blocks" to "Add compactor HTTP API for uploading blocks" on Jun 16, 2022
@aknuds1 changed the title from "Add compactor HTTP API for uploading blocks" to "Add compactor HTTP API for uploading TSDB blocks" on Jun 16, 2022
@aknuds1 requested a review from @pstibrany on June 16, 2022 at 08:38
@@ -30,6 +30,7 @@
* [ENHANCEMENT] Chunk Mapper: reduce memory usage of async chunk mapper. #2043
* [ENHANCEMENT] Ingesters: Added new configuration option that makes it possible for mimir ingesters to perform queries on overlapping blocks in the filesystem. Enabled with `-blocks-storage.tsdb.allow-overlapping-queries`. #2091
* [ENHANCEMENT] Ingester: reduce sleep time when reading WAL. #2098
* [ENHANCEMENT] Compactor: Add HTTP API for uploading TSDB blocks. #1694
@pstibrany (Member) commented on CHANGELOG.md:

Let's also mention that the feature is experimental (here: docs/sources/operators-guide/configuring/about-versioning.md:35) for now, as the API may still change a bit (I expect that validation will modify the "finish block" call to return progress updates).

@aknuds1 (Contributor, Author) replied:

Thanks for the heads up @pstibrany, done. PTAL.

@pstibrany (Member) left a comment:

Thank you. I think we're ready to merge this PR once the last comments are fixed.

Review threads (resolved): pkg/compactor/block_upload.go (3), pkg/compactor/block_upload_test.go
@aknuds1 requested a review from @pstibrany on June 16, 2022 at 13:55
@pstibrany (Member) left a comment:

Thank you very much for your effort on this PR!

@aknuds1 (Contributor, Author) commented on Jun 17, 2022:

Thanks for the thorough and quick reviews @pstibrany!

@aknuds1 merged commit 94f00f8 into main on Jun 17, 2022
@aknuds1 deleted the feat/backfill branch on June 17, 2022 at 07:43
masonmei pushed a commit to udmire/mimir that referenced this pull request on Jul 11, 2022:
* Compactor: Add HTTP API for uploading TSDB blocks

Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>
Co-authored-by: Peter Štibraný <pstibrany@gmail.com>
Co-authored-by: aldernero <vernon.w.miller@gmail.com>