-
Notifications
You must be signed in to change notification settings - Fork 106
Conversation
a57819b
to
d15a8bd
Compare
Can you explain what problem does this tool solve? |
Precisely. It is used as part of an out of band process to selectively fill in older data (primarily from legacy systems) for which more recent data already exists in Metrictank. |
I'm wondering how we can avoid code duplication, since we already have mt-whisper-importer-writer which listens on http for chunk write requests. If the client can accommodate a slow or temporary down backend, then it can simply wait and retry the http posts (this is what mt-whisper-importer-reader does), and you don't need a queue in between. |
Not sure what you are implying here. Do you mean kafka when you say "a queue in between"? In our case we have a custom task that knows how to take some input and create the datapoints that MT expects. It does not know how to create the chunks (and I don't believe it should). So I still think we need some tool to create the chunks, which honestly is a large part of the work here. We use this tool out-of-band. We just spin up a pod with 3 containers (our custom process, kafka, and mt-backfill) and feed in the data to backfill. Realistically, it's only a few hundred lines of code, and reuses a lot of the primary MT code. If you see some more to factor out to reduce the code footprint, I'd be open to that. The reason I want to upstream this is so it stays current if, for example, an interface changes. |
yes
I see. I was thinking of ways to integrate this more into the existing tools rather introduce yet another one-off. Some ideas:
|
@shanson7 what do you think of option 1 above? am i missing something? |
That is essentially what this tool is doing, but stripping out bits that aren't needed and automatically shutting down when all data is processed. It's intended for automated runs. |
OK then that's fine. I think we mainly need docs then:
also remove the "WIP" from the title since it seems it's no longer a wip :) |
@shanson7 just wanted to confirm you're not waiting on anything from me for this one. this is close to finished. see above. |
d15a8bd
to
4530ebc
Compare
We have been using this tool to (manually) backfill time-series data from users. It is non-trivial and a little finicky.
While this tool is functional for our purposes, I'm not quite sure if it's ready for upstreaming or what changes we might want to make to make it work.