Tags Leaking while batching #5636

Closed
rbkasat opened this issue Mar 26, 2019 · 3 comments
Labels
bug unexpected problem or unintended behavior
Milestone

Comments

@rbkasat

rbkasat commented Mar 26, 2019

Relevant telegraf.conf:

System info:

[Include Telegraf version, operating system name, and other relevant details]

Steps to reproduce:

Although it is very hard to reproduce, we are seeing a bug in Telegraf. We read data from Kafka and write it to InfluxDB in batches.
Sometimes tags spill from one measurement into others, causing unrelated tags to appear in different measurements.
We read a large volume of data from Kafka and push it to InfluxDB; both the data we read and the data we write are in line protocol. Approximately 5 million points are processed per minute.

Expected behavior:

No Tag Spill

Actual behavior:

Tags are mixed up

Additional info:

[Include gist of relevant config, logs, etc.]

@danielnelson
Contributor

Thanks for the report. Just to clarify, you are using the kafka_consumer input and the influxdb output?

To work on this, it would be very helpful if you could find a minimal way to reproduce the issue, perhaps with a fixed data set, and then try to isolate it to either the input or the output by replacing one of them with the file input or output.
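One way to check a fixed data set for spilled tags is to compare the tag keys per measurement before and after Telegraf. The following is only a rough sketch, not part of Telegraf: the function names and sample points are hypothetical, and the parser assumes no escaped commas or spaces in measurement names or tags, which real line protocol does allow.

```python
from collections import defaultdict


def tag_keys_by_measurement(lines):
    """Map each measurement name to the set of tag keys seen for it.

    Simplifying assumption: no escaped commas or spaces appear in the
    measurement name or tag set.
    """
    seen = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # The first space separates "measurement,tag=value,..." from the fields.
        series = line.split(" ", 1)[0]
        measurement, *tags = series.split(",")
        seen[measurement].update(tag.split("=", 1)[0] for tag in tags)
    return dict(seen)


def leaked_tags(input_lines, output_lines):
    """Return, per measurement, tag keys present in the output but not the input."""
    before = tag_keys_by_measurement(input_lines)
    after = tag_keys_by_measurement(output_lines)
    leaks = {}
    for measurement, keys in after.items():
        extra = keys - before.get(measurement, set())
        if extra:
            leaks[measurement] = extra
    return leaks


# Hypothetical sample: "host" spills from cpu into disk on the way out.
sent = ["cpu,host=a usage=1 1", "disk,path=/ free=2 1"]
written = ["cpu,host=a usage=1 1", "disk,path=/,host=a free=2 1"]
print(leaked_tags(sent, written))  # {'disk': {'host'}}
```

In practice, `input_lines` and `output_lines` could be open file objects for the data fed to the input under test and the lines captured from the output under test.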

@danielnelson added the bug and need more info labels Mar 26, 2019
@rbkasat
Author

rbkasat commented Mar 26, 2019

Thanks for the quick reply. I am using the Kafka consumer input and the InfluxDB output. The points published to the Kafka topics come from the Java InfluxDB library.
I am trying my best to reproduce this with a data set, but we process billions of points and the problem appears suddenly on certain topics, so it is hard to reproduce. It is a big issue, though: once a tag is spilled, it stays until the next retention period and increases cardinality.

@goller removed the bug label Apr 1, 2019
@danielnelson added this to the 1.10.2 milestone Apr 2, 2019
@danielnelson added the bug label and removed the need more info label Apr 2, 2019
@danielnelson
Contributor

Closed in #5650
