Detect duplicates across files in textfile collector #274
Conversation
```go
if seenIn, ok := seen[h]; ok {
	repr := friendlyString(*metricFamily.Name, names, values)
	log.Warnf("Metric %s was read from %s, but has already been collected from file %s, skipping", repr, path, seenIn)
	continue
}
```
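For context, the check above presumably consults a `seen` map from a metric's identity hash to the file it was first read from. A minimal self-contained sketch of that pattern (all names here are hypothetical stand-ins, not the PR's actual code, which hashes the metric family name plus label names and values):

```go
package main

import "fmt"

// hashKey is a hypothetical stand-in for the metric identity hash used by
// the collector: the metric name plus its label pairs.
func hashKey(name string, labels map[string]string) string {
	key := name
	for k, v := range labels { // real code would hash in a fixed label order
		key += "|" + k + "=" + v
	}
	return key
}

func main() {
	// seen maps a metric identity to the file it was first collected from.
	seen := map[string]string{}

	type sample struct {
		file   string
		name   string
		labels map[string]string
	}
	samples := []sample{
		{"a.prom", "node_custom_metric", map[string]string{"job": "x"}},
		{"b.prom", "node_custom_metric", map[string]string{"job": "x"}}, // duplicate
	}

	for _, s := range samples {
		h := hashKey(s.name, s.labels)
		if seenIn, ok := seen[h]; ok {
			// The later file loses; the first occurrence wins.
			fmt.Printf("duplicate of %s, already collected from %s, skipping\n", s.name, seenIn)
			continue
		}
		seen[h] = s.file
	}
}
```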
This is not a good idea, you're now randomly dropping metrics depending on the iteration order. We've had issues with this type of approach in the past.
Yeah, it's not great, I agree. However, failing the scrape completely isn't that good either. Do you see some middle road or better approach?
Also, I just realized I've forgotten to set the error flag when this happens, so now it is not alertable. That needs to be fixed.
The node exporter fails, as this is an invalid setup. It's better to hard fail than silently return partial data.
@SuperQ FYI
Should we be failing the entire scrape, though? Skipping the textfile collector's output completely seems okay (although I'll note that the metrics aren't randomly dropped: iteration order is deterministic here, so which metrics are dropped is consistent across scrapes, given consistent input), but any individual breakage in any collector resulting in up=0 seems a bit harsh?
You should approach it however you approach any individual collector failing. If you permit that, then you should have metrics indicating which collectors did/didn't work.
Right, that would be my preference. It will require a bit of gymnastics, though, since promhttp bails out of the entire scrape if we allow the data to get there.
I guess I'll do some sort of buffer to be able to detect failure before passing the metrics through the channel. Would this be something that should be done in node_exporter as well?
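The buffering idea could look roughly like this: gather everything first, and only forward metrics through the channel once the whole pass is known to be clean, flipping an error flag otherwise. A minimal sketch under those assumptions (names like bufferAndSend and the error-flag metric are hypothetical, and real code would key duplicates on the full name-plus-labels hash, not the bare name):

```go
package main

import "fmt"

// metric is a simplified stand-in for a parsed textfile metric.
type metric struct {
	name string
	file string
}

// bufferAndSend buffers all metrics, returns 1 (the would-be value of a
// scrape-error gauge) on a duplicate without sending anything, and
// otherwise sends the buffered metrics through ch and returns 0.
func bufferAndSend(metrics []metric, ch chan<- metric) int {
	seen := map[string]string{}
	buf := make([]metric, 0, len(metrics))
	for _, m := range metrics {
		if seenIn, ok := seen[m.name]; ok {
			fmt.Printf("metric %s already collected from %s; flagging error\n", m.name, seenIn)
			return 1 // nothing reaches promhttp, so the scrape itself survives
		}
		seen[m.name] = m.file
		buf = append(buf, m)
	}
	for _, m := range buf {
		ch <- m
	}
	return 0
}

func main() {
	ch := make(chan metric, 8)
	errFlag := bufferAndSend([]metric{
		{"metric_a", "a.prom"},
		{"metric_a", "b.prom"}, // duplicate: whole batch is withheld
	}, ch)
	fmt.Println("textfile_scrape_error", errFlag)
}
```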
That'd be a question for @SuperQ, I've heard no plans in that direction.
Won't be taking this approach, so closing this.
Fixes #272.
There are still a couple of cases where the textfile collector can cause the entire scrape to fail, but I hope this solves a major contributor to the real-world cases.