testing: parallel benchmark results are poorly documented #31884

aclements · 2019-05-07T14:14:34Z

The way parallel benchmark results are reported is easy to misinterpret, and the correct way to interpret them is not documented. Typically this makes perfectly-scalable benchmarks appear to perform much worse at low parallelism, and makes not-at-all-scalable benchmarks appear to be doing fine. For example, this came up in #31820 (comment)

Specifically, the "ns/op" reported is not CPU-ns/op, it's wall-ns/op. For example, suppose each op takes exactly 100 ns, regardless of parallelism. If the single-threaded benchmark runs for 1 sec, it will execute 10,000,000 ops, so ns/op = 1s/10,000,000 ops = 100 ns/op. But if the same benchmark runs 4-way parallel for 1 sec, it will execute 40,000,000 ops, so ns/op = 1s/40,000,000 = 25 ns/op. (I really wish it didn't work this way...)

Interpreting the results of CPU-bound parallel benchmarks is further complicated by hyper-threading (though this isn't the fault of the testing package).

I don't think we can change the reported ns/op at this point. We could perhaps introduce a new metric for parallel benchmarks. At the very least, we should document this.

When many UUIDs are being generated concurrently, contention on the atomic counter can slow things down. There might be ways to speed this up, but for now, just add a parallel benchmark so we can measure the baseline. Initial results with `go test -bench . -cpu 1,2` on my machine (two physical cores): BenchmarkContended 56413502 18.3 ns/op BenchmarkContended-2 82533951 33.5 ns/op Note that the 33.5ns/op measure is worse than it appears, because that's wall ns/op, not cpu-ns/op (see golang/go#31884), so the time is actually 67.0ns/op when contended.

rpickz · 2021-03-15T19:23:49Z

Looking at improving this documentation now.

Considering where best to document - thinking this detail would be best added in a little of the documentation for testing.B.RunParallel and within the summary of the testing package as a note.

Would anyone recommend any additional areas to document this?

gopherbot · 2022-11-02T08:30:32Z

Change https://go.dev/cl/447136 mentions this issue: testing: Document RunParallel ns/op behavior

felixge · 2022-11-02T08:37:02Z

I just submitted https://go-review.googlesource.com/c/go/+/447136 for improving the documentation, PTAL.

I don't think we can change the reported ns/op at this point.

That's unfortunate. I think fixing the value would be the best solution here. This doesn't really fall under the Go 1 compatibility promise, does it?

We could perhaps introduce a new metric for parallel benchmarks.

I can send a patch for this. Maybe real-ns/op?

It also seems like it's possible to overwrite the reported "ns/op" with a better value like this:

start := time.Now()
b.RunParallel(func(p *testing.PB) {
	// ...
})

nsOp := float64(time.Since(start).Nanoseconds()) / (float64(b.N) / float64(runtime.GOMAXPROCS(-1)))
b.ReportMetric(nsOp, "ns/op")

Perhaps instead of reporting a new metric, we could add a convenience API, e.g. b.CorrectParallelMetrics().

But again, that's also ugly. I'd prefer if the current behavior could be fixed.

Regardless of further action, I hope the documentation patch makes sense and can be applied in the meantime.

puzpuzpuz · 2022-11-02T08:51:39Z

I can send a patch for this. Maybe real-ns/op?

I'd suggest adding a ops/s metric (throughput) along with the existing one (ns/op). This should make the output much more intuitive.

Updates #31884 Change-Id: Ibad3d31038a8426c0bce61c1726392880f934865 Reviewed-on: https://go-review.googlesource.com/c/go/+/447136 Reviewed-by: Bryan Mills <bcmills@google.com> Run-TryBot: Bryan Mills <bcmills@google.com> Reviewed-by: Benny Siegert <bsiegert@gmail.com> Auto-Submit: Bryan Mills <bcmills@google.com> TryBot-Result: Gopher Robot <gobot@golang.org>

aclements added the Documentation label May 7, 2019

aclements added this to the Go1.13 milestone May 7, 2019

aclements mentioned this issue May 7, 2019

sync: Pool.Get is half the speed when uncontended #31820

Closed

andybons added the NeedsFix The path to resolution is known, but the work has not been done. label May 8, 2019

rogpeppe mentioned this issue May 9, 2019

add parallel benchmark rogpeppe/fastuuid#6

Merged

andybons modified the milestones: Go1.13, Go1.14 Jul 8, 2019

rsc modified the milestones: Go1.14, Backlog Oct 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

testing: parallel benchmark results are poorly documented #31884

testing: parallel benchmark results are poorly documented #31884

aclements commented May 7, 2019

rpickz commented Mar 15, 2021

gopherbot commented Nov 2, 2022

felixge commented Nov 2, 2022 •

edited

Loading

puzpuzpuz commented Nov 2, 2022 •

edited

Loading

testing: parallel benchmark results are poorly documented #31884

testing: parallel benchmark results are poorly documented #31884

Comments

aclements commented May 7, 2019

rpickz commented Mar 15, 2021

gopherbot commented Nov 2, 2022

felixge commented Nov 2, 2022 • edited Loading

puzpuzpuz commented Nov 2, 2022 • edited Loading

felixge commented Nov 2, 2022 •

edited

Loading

puzpuzpuz commented Nov 2, 2022 •

edited

Loading