common: add "avglat" in perf result to calculate average latency. #12199

liupan1111 · 2016-11-28T13:28:38Z

Signed-off-by: Pan Liu pan.liu@istuary.com

Signed-off-by: Pan Liu <pan.liu@istuary.com>

liupan1111 · 2016-11-28T13:34:02Z

When we want to tune the performance, we may need perf dump command. For the variables added by "add_time_avg", only "sum" and "avgcount" are dumped out, so I have to compute average latency by hand: sum/avgcount. Indeed, for this kind of performance tuning, average latency of different part is even more important when compare two different perf results.

In my modification, the unit of avglat is ms, not second, so that easier to read.

@liewegas @tchaikov , please help take a look.

liewegas · 2016-11-28T14:20:56Z

I'm not sure this helps you. The sum and avgcount are totals over the lifetime of the counter, so dividing them directly doesn't tell you much.. you need to take the delta with the previous measurement (say, 1s ago), and then device that to get a useful value.

Either way, this seems like something that consuming tools (e.g., 'ceph daemonperf ...') should be doing...

liupan1111 · 2016-11-28T14:42:10Z

@liewegas, there is also one command very usefull: ceph daemon... perf reset all. In am working tuning ceph on P3700 ssds. Using perf reset all and perf dump really help me a lot. :) After adding avg lat, it is much easier for me to compare The latency before and After perf reset all.

liupan1111 · 2016-12-14T06:37:54Z

@liewegas @tchaikov , I tried ceph daemonperf:

ceph daemonperf /var/run/ceph/ceph-osd.0.asok
---objecter--- -----------osd-----------
writ read actv|recop rd wr lat ops |
0 0 0 | 0 0 50M 332 48
0 0 0 | 0 0 40M 369 39
0 0 0 | 0 0 50M 341 49
0 0 0 | 0 0 59M 280 57
0 0 0 | 0 0 67M 263 64
0 0 0 | 0 0 33M 467 32
0 0 0 | 0 0 40M 385 39

I didn't find any perf information for a variable such as "l_bluestore_compress_lat". I also found there was no option to output this kind info:
ceph daemonperf {daemon_name|socket_path} [{interval} [{count}]]

I agree this tool should support this. But seems many enhancement should be done to support. In this way, I think my modification is practical, can help users to do analysis.

What is your opinion?

Thanks.

liewegas · 2016-12-14T07:00:09Z

(cc @dmick) We'd love to see this command expanded so that it takes a list of metrics to watch. Perhaps with a magic argument to include (or not include) the default ones. Open to suggestions there!

liupan1111 · 2017-02-24T13:42:18Z

@dmick , what is your opinion about this PR? I believe avg lat is useful for debug.

dmick · 2017-02-27T19:17:12Z

I guess my opinion is that, as Sage said, the averages aren't very useful. I get that along with manually resetting the counters they can be made more useful, but that's a pretty brute-force solution, and affects more than just the counter you're interested in.

That said, it does seem useful to be able to select a set of stats with arguments to ceph daemon, so that experiments like yours and others, while they may be less-generally useful, can still be accomplished without hacking the code. I'd vote for moving in that direction rather than adding more questionably-useful hardcoded stats.

common: add "avglat" in perf result to calculate average latency.

0d41cc7

Signed-off-by: Pan Liu <pan.liu@istuary.com>

tchaikov self-assigned this Nov 28, 2016

tchaikov added common feature labels Nov 30, 2016

liupan1111 closed this Mar 1, 2017

liupan1111 mentioned this pull request Jul 17, 2017

common/perf_counters: add average time for PERFCOUNTER_TIME #15478

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

common: add "avglat" in perf result to calculate average latency. #12199

common: add "avglat" in perf result to calculate average latency. #12199

liupan1111 commented Nov 28, 2016

liupan1111 commented Nov 28, 2016

liewegas commented Nov 28, 2016

liupan1111 commented Nov 28, 2016

liupan1111 commented Dec 14, 2016 •

edited

liewegas commented Dec 14, 2016 via email

liupan1111 commented Feb 24, 2017

dmick commented Feb 27, 2017

common: add "avglat" in perf result to calculate average latency. #12199

common: add "avglat" in perf result to calculate average latency. #12199

Conversation

liupan1111 commented Nov 28, 2016

liupan1111 commented Nov 28, 2016

liewegas commented Nov 28, 2016

liupan1111 commented Nov 28, 2016

liupan1111 commented Dec 14, 2016 • edited

liewegas commented Dec 14, 2016 via email

liupan1111 commented Feb 24, 2017

dmick commented Feb 27, 2017

liupan1111 commented Dec 14, 2016 •

edited