Wrong latency distribution on the benchmark test #81

Closed

wenxzhen opened this issue Oct 29, 2019 · 5 comments
@wenxzhen
Dear all,
When I used wrk2 to run an HTTP performance test, I found the latency distribution quite puzzling, as described below:

[screenshots: wrk2 latency distribution output]

I captured the HTTP traffic with Wireshark, and I didn't find any request with latency greater than 1 s.

[screenshots: Wireshark capture of the HTTP traffic]

wrk2 version: wrk 4.0.0 [kqueue] Copyright (C) 2012 Will Glozer
My command: wrk2 -t 4 -c 10 -d 60s --rate 1000 -s header.lua --latency "$BaseUrl"

Can anyone tell me what the problem is here? I'd appreciate your help.

@giltene (Owner) commented Oct 29, 2019

This is a classic example of correct reporting of experienced latency when the target system simply can’t keep up with the requested rate, and incorporates back pressure in its network protocols (e.g. “uses TCP”, or “puts orders for coffee in a pile of paper slips that baristas pick up and execute”).

Wrk2 reports the response time that would be experienced by clients if those clients actually attempted to access the system at the rate requested, not if the clients were willing to wait and come back when the system is ready to respond, without holding that wait time against it.

Imagine you had 1,000 people per hour come to a coffee shop and wait in line for coffee, and the coffee shop had 10 baristas furiously picking up orders and executing them at a total rate of 127.56 coffee cups per hour between them. A “coffee shark” watching the baristas take the next pending coffee order from the pile and make it would report 4.7 minutes on average to complete a cup of coffee (60 * 10 / 127.56). But the median person in line would be reporting “it takes way more than 5 minutes to get a cup of coffee here”.

Your command line asked the test to generate 1,000 requests per second (with at most 10 requests in flight at any given time, -c 10), and your output shows that under that setup the system was only able to serve 127.56 requests per second on average. That means a backlog was accumulating throughout your 60-second test, and requests in that backlog experience multi-second latencies dominated by wait time before they ever hit the wire for your shark to measure.

If the service rate were completely stable at the average of 127.56 reqs per second, you would have processed ~7,652 requests during the 60 seconds the test ran for, while accumulating a backlog of ~872 requests every second. A total of 60,000 requests would be eligible to start during the 60-second window at your requested rate of 1,000/sec, but most of those (52,348) will never complete within the test window. Of the ones that do complete by the 60-second mark, the last one, which is the 7,652nd request sent, was supposed to be sent 7.652 seconds after the test period started (at 1,000/sec as requested), and would have taken 52.348 seconds from when it was eligible to be sent until a response was received. But all that Wireshark will see of that time is the tiny fraction of a second the request spends on the wire during the last gasp of the 60-second test period.
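To make the arithmetic above concrete, here is a small Python sketch of the same back-of-the-envelope calculation. The constants come from the test output quoted in this thread; integer truncation puts the results within one request of the rounded figures above:

```python
# Back-of-the-envelope model of a 60 s test that asks for 1,000 req/s
# against a server that only completes 127.56 req/s on average.
REQUESTED_RATE = 1000.0   # req/s (--rate 1000)
SERVICE_RATE = 127.56     # req/s (average from the wrk2 output)
DURATION = 60.0           # s (-d 60s)

completed = int(SERVICE_RATE * DURATION)        # requests actually served
eligible = int(REQUESTED_RATE * DURATION)       # requests due to start
backlog_per_sec = REQUESTED_RATE - SERVICE_RATE
never_completed = eligible - completed          # still queued at the end

# The last request to complete was due on the wire at:
send_time = completed / REQUESTED_RATE          # seconds into the test
experienced_wait = DURATION - send_time         # eligibility -> response

print(f"completed={completed}, never_completed={never_completed}")
print(f"backlog/s={backlog_per_sec:.2f}, last request waited {experienced_wait:.3f} s")
```

The "experienced wait" here is exactly the latency wrk2 reports and a packet capture never sees: the request spends almost all of it queued on the client side, waiting for its turn on one of the 10 connections.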

@wenxzhen (Author)

Thanks to @giltene.

@wenxzhen (Author)

@giltene appending another question here: how does wrk2 calculate the "mean lat." and "rate sampling interval"? Is it possible to do a warm-up before the test?

@giltene (Owner) commented Oct 30, 2019

There is already a built-in warm-up: a ~10-second "calibration" phase runs at full speed but is not counted in the eventual results.
https://github.com/giltene/wrk2/blob/master/src/wrk.c#L344 is triggered ~CALIBRATE_DELAY_MS into the start of the run.
Unfortunately, there is currently no flag to control this; CALIBRATE_DELAY_MS is a compile-time constant of 10000. You can always edit wrk.h and recompile wrk2....

@Fatahir commented Feb 18, 2020

@giltene do you know how to get the latency of requests per unit of time? I think we can do it in the done() function through "latency(i)", but I don't know how. The latency parameter in the done() function is userdata.
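For what it's worth, wrk (which wrk2 extends) documents a `done(summary, latency, requests)` hook whose `latency` userdata exposes `min`, `max`, `mean`, `stdev`, and `percentile(p)`, with times in microseconds. A sketch of a script that dumps aggregate figures from that hook (this is an assumption that wrk2 keeps wrk's documented interface; it only yields whole-run summary statistics, not per-request or per-interval values, so it does not by itself answer the per-unit-time question):

```lua
-- done() runs once after the test completes; `latency` here is a
-- summary histogram (userdata), not an array of individual samples.
function done(summary, latency, requests)
   -- latency values are in microseconds; divide by 1000 for ms
   io.write(string.format("mean latency: %.2f ms\n", latency.mean / 1000))
   for _, p in ipairs({ 50, 90, 99, 99.9 }) do
      io.write(string.format("p%g: %.2f ms\n", p, latency:percentile(p) / 1000))
   end
end
```

Run it by passing the script to wrk2 with `-s`, the same way `header.lua` is passed in the command line earlier in this thread.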
