As we can see that the top tests of plaintext and json are so close, the wrk become the bottleneck of those tests. I think we should make the server full, not the client full, so that we can know the real capability of the server. To achieve that, should we give more cpu cores for wrk? Another solution could be just run the wrk and the server on the same host, so that to make the whole system busy.