
Pull Queries: Perf test #3545

Closed
big-andy-coates opened this issue Oct 11, 2019 · 8 comments

big-andy-coates commented Oct 11, 2019

Current state:

Based on a simple setup like the one below:

CREATE STREAM orders_stream (order_id STRING, item_id INTEGER, qty INTEGER) WITH (KAFKA_TOPIC='orders_stream', PARTITIONS=1, REPLICAS=1, VALUE_FORMAT='JSON');

INSERT INTO orders_stream(order_id, item_id, qty) VALUES ('order-1', 1, 1);
INSERT INTO orders_stream(order_id, item_id, qty) VALUES ('order-1', 2, 3);
INSERT INTO orders_stream(order_id, item_id, qty) VALUES ('order-2', 1, 2);


SET 'auto.offset.reset'='earliest';
CREATE TABLE order_quantities AS
  SELECT order_id,
         sum(qty) as total_qty
  FROM orders_stream
  GROUP BY order_id;


SELECT * FROM order_quantities WHERE ROWKEY = 'order-1';

We found a few bottlenecks.
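For context on where the time should (and should not) go: the aggregation is maintained incrementally as rows arrive, so a pull query is conceptually just a key lookup into the table's materialized state. A minimal Java sketch of that data path (the `HashMap` here is an illustrative stand-in for the Kafka Streams state store, not the actual implementation):

```java
import java.util.HashMap;
import java.util.Map;

public class PullQuerySketch {
    public static void main(String[] args) {
        // Rows inserted into orders_stream: (order_id, item_id, qty)
        Object[][] rows = {
            {"order-1", 1, 1},
            {"order-1", 2, 3},
            {"order-2", 1, 2},
        };

        // order_quantities: GROUP BY order_id, SUM(qty), maintained
        // incrementally as rows arrive (stand-in for the state store).
        Map<String, Integer> orderQuantities = new HashMap<>();
        for (Object[] row : rows) {
            orderQuantities.merge((String) row[0], (Integer) row[2], Integer::sum);
        }

        // SELECT * FROM order_quantities WHERE ROWKEY = 'order-1';
        // is conceptually just this point lookup.
        System.out.println(orderQuantities.get("order-1")); // prints 4
    }
}
```

The lookup itself is O(1); the overheads discussed below (parsing, compilation, plan building) sit in front of it on every request.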


apurvam commented Oct 25, 2019

@vinothchandar I believe you are already working on this? Maybe reassign?

@vinothchandar

done. makes sense.

@vinothchandar

Goals going forward:

  • Get the KSQL-level overheads (parsing, compilation) to under 1 ms
  • Run a benchmark generating orders data using the ksql-datagen tool and the wrk workload generator (for ease of reproducibility by everyone)

@vinothchandar

Benchmark setup:

Use ksql-datagen to generate some orders (more command-line opts to be added):

ksql-datagen quickstart=orders topic=orders_topic

KSQL queries to set up the final table. We decide how many orders we want in the table for pull queries (1000 in the example below) and map each orders_raw row to an order ID in that range.
This involves a UDF, randomstr(min, max), which generates a random string to help vary the row sizes in storage, and a UDAF, STR_MAX, which implements a max over strings, used to update the value for a given order ID and keep one string in the agg_order_data column.

SET 'auto.offset.reset'='earliest';
CREATE STREAM orders_raw (
        ordertime BIGINT,
        orderid INT,
        itemid VARCHAR,
        orderunits DOUBLE,
        address STRUCT<
            city VARCHAR,
            state VARCHAR,
            zipcode INT>)
     WITH (
        KAFKA_TOPIC='orders_topic',
        VALUE_FORMAT='JSON');

CREATE STREAM orders_stream AS
SELECT
  *,
  CONCAT('order-', CAST(CAST(FLOOR(RANDOM() * 1000) AS BIGINT) AS VARCHAR)) as gen_orderid,
  RANDOMSTR(10, 20) as order_data
FROM orders_raw;


SET 'auto.offset.reset'='earliest';
CREATE TABLE order_quantities AS
SELECT
  gen_orderid,
  sum(orderunits) as total_qty,
  STR_MAX(order_data) as agg_order_data
FROM orders_stream
GROUP BY gen_orderid;
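For reference, a minimal Java sketch of the core logic the two helpers need (illustrative only; these are not the actual UDF/UDAF implementations, and the ksqlDB annotation plumbing is omitted):

```java
import java.util.Random;

public class OrderUdfSketch {
    // Hypothetical core of RANDOMSTR(min, max): a random alphanumeric
    // string whose length is uniform in [min, max], used to vary row
    // sizes in storage.
    static String randomStr(int min, int max, Random rnd) {
        int len = min + rnd.nextInt(max - min + 1);
        String alphabet = "abcdefghijklmnopqrstuvwxyz0123456789";
        StringBuilder sb = new StringBuilder(len);
        for (int i = 0; i < len; i++) {
            sb.append(alphabet.charAt(rnd.nextInt(alphabet.length())));
        }
        return sb.toString();
    }

    // Hypothetical aggregate step of STR_MAX: keep the lexicographically
    // larger string, so each order id retains a single value.
    static String strMax(String agg, String next) {
        return (agg == null || next.compareTo(agg) > 0) ? next : agg;
    }

    public static void main(String[] args) {
        Random rnd = new Random();
        String s = randomStr(10, 20, rnd);
        System.out.println(s.length());          // between 10 and 20
        System.out.println(strMax("abc", "abd")); // prints abd
    }
}
```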

Once we have this, the following Lua script generates pull queries that pick a random order in the range and query it, for benchmarking:

[ksql-benchmark]$ cat orders-pull-query-bench.lua 
-- Generates pull queries that fetch out random orders from KSQL 
num_orders = 1000

function init(args)
   print("ksqlDB workload generator")
end

request = function()
   wrk.method = "POST"
   wrk.body   = '{"ksql":"SELECT * FROM order_quantities WHERE ROWKEY = \'order-' .. math.random(num_orders) .. '\';"}'
   wrk.headers["Content-Type"] = "application/vnd.ksql.v1+json"
   return wrk.format(nil, nil)
end


[ksql-benchmark]$ wrk -t 1 -c 1 -d 5 --latency -s ./orders-pull-query-bench.lua http://localhost:8088/ksql
Running 5s test @ http://localhost:8088/ksql
  1 threads and 1 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency   180.76us  282.09us   5.29ms   98.83%
    Req/Sec     6.17k   589.59     6.86k    78.43%
  Latency Distribution
     50%  139.00us
     75%  163.00us
     90%  215.00us
     99%  786.00us
  31297 requests in 5.10s, 143.48MB read
  Non-2xx or 3xx responses: 31297
Requests/sec:   6137.02
Transfer/sec:     28.13MB
------------------------------


vpapavas commented Nov 1, 2019

This is a flame graph of a pull query with the above-mentioned fixes:

[Screenshot: flame graph of a pull query, 2019-10-31]

The next bottleneck seems to be building the logical plan #3709

@vinothchandar

With #3542 and #3663 merged, ksql can do about 2.5K-3K pull queries per box at < 20ms p90 latency.


rodesai commented Nov 6, 2019

@vinothchandar can you add the specs of the environment you tested on? (cloud provider, instance type, memory, cpu specs, storage used, jvm settings)


vpapavas commented Nov 7, 2019

Cloud provider = AWS
Instance type = i3.xlarge
Memory = 32GB
CPU = 4 procs, 2 cores
Storage = SSD
jvm settings = no extra settings
