compute: use drop_dataflow #18442

Draft

teskje wants to merge 6 commits into main from drop-dataflow

Conversation

@teskje (Contributor) commented Mar 28, 2023

I'm using this PR to track testing of drop_dataflow for dropping dataflows in COMPUTE.

Testing

  • CI passes
    • PR checks
    • Nightlies
  • Cleanup of log events vetted
    • ComputeLog
      • DataflowCurrent (retracted in handle_allow_compaction)
      • DataflowDependency (retracted in handle_allow_compaction)
      • FrontierCurrent (retracted in handle_allow_compaction)
      • ImportFrontierCurrent (fixed by compute: make import frontier logging drop-safe #18531)
      • FrontierDelay (retracted in handle_allow_compaction)
      • PeekCurrent (retracted when peek is served or canceled)
      • PeekDuration (never retracted)
    • TimelyLog
      • Operates (shutdown event emitted in Drop impls for operators and dataflows)
      • Channels (retracted on dataflow shutdown)
      • Elapsed (retracted on operator retraction)
      • Histogram (retracted on operator retraction)
      • Addresses (retracted on operator or channel retraction)
      • Parks (never retracted)
      • MessagesSent (retracted on channel retraction)
      • MessagesReceived (retracted on channel retraction)
      • Reachability (fixed by Drop implementation for Tracker TimelyDataflow/timely-dataflow#517)
    • DifferentialLog
      • ArrangementBatches (DD emits drop events in the Spine Drop impl)
      • ArrangementRecords (DD emits drop events in the Spine Drop impl)
      • Sharing (DD emits retraction events in the TraceAgent Drop impl)
  • No resource leaks or crashes on staging

Motivation

  • This PR adds a known-desirable feature.

Advances #2392.

Tips for reviewer

Checklist

  • This PR has adequate test coverage / QA involvement has been duly considered.
  • This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
  • This PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way) and therefore is tagged with a T-proto label.
  • If this PR will require changes to cloud orchestration, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
  • This PR includes the following user-facing behavior changes:

@teskje (Contributor, Author) commented Mar 30, 2023

For stability testing, I ran the following script against my staging env:

#!/usr/bin/env python3

from threading import Thread

import pg8000.native

def run_loop(sql):
    # Each thread opens its own connection and replays its workload forever.
    conn = pg8000.native.Connection([...])
    while True:
        try:
            for line in sql:
                conn.run(line)
        except Exception as exc:
            print(f"error: {exc}")

q1 = """
SELECT
	l_returnflag,
	l_linestatus,
	sum(l_quantity) AS sum_qty,
	sum(l_extendedprice) AS sum_base_price,
	sum(l_extendedprice * (1 - l_discount)) AS sum_disc_price,
	sum(l_extendedprice * (1 - l_discount) * (1 + l_tax)) AS sum_charge,
	avg(l_quantity) AS avg_qty,
	avg(l_extendedprice) AS avg_price,
	avg(l_discount) AS avg_disc,
	count(*) AS count_order
FROM
	lineitem
WHERE
	l_shipdate <= DATE '1998-12-01' - INTERVAL '60' day
GROUP BY
	l_returnflag,
	l_linestatus
ORDER BY
	l_returnflag,
	l_linestatus
"""

workloads = [
    # one-off selects
    [q1],
    # indexes
    [
        f"CREATE VIEW v_indexes AS {q1}",
        "CREATE DEFAULT INDEX on v_indexes",
        "SELECT mz_internal.mz_sleep(10)",
        "DROP VIEW v_indexes",
    ],
    # indexes fast
    [
        f"CREATE VIEW v_indexes_fast AS {q1}",
        "CREATE DEFAULT INDEX on v_indexes_fast",
        "DROP VIEW v_indexes_fast",
    ],
    # MVs
    [
        f"CREATE MATERIALIZED VIEW mv_mvs AS {q1}",
        "SELECT mz_internal.mz_sleep(10)",
        "DROP MATERIALIZED VIEW mv_mvs",
    ],
    # MVs fast
    [
        f"CREATE MATERIALIZED VIEW mv_mvs_fast AS {q1}",
        "DROP MATERIALIZED VIEW mv_mvs_fast",
    ],
    # subscribes
    [
        f"DECLARE c CURSOR FOR SUBSCRIBE ({q1}); FETCH 1 c",
    ],
]

threads = []
for sql in workloads:
    thread = Thread(target=run_loop, args=(sql,))
    thread.start()
    threads.append(thread)

for t in threads:
    t.join()

I let that run against a replica with the following configuration:

  • cpu_limit: 6
  • memory_limit: 48GiB
  • scale: 4
  • workers: 5

(This is a "small" instance but with the "scale" bumped up to ensure we also test multi-process communication.)

After 15 hours, everything looks good! None of the replica pods crashed or produced errors. Resource usage looks stable throughout the entire time frame:
[Screenshot: replica resource usage over the 15-hour test run, 2023-03-30]

After shutting the test down, there don't appear to be any logging leaks: All introspection sources only contain records from system dataflows.

@teskje (Contributor, Author) commented Mar 31, 2023

My initial test on staging had two flaws:

  • The DDL statements seemed to overwhelm environmentd and made it very slow to handle queries, which is why only 1300 dataflows were created and dropped during the 15 hours.
  • SELECT statements set an until frontier, which makes the dataflow shut down immediately after having produced the required snapshot, so drop_dataflow is not really exercised by SELECTs.

Because of these issues, I ran a second test with the following changes:

  • The load generation script runs only SELECTs, no DDL statements, in four threads (see the sketch after this list).
  • The adapter code is patched to not set the until frontier when planning peeks.
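
As a sketch of what that looks like (the exact patched script is not included here), the load generator reduces to the following, reusing the `run_loop` helper and the `q1` query from the script above:

# Four threads, each running only one-off SELECTs of q1 in a loop.
workloads = [[q1]] * 4

threads = [Thread(target=run_loop, args=(sql,)) for sql in workloads]
for t in threads:
    t.start()
for t in threads:
    t.join()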

Apart from that, I also increased the number of replica processes, to improve the odds of surfacing issues in inter-process communication. This is the new replica configuration:

  • cpu_limit: 4
  • memory_limit: 8GiB
  • scale: 16
  • workers: 3

After 18 hours, 123,000 dataflows have been created and dropped. As with the first test, none of the replica pods crashed or produced errors, and resource usage looks stable throughout the entire time frame:

[Screenshot: replica resource usage over the 18-hour test run, 2023-03-31]

(The stray green line is me switching on memory profiling.)

benesch added a commit to benesch/timely-dataflow that referenced this pull request Mar 31, 2023
Between 2673e8c and ff516c8, drop_dataflow appears to be working
well, based on testing in Materialize
(MaterializeInc/materialize#18442). So remove the "public beta" warning
from the docstring.

Fix TimelyDataflow#306.

@teskje force-pushed the drop-dataflow branch 2 times, most recently from f40e78b to a9420f1 on April 4, 2023 at 10:43

This commit moves some code around in preparation for adding support for
active dataflow cancellation. These changes are not required, but they
slightly improve readability.

* Add a `ComputeState` constructor method. This allows us to make some
  of the `ComputeState` fields immediately private.
* Factor out collection dropping code from `handle_allow_compaction`
  into a `drop_collection` method.

This commit patches timely to get drop-safety for reachability log
events (TimelyDataflow/timely-dataflow#517).

We need to revert this before we can merge.

This commit implements active dataflow cancellation in compute by
invoking timely's `drop_dataflow` method when a dataflow is allowed to
compact to the empty frontier.

This commit adds a test verifying that active dataflow cancellation
actually works. It does so by installing a divergent dataflow, dropping
it and then checking the introspection sources to ensure it doesn't
exist anymore.
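
(For illustration only, here is a hedged Python sketch of that test's idea, in the style of the load script above; the actual check is a testdrive test. The `conn` connection, the `divergent` view name, and the `mz_internal.mz_dataflows` introspection relation are assumptions made for this sketch.)

# Hedged sketch; assumes `conn` is a pg8000 connection as in the load script
# above and that installed dataflows are listed in mz_internal.mz_dataflows.

# Install a dataflow that never reaches a fixpoint.
conn.run("""
    CREATE MATERIALIZED VIEW divergent AS
    WITH MUTUALLY RECURSIVE flip (x INTEGER) AS (
        VALUES (1) EXCEPT ALL SELECT * FROM flip
    )
    SELECT * FROM flip
""")

# Drop it. With active dataflow cancellation enabled, the dataflow should be
# torn down promptly instead of spinning forever.
conn.run("DROP MATERIALIZED VIEW divergent")

# The dropped dataflow should no longer appear in introspection. (A robust
# check would retry this query, since the teardown is asynchronous.)
rows = conn.run(
    "SELECT name FROM mz_internal.mz_dataflows WHERE name LIKE '%divergent%'"
)
assert not rows, f"dataflow still present: {rows}"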
@philip-stoev (Contributor) commented:

I would like to stress test this with the RQG. @teskje, please let me know when the time is right to do this.

@teskje (Contributor, Author) commented Apr 5, 2023

@philip-stoev The time is right now, please go ahead! Note that you have to set the active_dataflow_cancellation feature flag to enable this feature.
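
For example, assuming the flag is exposed as a system variable under that name and that your session is privileged enough to set it (the exact variable name and mechanism may differ), enabling it could look like:

# Assumption only: the flag is settable via ALTER SYSTEM by a privileged
# user; the exact variable name and mechanism may differ.
conn.run("ALTER SYSTEM SET active_dataflow_cancellation = true")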

@philip-stoev (Contributor) commented:

Item no. 1: this situation does not produce a cancellation:

  1. Run:

     CREATE TABLE t1 (f1 INTEGER);
     INSERT INTO t1 WITH MUTUALLY RECURSIVE flip(x INTEGER) AS (VALUES(1) EXCEPT ALL SELECT * FROM flip) SELECT * FROM flip;

  2. Kill the psql client from the outside using killall.

@philip-stoev (Contributor) left a review comment:

I pushed extensions to your testdrive test; please take them along for the ride to main.

As a separate effort, to complement your test, I created a stress test around a divergent WMR dataflow plus INSERT ... SELECT statements and SUBSCRIBE cursors. Unfortunately:

  • INSERT ... SELECTs run only one statement at a time
  • cursors cause CRDB to just consume CPU like crazy
  • there must be some other bottleneck in the adapter because even SET queries are slow under load.

A size 8-8 cluster was used.

So I do not think I was able to drive as much concurrency as one would want. Either way, there were dozens of active subscribe dataflows in the system (and all of them would be subject to forcible cancellation). There were no panics, deadlocks, or obvious memory leaks.
