Should we ban full outer join for streaming query? #8084

chenzl25 · 2023-02-21T07:39:53Z

Describe the bug

A null row from either left or right side produces the same row (null, null) to the downstream.

create table t (a int primary key);
insert into t values(null);
create materialized view v as select t1.* from t as t1 full join t as t2 on t1.a = t2.a; -- panic

To Reproduce

No response

Expected behavior

No response

Additional context

No response

The text was updated successfully, but these errors were encountered:

chenzl25 · 2023-02-21T07:46:05Z

cc @yuhao-su

fuyufjh · 2023-02-21T07:48:43Z

What's the cause of panic?

chenzl25 · 2023-02-21T07:58:59Z

What's the cause of panic?

left side: +[null] --> Full Join -> +[null, null]
right side: +[null] --> Full Join -> +[null, null]

From the downstream's view, the full join operator inserts the same row twice.

yuhao-su · 2023-02-21T08:09:31Z

The correct behavior should be:
left side: +[null] --> Full Join -> +[null, null]
left side: +[null] --> Full Join -> -[null, null] +[null, null]
It might be a bug

chenzl25 · 2023-02-21T08:12:48Z

The correct behavior should be: left side: +[null] --> Full Join -> +[null, null] left side: +[null] --> Full Join -> -[null, null] +[null, null] It might be a bug

I don't think it is a bug. The batch query will output 2 rows instead of one row.

yuhao-su · 2023-02-21T08:27:22Z

I don't think it is a bug. The batch query will output 2 rows instead of one row.

You are right! I can't think of any easy way to fix this. Maybe we should ban it for now. cc. @st1page

st1page · 2023-02-21T08:38:21Z

😇 in fact, I prefer to ban all null primary keys. The only origin of that is Materialized view with GROUP BY K where k could be null. But if there any users need that behavior?

BugenZhao · 2023-02-21T10:16:20Z

Link to #8059.

fuyufjh · 2023-02-21T10:31:16Z

😇 in fact, I prefer to ban all null primary keys. The only origin of that is Materialized view with GROUP BY K where k could be null. But if there any users need that behavior?

Group by CUBE will generate a NULL group by default. :lark-cry:

fuyufjh · 2023-02-21T10:37:16Z

I prefer to keep the full outer join. In batch query it's almost useless but I guess it might be more useful for stream joining stream. Just guess.

But I don't have any idea to fix it now 🤔

chenzl25 · 2023-02-21T11:33:39Z

Maybe we can use a trick like I used in union all operator before. Use a project plus a constant (indicating which side) to extend the stream key of both sides. In this way, we can tell the difference between left side +[null] and right side +[null], because they will become +[null, 0] and +[null, 1].

dev=> explain create materialized view v as select * from t union all select * from t;
                                  QUERY PLAN
-------------------------------------------------------------------------------
 StreamMaterialize { columns: [a, 0:Int32(hidden)], pk_columns: [a, 0:Int32] }
 └─StreamUnion { all: true }
   ├─StreamExchange { dist: HashShard(t.a, 0:Int32) }
   | └─StreamProject { exprs: [t.a, 0:Int32] }
   |   └─StreamTableScan { table: t, columns: [a] }
   └─StreamExchange { dist: HashShard(t.a, 1:Int32) }
     └─StreamProject { exprs: [t.a, 1:Int32] }
       └─StreamTableScan { table: t, columns: [a] }
(8 rows)

fuyufjh · 2023-02-22T03:31:18Z

Maybe we can use a trick like I used in union all operator before. Use a project plus a constant (indicating which side) to extend the stream key of both sides. In this way, we can tell the difference between left side +[null] and right side +[null], because they will became +[null, 0] and +[null, 1].

I think this is the correct direction. Let me explain more about my thoughts.

Theoretically, you may consider a full outer join as a union all which combines the results from 3 ways:

All the results of the "inherent" inner join i.e. [left_row, right_row]
Result of [left_row, NULLs] for those not matched from left side
Result of [NULLs, right_row] for those not matched from right side

For the left outer join, only 1 & 2 exists. Luckily, they must be non-conflict because a left_row must belong to either 1 or 2, not both, so the left_row ensures the uniqueness of left_pk in the result's PK (which is [left_pk, right_pk])

While, for the full outer join, the problem happened because result rows in 2 & 3 can be conflicted, as @chenzl25's example shows.

Thus, I think adding a column to mark the "source" (1/2/3, as explained above) of the result row is the correct solution, but might be too heavy.

fuyufjh · 2023-02-22T03:31:24Z

Another way to mitigate the problem is to forbid null PKs on base tables. I know this cannot solve the problem completely because you can construct a MView with aggregation, but it can reduce the odds hopefully.

By the way, PG also rejects null PK:

dev=# create table t1 (pk int, jk int, primary key (pk));
CREATE TABLE
dev=# insert into t1 values (null, 5);
ERROR:  null value in column "pk" of relation "t1" violates not-null constraint

yuhao-su · 2023-02-22T06:20:35Z

By the way, PG also rejects null PK

Yes, we can simply remove the pk constraint and get the same result. I think PG will add a hidden column on the source in this case.

I can't think of any way to fully solve this problem by banning null pk from the source since we have agg. So I prefer adding a column to mark the "source" solution. The cost of adding 1 column on two sides only in full outer join sound acceptable to me.

st1page · 2023-03-08T09:43:05Z

Another way to mitigate the problem is to forbid null PKs on base tables. I know this cannot solve the problem completely because you can construct a MView with aggregation, but it can reduce the odds hopefully.

By the way, PG also rejects null PK:
dev=# create table t1 (pk int, jk int, primary key (pk));
CREATE TABLE
dev=# insert into t1 values (null, 5);
ERROR:  null value in column "pk" of relation "t1" violates not-null constraint

How about just adding a filter to ignore NULL stream key for the full outer join as a workaround.

yuhao-su · 2023-03-08T09:48:37Z

How about just adding a filter to ignore NULL stream key for the full outer join.

This will provide incorrect result 🥵

st1page · 2023-03-08T09:58:57Z

we can control the incorrect field with a more narrow predicate on the output of the outer join. just add a filter to remove the outer join's output rows where the output stream key is NULL.

SELECT * from t1 full outer join t2 on t1.pk = t2.pk;
/*will be transformed to*/
SELECT * from (
    SELECT * from t1 full outer join t2 on t1.pk = t2.pk;
) where NOT (t1.pk IS NULL AND t2.pk IS NULL)

liurenjie1024 · 2023-03-09T07:45:38Z

Why we don't just check every derived pk in plan node and check that it can't be all nullable?

st1page · 2023-03-09T08:06:04Z

innocent in fact, I prefer to ban all null primary keys. The only origin of that is Materialized view with GROUP BY K where k could be null. But if there any users need that behavior?

Group by CUBE will generate a NULL group by default. :lark-cry:

@liurenjie1024 :lark_cry

liurenjie1024 · 2023-03-13T06:06:43Z

I think this is a bug in our optimizer to determine the primary key of each streaming plan node. In full outer join, the pk + join key may not be unique when pk can be null, and in this case we may need to add extra column to ensure uniqueness.

fuyufjh · 2023-03-13T08:27:05Z

I think this is a bug in our optimizer to determine the primary key of each streaming plan node. In full outer join, the pk + join key may not be unique when pk can be null, and in this case we may need to add extra column to ensure uniqueness.

Correct, but we need to trade off between this very rare cases and complexity. I agree with @st1page that we can tolerate this incorrect behavior (i.e. just don't panic) in an off-line discussion

liurenjie1024 · 2023-03-13T09:23:34Z

Correct, but we need to trade off between this very rare cases and complexity. I agree with @st1page that we can tolerate this incorrect behavior (i.e. just don't panic) in an off-line discussion

I feel weird to allow incorrect behavior. Why not just ban such kind of query which may cause this wrong behavior? I think this is much effort.

st1page · 2023-03-14T03:14:15Z

We will ban it when we have "not null" property in the optimizer and now workaround in #8520

chenzl25 added the type/bug Something isn't working label Feb 21, 2023

github-actions bot added this to the release-0.1.18 milestone Feb 21, 2023

yuhao-su self-assigned this Mar 3, 2023

st1page mentioned this issue Mar 14, 2023

fix(streaming): ignore null stream key from full outer join to workaround #8520

Merged

6 tasks

chenzl25 closed this as completed Mar 20, 2023

st1page mentioned this issue Sep 8, 2023

bug: MemTable error: Inconsistent operation with empty Delete Key #12140

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should we ban full outer join for streaming query? #8084

Should we ban full outer join for streaming query? #8084

chenzl25 commented Feb 21, 2023

chenzl25 commented Feb 21, 2023

fuyufjh commented Feb 21, 2023

chenzl25 commented Feb 21, 2023

yuhao-su commented Feb 21, 2023 •

edited

chenzl25 commented Feb 21, 2023

yuhao-su commented Feb 21, 2023

st1page commented Feb 21, 2023

BugenZhao commented Feb 21, 2023

fuyufjh commented Feb 21, 2023

fuyufjh commented Feb 21, 2023

chenzl25 commented Feb 21, 2023 •

edited

fuyufjh commented Feb 22, 2023

fuyufjh commented Feb 22, 2023 •

edited

yuhao-su commented Feb 22, 2023 •

edited

st1page commented Mar 8, 2023 •

edited

yuhao-su commented Mar 8, 2023

st1page commented Mar 8, 2023

liurenjie1024 commented Mar 9, 2023

st1page commented Mar 9, 2023

liurenjie1024 commented Mar 13, 2023

fuyufjh commented Mar 13, 2023

liurenjie1024 commented Mar 13, 2023

st1page commented Mar 14, 2023 •

edited

Should we ban full outer join for streaming query? #8084

Should we ban full outer join for streaming query? #8084

Comments

chenzl25 commented Feb 21, 2023

Describe the bug

To Reproduce

Expected behavior

Additional context

chenzl25 commented Feb 21, 2023

fuyufjh commented Feb 21, 2023

chenzl25 commented Feb 21, 2023

yuhao-su commented Feb 21, 2023 • edited

chenzl25 commented Feb 21, 2023

yuhao-su commented Feb 21, 2023

st1page commented Feb 21, 2023

BugenZhao commented Feb 21, 2023

fuyufjh commented Feb 21, 2023

fuyufjh commented Feb 21, 2023

chenzl25 commented Feb 21, 2023 • edited

fuyufjh commented Feb 22, 2023

fuyufjh commented Feb 22, 2023 • edited

yuhao-su commented Feb 22, 2023 • edited

st1page commented Mar 8, 2023 • edited

yuhao-su commented Mar 8, 2023

st1page commented Mar 8, 2023

liurenjie1024 commented Mar 9, 2023

st1page commented Mar 9, 2023

liurenjie1024 commented Mar 13, 2023

fuyufjh commented Mar 13, 2023

liurenjie1024 commented Mar 13, 2023

st1page commented Mar 14, 2023 • edited

yuhao-su commented Feb 21, 2023 •

edited

chenzl25 commented Feb 21, 2023 •

edited

fuyufjh commented Feb 22, 2023 •

edited

yuhao-su commented Feb 22, 2023 •

edited

st1page commented Mar 8, 2023 •

edited

st1page commented Mar 14, 2023 •

edited