
Frames: Ensure nulls are read as default values when appropriate. #14020

Merged
LakshSingla merged 4 commits into apache:master from gianm:fix-frames-null-longs
Apr 9, 2023

Conversation

@gianm
Contributor

@gianm gianm commented Apr 4, 2023

Fixes a bug where LongFieldWriter didn't write a properly transformed zero when writing out a null. This had no meaningful effect in SQL-compatible null handling mode, because the field would get treated as a null anyway. But it does have an effect in default-value mode: it would cause Long.MIN_VALUE to get read out instead of zero.

Also adds NullHandling checks to the various frame-based column selectors, allowing reading of nullable frames by servers in default-value mode.

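The round-trip failure can be sketched in isolation. This is a minimal sketch assuming a sign-bit-flip transform of the kind frame field writers use to make longs byte-comparable; the class and method names are illustrative, not Druid's actual LongFieldWriter API:

```java
// Minimal sketch of the bug, assuming a sign-bit-flip transform that
// makes longs byte-comparable. Names are illustrative, not Druid's API.
public class LongNullRoundTrip
{
  // Transform applied on write: flip the sign bit so that unsigned
  // byte-wise comparison orders longs correctly.
  static long transform(long n)
  {
    return n ^ Long.MIN_VALUE;
  }

  // Inverse transform applied on read.
  static long detransform(long n)
  {
    return n ^ Long.MIN_VALUE;
  }

  public static void main(String[] args)
  {
    // Correct path: a null written as transform(0) reads back as 0.
    assert detransform(transform(0L)) == 0L;

    // The bug: writing a raw, untransformed zero for a null means the
    // reader's inverse transform produces Long.MIN_VALUE instead of 0.
    assert detransform(0L) == Long.MIN_VALUE;

    System.out.println("raw zero reads back as " + detransform(0L));
  }
}
```

In SQL-compatible mode the null flag masks the bad value, which is why the bug was invisible there; in default-value mode the value is read through, so Long.MIN_VALUE surfaces.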
@gianm gianm added the Bug and Area - MSQ (For multi stage queries - https://github.com/apache/druid/issues/12262) labels Apr 4, 2023
Comment on lines +146 to +150
final String sql = "INSERT INTO foo1\n"
+ "SELECT TIME_PARSE(dim1) AS __time, dim1 as cnt\n"
+ "FROM foo\n"
+ "PARTITIONED BY DAY\n"
+ "CLUSTERED BY dim1";
Contributor

If you were to add a WHERE TIME_PARSE(dim1) IS NOT NULL to the query, do the results of the two queries become the same regardless of mode? I think they should, but I'm curious.

Contributor Author

@gianm gianm Apr 5, 2023


I just tried it, and yes, they are the same in that case. Both queries ingest the two rows with parseable timestamps, and ignore the four with unparseable timestamps.

Contributor

@LakshSingla LakshSingla left a comment


Should the null handling be done in the StringFieldWriters as well? Consider the following test case in CalciteQueryTest (permalink), which works with the native engine but not with MSQ. Also, due to the ORDER BY, the order of the results can differ when comparing nulls versus when comparing null with "":

SELECT dim1, dim2, SUM(cnt) AS thecnt
FROM druid.foo
GROUP BY dim1, dim2
HAVING SUM(cnt) = 1
ORDER BY dim2
LIMIT 4
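The ordering difference described above can be sketched with hypothetical data (not Druid's test fixtures): in SQL-compatible mode a null is a distinct value that sorts first, while in default-value mode it has already been coerced to "" and sorts together with genuine empty strings.

```java
import java.util.Arrays;
import java.util.Comparator;

// Sketch of why ORDER BY dim2 can produce different orders in the two
// null handling modes. Data and names are illustrative.
public class NullOrdering
{
  // SQL-compatible mode: null is distinct and sorts before everything.
  static String[] sortSqlCompatible(String[] values)
  {
    String[] copy = values.clone();
    Arrays.sort(copy, Comparator.nullsFirst(Comparator.naturalOrder()));
    return copy;
  }

  // Default-value mode: nulls were coerced to "" on write, so they are
  // indistinguishable from genuine empty strings when sorting.
  static String[] sortDefaultValue(String[] values)
  {
    String[] copy = values.clone();
    for (int i = 0; i < copy.length; i++) {
      if (copy[i] == null) {
        copy[i] = "";
      }
    }
    Arrays.sort(copy);
    return copy;
  }

  public static void main(String[] args)
  {
    String[] dim2 = {"a", null, "", "b"};
    System.out.println(Arrays.toString(sortSqlCompatible(dim2))); // [null, , a, b]
    System.out.println(Arrays.toString(sortDefaultValue(dim2)));  // [, , a, b]
  }
}
```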

@gianm
Contributor Author

gianm commented Apr 5, 2023

Should the null handling be done in the StringFieldWriters as well? Consider the following test case in CalciteQueryTest (permalink) which works with the native engine, however doesn't with MSQ.

@LakshSingla I didn't update the StringFieldWriter (or other writers, generally) but I did update the StringFieldReader to return null rather than "" in default-value mode. That's consistent with how selectors behave on regular segments.

However, this didn't help with the test testGroupByLimitPushDownWithHavingOnLong, at least in default-value mode. I think it's because coerce is turning the null into "" in the native path, but not the MSQ path. I have another PR I'm working on to update that, which may unlock the ability to run the test case in MSQ.

I did update the test to run with MSQ in SQL-compat mode though, by adding:

    if (NullHandling.sqlCompatible()) {
      msqCompatible();
    }

@gianm
Contributor Author

gianm commented Apr 5, 2023

The failing test has to do with branch coverage in unit tests (jdk8, sql-compat=false) / processing_modules_test. That one isn't going to pass, since the new branches only execute when sql-compat=true. So I think that means we should ignore it.

Contributor

@LakshSingla LakshSingla left a comment


  1. To disambiguate the coerce mentioned above, I think you mean the one present in NativeQueryMaker, right? If so, this will also fix a few other test cases that I was seeing fail because of a type mismatch between DOUBLE and FLOAT, so I think that would be pretty cool.

  2. Do we not require the changes in the StringFrameColumnReader for appropriate null handling as well? I see that there is something done in the getStringUtf8 method, so I might be wrong

  3. I think we should handle the nulls appropriately in the following line (permalink) as well, when the StringFieldReader has figured out that the field is a NULL_BYTE. WDYT?

  4. Unrelated to the change, while digging through that piece of code, I found the following condition (permalink)

if ((dataLength == 0 && NullHandling.replaceWithDefault()) ||
    (dataLength == 1 && memory.getByte(dataStart) == FrameWriterUtils.NULL_STRING_MARKER)) {
        return null;
}

Should it instead be:

if ((dataLength == 0 && NullHandling.replaceWithDefault()) ||
    (dataLength == 1 && memory.getByte(dataStart) == FrameWriterUtils.NULL_STRING_MARKER && NullHandling.replaceWithDefault())) {
        return null;
}

Rest of the changes LGTM 🚀

@gianm
Contributor Author

gianm commented Apr 6, 2023

@LakshSingla

To disambiguate the coerce mentioned above, I think you mean the one present in NativeQueryMaker, right? If so, this will also fix a few other test cases that I was seeing fail because of a type mismatch between DOUBLE and FLOAT, so I think that would be pretty cool.

Yes, that's the one I mean. In a future PR I'm planning to use this same logic for MSQ results too.

Do we not require the changes in the StringFrameColumnReader for appropriate null handling as well? I see that there is something done in the getStringUtf8 method, so I might be wrong

It's already there: getStringUtf8 turns empty strings to nulls if NullHandling.replaceWithDefault().

I think we should handle the nulls appropriately in the following line (permalink) as well, when the StringFieldReader has figured out that the field is a NULL_BYTE. WDYT?

There's nothing special to do there, since in default-value mode, the convention is that nulls and empty strings are both returned as nulls from selectors. So the NULL_BYTE branch is already behaving as expected.

Unrelated to the change, while digging through that piece of code, I found the following condition (permalink)

I believe it's correct as-is. It's saying that we should return null in two cases:

  • dataLength == 0 && NullHandling.replaceWithDefault(): The frame contains an empty string, and NullHandling.replaceWithDefault() is true. In this mode, by convention, selectors return null instead of empty string. (The method name is misleading; it refers to what happens later, in query results, when nulls are turned back into empty strings.)
  • dataLength == 1 && memory.getByte(dataStart) == FrameWriterUtils.NULL_STRING_MARKER: The frame contains an actual null. Behavior here is the same in both modes: a null is returned.
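The two cases can be sketched as a standalone check. A plain byte[] stands in for Druid's Memory, and the marker value below is a placeholder, not FrameWriterUtils' actual constant:

```java
// Sketch of the two null-returning cases discussed above.
// The byte[] and marker value are stand-ins, not Druid's real layout.
public class StringFieldNullCheck
{
  // Placeholder for FrameWriterUtils.NULL_STRING_MARKER.
  static final byte NULL_STRING_MARKER = (byte) 0xFF;

  static boolean readsAsNull(byte[] data, boolean replaceWithDefault)
  {
    // Case 1: an empty string. Only null in default-value mode, where
    // selectors by convention return null instead of "".
    if (data.length == 0 && replaceWithDefault) {
      return true;
    }
    // Case 2: an actual null marker. Null in both modes, so no extra
    // replaceWithDefault check is needed on this branch; this is why
    // the suggested additional condition isn't required.
    return data.length == 1 && data[0] == NULL_STRING_MARKER;
  }

  public static void main(String[] args)
  {
    assert readsAsNull(new byte[0], true);                      // "" in default-value mode
    assert !readsAsNull(new byte[0], false);                    // "" stays "" in SQL mode
    assert readsAsNull(new byte[]{NULL_STRING_MARKER}, false);  // null in either mode
    assert readsAsNull(new byte[]{NULL_STRING_MARKER}, true);
  }
}
```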

@LakshSingla LakshSingla merged commit d52bc33 into apache:master Apr 9, 2023
@LakshSingla
Contributor

Thanks for the PR! Merging since codecov failures can be ignored due to #14020 (comment)

@gianm gianm deleted the fix-frames-null-longs branch April 14, 2023 01:53
cryptoe pushed a commit to cryptoe/druid that referenced this pull request Apr 14, 2023
…ache#14020)

* Frames: Ensure nulls are read as default values when appropriate.


(cherry picked from commit d52bc33)
gianm added a commit to gianm/druid that referenced this pull request Apr 18, 2023
The main change is the new tests: we now subclass CalciteJoinQueryTest
in CalciteSelectJoinQueryMSQTest twice, once for Broadcast and once for
SortMerge.

Three supporting production changes for default-value mode:

1) InputNumberDataSource is marked as concrete, to allow leftFilter to
   be pushed down to it.

2) In default-value mode, numeric frame field readers can now return nulls.
   This is necessary when stacking joins on top of joins: nulls must be
   preserved for semantics that match broadcast joins and native queries.

3) In default-value mode, StringFieldReader.isNull returns true on empty
   strings in addition to nulls. This is more consistent with the behavior
   of the selectors, which map empty strings to null as well in that mode.

As an effect of change (2), the InsertTimeNull change from apache#14020 (to
replace null timestamps with default timestamps) is reverted. IMO, this
is fine, as either behavior is defensible, and the change from apache#14020
hasn't been released yet.
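Change (3) can be illustrated with a minimal sketch. The method mirrors the described StringFieldReader.isNull behavior, with a boolean parameter standing in for NullHandling.replaceWithDefault(); it is not Druid's actual implementation:

```java
// Illustrative sketch of change (3): in default-value mode, an empty
// string is reported as null, matching how selectors behave in that
// mode. The flag stands in for NullHandling.replaceWithDefault().
public class IsNullSketch
{
  static boolean isNull(String value, boolean replaceWithDefault)
  {
    if (value == null) {
      return true;
    }
    // Default-value mode: empty strings count as null too.
    return replaceWithDefault && value.isEmpty();
  }

  public static void main(String[] args)
  {
    assert isNull(null, false);   // a real null is null in both modes
    assert isNull(null, true);
    assert isNull("", true);      // "" is null only in default-value mode
    assert !isNull("", false);
    assert !isNull("x", true);
  }
}
```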
gianm added a commit that referenced this pull request Apr 25, 2023
* MSQ: Subclass CalciteJoinQueryTest, other supporting changes.


* Adjust tests.

* Style fix.

* Additional tests.
clintropolis pushed a commit to clintropolis/druid that referenced this pull request May 8, 2023
…ache#14020)

@clintropolis clintropolis added this to the 26.0 milestone May 8, 2023

Labels

Area - Documentation, Area - MSQ (For multi stage queries - https://github.com/apache/druid/issues/12262), Bug


5 participants