Fix deletes with subqueries and compression #6789
Conversation
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #6789 +/- ##
==========================================
+ Coverage 80.06% 80.88% +0.81%
==========================================
Files 190 191 +1
Lines 37181 36513 -668
Branches 9450 9531 +81
==========================================
- Hits 29770 29534 -236
- Misses 2997 3174 +177
+ Partials 4414 3805 -609
☔ View full report in Codecov by Sentry.
LGTM
I would like somebody like @akuzm to take a look at this, since he is most familiar with DecompressChunk here, but the change seems straightforward.
-- check that DML causes transparent decompression and that
-- data gets shifted to the uncompressed parts
EXPLAIN (costs off) DELETE FROM test_partials WHERE time <> ALL(SELECT time from test_partials);
This delete query shouldn't actually touch anything, right? So the fact that the data is decompressed here is a missing optimization that we should add at some point. Can you change this to actually delete one row from a compressed batch, to make the test more robust? Also might be good to add one chunk with at least two compressed batches, so that both parts of the partial chunk path are tested.
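The two-batch setup suggested here can be produced by inserting more rows into one chunk than fit in a single compressed batch. A sketch, assuming an illustrative table (not from the PR) and TimescaleDB's usual limit of roughly 1000 rows per compressed batch:

```sql
-- Hypothetical setup; table and column names are illustrative.
-- TimescaleDB packs at most ~1000 rows per compressed batch, so inserting
-- well over 1000 rows into one chunk yields multiple batches on compression.
CREATE TABLE batch_test (time timestamptz NOT NULL, device int, value float);
SELECT create_hypertable('batch_test', 'time');

INSERT INTO batch_test
SELECT '2024-01-01'::timestamptz + (i || ' seconds')::interval, 1, i
FROM generate_series(1, 2500) i;  -- 2500 rows in one chunk -> several batches

ALTER TABLE batch_test SET (timescaledb.compress,
                            timescaledb.compress_segmentby = 'device');
SELECT compress_chunk(c) FROM show_chunks('batch_test') c;

-- Deleting a single row from inside one batch forces decompression of the
-- affected batch, leaving the chunk partial (compressed + uncompressed data).
DELETE FROM batch_test
WHERE time = '2024-01-01'::timestamptz + interval '42 seconds';
```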
@akuzm I believe it could be a chicken-and-egg thing, because unless we decompress, we might not get access to all the values that we need to check against.
@akuzm there are other tests which test out various scenarios with a mix of multiple partial, complete compressed, uncompressed chunks in tsl/test/sql/compression_ddl.sql already.
> @akuzm I believe it could be a chicken and egg thingy because unless we decompress we might not get access to all the values that we need to check against.
I mean, imagine in the future we have a more optimal path for DELETEs, for example through TAM, and it's not going to decompress anything if the WHERE doesn't match anything. So the test will stop working. It would be good if some rows actually matched the clause and were deleted; e.g. if it was time >= ALL(...), then the latest row would match.
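The suggested variant might look like this sketch (only the `time` column of `test_partials` is assumed from the test):

```sql
-- With ">= ALL", the row holding the maximum time satisfies the clause,
-- so the DELETE genuinely removes one row and must decompress its batch,
-- keeping the test meaningful even if a future path skips decompression
-- for non-matching WHERE clauses.
EXPLAIN (costs off)
DELETE FROM test_partials WHERE time >= ALL(SELECT time FROM test_partials);
DELETE FROM test_partials WHERE time >= ALL(SELECT time FROM test_partials);
```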
Ok, changed
-- data gets shifted to the uncompressed parts
EXPLAIN (costs off) DELETE FROM test_partials WHERE time <> ALL(SELECT time from test_partials);
DELETE FROM test_partials WHERE time <> ALL(SELECT time from test_partials);
-- P, P, P
Looks like this comment needs updating :)
Umm, all 3 chunks will now become partial, right? That's why the P, P, P.
Ah OK, I didn't understand what it means and thought it was some temporary comment :)
Reworded the comment :-)
Not very familiar with this code tbh, but this looks like an improvement. The test should be made more robust though.
For UPDATEs and DELETEs involving a compressed chunk, the code decompresses the relevant data into the uncompressed portion of the chunk. This happens during execution, so if the planner has no plan for the uncompressed part of the chunk, we might miss scanning those decompressed rows. We now check during planning itself for the possibility of a compressed chunk becoming partial, and add an Append plan on top of scans of the compressed and uncompressed parts.
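The scenario the description fixes can be reproduced end to end; a hedged sketch, with illustrative names, using standard TimescaleDB APIs (`create_hypertable`, `compress_chunk`, `show_chunks`):

```sql
-- Before this fix, a DELETE with a subquery could plan only the compressed
-- side of a chunk; rows decompressed into the uncompressed part during
-- execution could then be missed. With the fix, the planner emits an
-- Append over both the compressed and uncompressed scans of such a chunk.
CREATE TABLE metrics (time timestamptz NOT NULL, value float);
SELECT create_hypertable('metrics', 'time');

INSERT INTO metrics
SELECT t, 1.0
FROM generate_series('2024-01-01'::timestamptz,
                     '2024-01-02'::timestamptz,
                     '1 minute'::interval) t;

ALTER TABLE metrics SET (timescaledb.compress);
SELECT compress_chunk(c) FROM show_chunks('metrics') c;

-- The DELETE decompresses affected batches into the uncompressed part of
-- the chunk; the plan should now cover both parts rather than only the
-- DecompressChunk scan of the compressed side.
EXPLAIN (costs off)
DELETE FROM metrics WHERE time >= ALL(SELECT time FROM metrics);
```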