
Use data inheritance from VirtualTupleTableSlot in compressed batch #6615

Merged: 1 commit merged into timescale:main on Feb 19, 2024

Conversation

@akuzm (Member) commented on Feb 7, 2024

This simplifies passing the columnar data out of the DecompressChunk node to the Vectorized Aggregation node which we plan to implement. It should also improve memory locality and bring us closer to the architecture used in the TAM for ArrowTupleSlot.

Disable-check: force-changelog-file
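
For context, the "data inheritance" named in the title is the common PostgreSQL pattern of embedding the slot as the first member of a larger struct, so one pointer serves both as the batch state and as the TupleTableSlot handed to the parent node. A minimal sketch of the idea follows; the struct and function names here are illustrative assumptions, not the actual TimescaleDB definitions:

#include "postgres.h"
#include "executor/tuptable.h"

typedef struct BatchStateSketch
{
	/* Must stay the first member so that casts in both directions are valid. */
	VirtualTupleTableSlot decompressed_slot_data;

	/* Per-batch decompression state lives right next to the slot. */
	int			total_batch_rows;
	int			next_batch_row;
} BatchStateSketch;

/* The parent node (e.g. a future vectorized aggregation) sees a plain slot. */
static inline TupleTableSlot *
batch_current_tuple_sketch(BatchStateSketch *batch_state)
{
	return &batch_state->decompressed_slot_data.base;
}

/* DecompressChunk can recover its batch state from that same slot pointer. */
static inline BatchStateSketch *
batch_state_from_slot_sketch(TupleTableSlot *slot)
{
	return (BatchStateSketch *) slot;
}

Keeping the slot and the per-batch state in one allocation is where the claimed memory-locality improvement would come from, and it mirrors the layout a table access method built around an ArrowTupleSlot would use.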

codecov bot commented on Feb 7, 2024

Codecov Report

Attention: 23 lines in your changes are missing coverage. Please review.

Comparison is base (59f50f2) 80.06% compared to head (af72722) 81.47%.
Report is 15 commits behind head on main.

Files Patch % Lines
tsl/src/nodes/decompress_chunk/compressed_batch.c 79.36% 0 Missing and 13 partials ⚠️
tsl/src/nodes/decompress_chunk/batch_queue_heap.c 61.53% 0 Missing and 5 partials ⚠️
tsl/src/nodes/decompress_chunk/compressed_batch.h 57.14% 0 Missing and 3 partials ⚠️
tsl/src/nodes/decompress_chunk/batch_array.c 66.66% 0 Missing and 1 partial ⚠️
tsl/src/nodes/decompress_chunk/batch_queue_fifo.h 75.00% 0 Missing and 1 partial ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #6615      +/-   ##
==========================================
+ Coverage   80.06%   81.47%   +1.41%     
==========================================
  Files         190      191       +1     
  Lines       37181    36418     -763     
  Branches     9450     9464      +14     
==========================================
- Hits        29770    29673      -97     
+ Misses       2997     2979      -18     
+ Partials     4414     3766     -648     


@akuzm marked this pull request as ready for review on February 12, 2024, 14:20

@mahipv, @jnidzwetzki: please review this pull request.

Powered by pull-review

@akuzm changed the title from "Reduce indirections in compressed batch" to "Use data inheritance from VirtualTupleTableSlot in compressed batch" on Feb 12, 2024
@jnidzwetzki (Contributor) left a comment:

Overall it looks good. Could you add some comments to the functions used (e.g., compressed_batch_lazy_init, compressed_batch_current_tuple) before merging?

In addition, it might be good to add a comment documenting how the base pointer of the VirtualTupleTableSlot is used, so that this is understandable without knowing this PR.

batch_state->compressed_slot =
MakeSingleTupleTableSlot(dcontext->compressed_slot_tdesc, compressed_slot->tts_ops);

/* Get a reference the the output TupleTableSlot */
Contributor comment:

typo "the the"

@@ -700,7 +750,7 @@ compressed_batch_set_compressed_tuple(DecompressContext *dcontext,
 	 * columns. This can be improved by only decompressing the columns
 	 * needed for sorting.
 	 */
-	batch_state->next_batch_row = batch_state->total_batch_rows;
+	compressed_batch_discard_tuples(batch_state);
Contributor comment:

does this mean that earlier there was a memory leak here?

@akuzm (Member, Author) replied:

I think no, the memory context and tuple slots would be reset when the next compressed tuple arrived.
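
For readers less familiar with the executor, a rough sketch of the cleanup being described, using illustrative names rather than the actual TimescaleDB code: when the next compressed tuple is set on a batch, its per-batch memory context is reset and its output slot is cleared, so nothing from the previous (possibly discarded) batch survives.

#include "postgres.h"
#include "executor/tuptable.h"
#include "utils/memutils.h"

/* Illustrative batch state; field names are assumptions, not the real code. */
typedef struct BatchResetSketch
{
	VirtualTupleTableSlot decompressed_slot_data;	/* embedded output slot */
	MemoryContext per_batch_context;				/* per-batch allocations */
	int			total_batch_rows;
	int			next_batch_row;
} BatchResetSketch;

static void
batch_reset_sketch(BatchResetSketch *batch_state)
{
	/* Frees everything allocated while decompressing the previous batch. */
	MemoryContextReset(batch_state->per_batch_context);

	/* Marks the embedded output slot empty so stale datums are not reused. */
	ExecClearTuple(&batch_state->decompressed_slot_data.base);

	batch_state->total_batch_rows = 0;
	batch_state->next_batch_row = 0;
}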

@nikkhils (Contributor) left a comment:

LGTM.

@akuzm enabled auto-merge (squash) on February 19, 2024, 16:04
@akuzm merged commit e978619 into timescale:main on Feb 19, 2024
42 of 43 checks passed
@akuzm deleted the batch-layout branch on February 19, 2024, 16:18