Improve DenseTensorSchema.key_range #193

gsakkis · 2022-09-26T09:01:20Z

DenseTensorSchema.key_range currently returns an IntRange of the key dimension values for dense arrays. For improved efficiency and memory predictability however, it is better if every partition of this key range (generated by partition_by_count and/or partition_by_weight) starts at a tile boundary so that any integral TileDB tile is read (at most) once.

This PR updates DenseTensorSchema.key_range to return a ConstrainedPartitionsIntRange whose start_offsets are a range with step equal to the tile size of the key dimension, ensuring that every partition (with the possible exception of the first) starts at a tile boundary.

While this change is in general beneficial, it is even more important for extending properly #191 to dense arrays (since it is more likely that the default IntRange partitioning doesn't align with tile boundaries for an arbitrary user-selected key dimension slice).

shortcut-integration · 2022-09-26T09:01:22Z

This pull request has been linked to Shortcut Story #21626: Support for slicing on key dimension.

georgeSkoumas · 2022-10-03T12:47:52Z

tiledb/ml/readers/_tensor_schema/dense.py

    @property
-    def key_range(self) -> IntRange:
+    def key_range(self) -> ConstrainedPartitionsIntRange:
+        self._key_range: ConstrainedPartitionsIntRange


Minor and old...self._key_range is not defined in __init__.

This is a cached property, it is defined (and cached) the first time the property is accessed instead of __init__. In Python 3.8+ this is provided by functools.cached_property, since we support 3.7 we do it manually instead.

georgeSkoumas

LGTM!

…ntRange

gsakkis requested review from georgeSkoumas and ktsitsi September 26, 2022 09:01

georgeSkoumas reviewed Oct 3, 2022

View reviewed changes

georgeSkoumas approved these changes Oct 3, 2022

View reviewed changes

gsakkis changed the base branch from gsa/InclusiveRange-updates to master October 3, 2022 12:56

Change DenseTensorSchema.key_range to return a ConstrainedPartitionsI…

56a12c7

…ntRange

gsakkis force-pushed the gsa/sc-21626/support-for-slicing-on-key-dimension-dense branch from 47fba99 to 56a12c7 Compare October 3, 2022 12:56

gsakkis merged commit b9baa54 into master Oct 3, 2022

gsakkis deleted the gsa/sc-21626/support-for-slicing-on-key-dimension-dense branch October 3, 2022 12:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve DenseTensorSchema.key_range #193

Improve DenseTensorSchema.key_range #193

Uh oh!

gsakkis commented Sep 26, 2022

Uh oh!

shortcut-integration bot commented Sep 26, 2022

Uh oh!

georgeSkoumas Oct 3, 2022

Uh oh!

gsakkis Oct 3, 2022

Uh oh!

georgeSkoumas left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Improve DenseTensorSchema.key_range #193

Improve DenseTensorSchema.key_range #193

Uh oh!

Conversation

gsakkis commented Sep 26, 2022

Uh oh!

shortcut-integration bot commented Sep 26, 2022

Uh oh!

georgeSkoumas Oct 3, 2022

Choose a reason for hiding this comment

Uh oh!

gsakkis Oct 3, 2022

Choose a reason for hiding this comment

Uh oh!

georgeSkoumas left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants