Provide framework for generic lazily evaluated operation results #1350

RobinTF · 2024-05-18T00:32:19Z

Still WIP. Currently missing:

* Discussion about remaining TODOs
* Lots of unit tests
* Most likely, some functions need to be broken up into smaller pieces once we have found everything else to be working "correctly".
* Documentation of all newly introduced functions once they have become somewhat "final"
* Cold Fusion & World domination?

RobinTF · 2024-05-18T00:36:16Z

src/engine/Operation.cpp

+          result._resultPointer->resultTable()->idTable().numColumns();
+      LOG(DEBUG) << "Computed result of size " << resultNumRows << " x "
+                 << resultNumCols << std::endl;
+    }


Does this debug message provide enough real benefit to be worth carrying over to lazily evaluated operations?
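For context, here is a minimal sketch of how a size report could be kept for lazy results, assuming the result is consumed chunk by chunk. Everything in it is hypothetical and not taken from this PR: `Table`, `LazyChunks`, `SizeCallback`, and `withSizeReport` are illustrative names, and a plain `std::function` stands in for whatever generator type the real code uses. The idea is to count rows while chunks pass through and to report the size once the input is exhausted.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <optional>
#include <utility>
#include <vector>

// Hypothetical stand-ins, not the types used in this PR.
using Table = std::vector<std::vector<int>>;
// Each call yields the next chunk, or std::nullopt once exhausted.
using LazyChunks = std::function<std::optional<Table>()>;
using SizeCallback = std::function<void(std::size_t rows, std::size_t cols)>;

// Wrap a lazy chunk source so that the total size is reported exactly once,
// after the last chunk has been consumed.
LazyChunks withSizeReport(LazyChunks inner, SizeCallback onDone) {
  return [inner = std::move(inner), onDone = std::move(onDone),
          rows = std::size_t{0}, cols = std::size_t{0},
          reported = false]() mutable -> std::optional<Table> {
    std::optional<Table> chunk = inner();
    if (chunk) {
      rows += chunk->size();
      if (!chunk->empty()) {
        // All chunks are expected to have the same number of columns.
        assert(cols == 0 || chunk->front().size() == cols);
        cols = chunk->front().size();
      }
      return chunk;
    }
    if (!reported) {
      reported = true;
      onDone(rows, cols);
    }
    return std::nullopt;
  };
}
```

In such a design the callback would be the place to emit the "Computed result of size ..." message. Note that the size is only known after the consumer has drained the result, so the message would appear later than it does for a materialized result.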

codecov · 2024-05-18T00:56:07Z

Codecov Report

Attention: Patch coverage is 47.47664% with 281 lines in your changes missing coverage. Please review.

Project coverage is 88.06%. Comparing base (f9e730c) to head (c465685).

| Files | Patch % | Lines |
|---|---|---|
| src/engine/Result.cpp | 25.89% | 182 Missing and 4 partials ⚠️ |
| src/engine/Operation.cpp | 58.33% | 32 Missing and 3 partials ⚠️ |
| src/engine/Filter.cpp | 56.75% | 15 Missing and 1 partial ⚠️ |
| src/engine/IndexScan.cpp | 5.88% | 15 Missing and 1 partial ⚠️ |
| src/engine/ExportQueryExecutionTrees.cpp | 73.33% | 7 Missing and 5 partials ⚠️ |
| src/util/Cache.h | 82.08% | 0 Missing and 12 partials ⚠️ |
| src/util/CacheableGenerator.h | 0.00% | 2 Missing ⚠️ |
| src/engine/QueryExecutionTree.cpp | 83.33% | 0 Missing and 1 partial ⚠️ |
| src/engine/QueryPlanner.cpp | 50.00% | 0 Missing and 1 partial ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1350      +/-   ##
==========================================
- Coverage   88.89%   88.06%   -0.84%     
==========================================
  Files         327      329       +2     
  Lines       28974    29430     +456     
  Branches     3210     3271      +61     
==========================================
+ Hits        25756    25917     +161     
- Misses       2066     2331     +265     
- Partials     1152     1182      +30


This PR contains all the changes from the infrastructure for lazy operation evaluation (#1350) that are simple and repetitive, but touch many files. In particular:

* Rename the `ResultTable` class to `Result` (a TODO suggested by @hannahbast some time ago).
* Add a new parameter `bool requestLaziness` to `Operation::computeResult`. This parameter is currently unused.
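To illustrate what a `requestLaziness` flag could eventually control, here is a minimal sketch, assuming a result that holds either a fully materialized table or a source of chunks (the commit titles in this PR mention wrapping the table in a variant and creating a `Result` from a generator). Apart from the names `computeResult` and `requestLaziness`, everything here is hypothetical: `Table`, `LazyChunks`, and `SketchResult` are illustrative stand-ins, the free function below is a toy and not `Operation::computeResult`, and a `std::function` replaces the real generator type.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <optional>
#include <utility>
#include <variant>
#include <vector>

// Hypothetical stand-in for an ID table: rows of integer IDs.
using Table = std::vector<std::vector<int>>;
// Hypothetical stand-in for a generator: each call yields the next chunk,
// or std::nullopt once the input is exhausted.
using LazyChunks = std::function<std::optional<Table>()>;

// A result is either fully materialized or produced chunk by chunk.
class SketchResult {
 public:
  explicit SketchResult(Table table) : data_{std::move(table)} {}
  explicit SketchResult(LazyChunks chunks) : data_{std::move(chunks)} {}

  bool isFullyMaterialized() const {
    return std::holds_alternative<Table>(data_);
  }

  // Consume the result and return all rows as a single table.
  Table materialize() && {
    if (auto* table = std::get_if<Table>(&data_)) {
      return std::move(*table);
    }
    Table all;
    auto& next = std::get<LazyChunks>(data_);
    while (auto chunk = next()) {
      all.insert(all.end(), chunk->begin(), chunk->end());
    }
    return all;
  }

 private:
  std::variant<Table, LazyChunks> data_;
};

// Toy operation that produces the rows {0}, {1}, ..., {numRows - 1}.
SketchResult computeResult(std::size_t numRows, bool requestLaziness) {
  constexpr std::size_t chunkSize = 2;
  auto makeChunk = [](std::size_t begin, std::size_t end) {
    Table chunk;
    for (std::size_t i = begin; i < end; ++i) {
      chunk.push_back({static_cast<int>(i)});
    }
    return chunk;
  };
  if (!requestLaziness) {
    return SketchResult{makeChunk(0, numRows)};
  }
  // The lambda owns its position, so chunks are only built on demand.
  LazyChunks chunks = [=, pos = std::size_t{0}]() mutable
      -> std::optional<Table> {
    if (pos >= numRows) return std::nullopt;
    std::size_t end = std::min(pos + chunkSize, numRows);
    Table chunk = makeChunk(pos, end);
    assert(!chunk.empty());  // Holds because pos < end at this point.
    pos = end;
    return chunk;
  };
  return SketchResult{std::move(chunks)};
}
```

A caller that needs the whole table can write `computeResult(5, true).materialize()` and gets the same rows as with `requestLaziness = false`. The flag is only a request in this sketch; an operation that cannot produce chunks could ignore it and return a materialized result.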

This makes the code much simpler and makes no difference for almost all queries. The expensive part (reading from disk and decompressing) is still done in parallel; only the writing to the `IdTable` is now serialized, and there is one additional copy compared to before. An example of a query that is now slower because of this change: materialize a large index scan (for example, for the predicate `rdf:type`) and group by subject (there is a shortcut for grouping by object when there are few objects). But such queries will become lazy soon anyway (see #1350), and then this will be irrelevant.

sonarcloud · 2024-06-15T16:22:49Z

Quality Gate passed

Issues
32 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

RobinTF mentioned this pull request May 21, 2024

Refactoring preliminaries for lazy operations (Part 1) #1352

Merged

RobinTF added 25 commits May 23, 2024 16:26

Rename ResultTable -> Result

80667bd

Wrap idTable in variant

31b2c11

Add ability to create Result from generator

4d0204c

Start fixing caching issues

515ed0c

Avoid another class of exceptions

ca1cbed

Optimize imports

9e7f3cb

Introduce ReusableGenerator class

4c75d42

Try to make caching work

892e4a5

Fiddle around with const a bit

586365c

Add more TODOs

80e2dbd

Fix TextLimit code after rebase

18ca5b1

Fix compilation issues for ReusableGenerator

86a9f4b

Remove offset calculations from exporter

7f0a5e7

Fix typo

aee20dd

Add comments

7576b2e

Make supportsLimit private to avoid misuse

7765a25

Properly use minimum limit if present

f815be8

Start adding code to manipulate code after cache extraction

90cca50

Implement fallback mechanism for failed cache share

694c21f

Fix accidental edit of Usage.md

ea8b81f

Consume result as master

50e4529

Add proper condition variables

16eedd8

Implement code that allows for proper recomputation of cache size

bf8f085

Refactor a bit

771eb5b

Aggregate tables at the end of lazy results

8aa9060

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

93a5892

RobinTF force-pushed the refactor-result-table branch from 51eaabf to 93a5892 Compare May 23, 2024 14:33

Change how maxSend works

9c445ea

RobinTF mentioned this pull request May 23, 2024

Consistent handling of implicit and explicit LIMIT clauses #1355

Merged

RobinTF added 11 commits May 23, 2024 19:25

Correct call order

b9ca4aa

Correct call order

43dddd0

Rethink approach to apply limits and offset

4ac7892

Add back headers

ef17e67

Add back result limiter for subqueries

aabb81b

Try to fix subtle bug with runtime information detail

66a38b4

Merge branch 'max-send-changes' into refactor-result-table

999baee

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

c291ff7

Add back comment

9f17e07

Rename resultTable -> result

389f3f1

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

000af28

RobinTF force-pushed the refactor-result-table branch from c7ebab6 to 000af28 Compare June 6, 2024 16:31

RobinTF added 4 commits June 9, 2024 22:04

Add correctness check to prevent double move due to race condition

ba142a0

Start implementing tests for new cache feature and fixing bugs along the way

44562c7

Some Test cleanup

0f3a59a

Mark variable as maybe_unused

d226849

hannahbast mentioned this pull request Jun 13, 2024

Implement materialized index scans by materializing lazy scans #1323

Merged

RobinTF added 5 commits June 13, 2024 23:31

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

552a268

Restructure recomputeSize a bit to avoid unwanted behaviour

cde135a

Add remaining cache tests

cf6b4c9

Merge remote-tracking branch 'ad-freiburg/master' into refactor-result-table

b2138bf

Add tests for IteratorWrapper

0c589e3

Fix line endings

c465685
