Add general index benchmarks #599

jeltsch · 2025-02-27T17:24:24Z

This pull request adds index benchmarks that are independent of index type and instantiates them for compact and ordinary indexes.

jorisdral · 2025-03-12T08:51:21Z

bench/micro/Bench/Database/LSMTree/Internal/Index.hs

+incrementalConstructionAppends
+    :: Int      -- ^ Number of keys used in the construction


Suggested change

-incrementalConstructionAppends
-    :: Int      -- ^ Number of keys used in the construction
+incrementalConstructionAppends ::
+       Int      -- ^ Number of keys used in the construction

jorisdral · 2025-03-14

@jeltsch did you forget to resolve this?

jeltsch · 2025-03-14

I thought that this wouldn’t be so important, given that it was labeled a “suggested” change and the pull request had already been approved as it was.

Generally, I think that this sort of hanging indentation, where :: is not placed above the arrows, makes for an inferior layout. I know that your editor and GitHub seem to have problems with syntax highlighting for a layout like the present one, but I think we shouldn’t let current bugs in tools influence our layout.

Well, I can make the change. Should I?

jorisdral · 2025-03-12T09:04:00Z

bench/micro/Bench/Database/LSMTree/Internal/Index.hs

+-- | Deterministically constructs a value using a QuickCheck generator.
+generated :: Gen a -> a
+generated (MkGen exec) = exec (mkQCGen 411) 30
+
+{-|
+    Constructs serialised keys that conform to the key size constraint of
+    compact indexes.
+-}
+keysForIndexCompact :: Int             -- ^ Number of keys
+                    -> [SerialisedKey] -- ^ Constructed keys
+keysForIndexCompact = vector                                        >>>
+                      generated                                     >>>
+                      map (getKeyForIndexCompact >>> SerialisedKey)
+
+{-|
+    Constructs append operations whose serialised keys conform to the key size
+    constraint of compact indexes.
+-}
+appendsForIndexCompact :: Int      -- ^ Number of keys used in the construction
+                       -> [Append] -- ^ Constructed append operations
+appendsForIndexCompact = keysForIndexCompact                >>>
+                         mkPages 0.03 (choose (0, 16)) 0.01 >>>
+                         generated                          >>>
+                         toAppends


I normally avoid using QuickCheck generators in benchmarks because they are often in flux and because they are primarily tailored towards testing, not benchmarking. It can be more future-proof to write these functions with System.Random and related functions. When we change QuickCheck generators, we might accidentally change a benchmark as well.

jeltsch · 2025-03-12

I’m also not completely satisfied with using QuickCheck generators for benchmarks. That said, by using System.Random directly, we lose the ability to use existing utilities, such as mkPages, which is used above. I’d like to leave the above code as it is for now. It might be worthwhile, though, to generally revisit the uses of QuickCheck generators in our benchmarks.
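To make the trade-off discussed here concrete, the following is a minimal sketch, not code from the pull request. It assumes the QuickCheck and random packages are available. The names `generatedWith`, `Key`, `keysViaQuickCheck` and `keysViaSystemRandom` are hypothetical, and keys are modelled as plain byte lists rather than `SerialisedKey`. Both variants are deterministic for a fixed seed. The difference is that the QuickCheck variant depends on how the library's generators are implemented, whereas the System.Random variant depends only on the random package.

```haskell
import Data.List (unfoldr)
import Data.Word (Word8)
import System.Random (StdGen, mkStdGen, uniformR)
import Test.QuickCheck (Gen, arbitrary, vectorOf)
import Test.QuickCheck.Gen (unGen)
import Test.QuickCheck.Random (mkQCGen)

-- | Hypothetical key type for illustration: a list of bytes.
type Key = [Word8]

-- | Runs a generator with a fixed seed and size 30, in the spirit of
-- the `generated` function from the diff (the name is an assumption).
generatedWith :: Int -> Gen a -> a
generatedWith seed gen = unGen gen (mkQCGen seed) 30

-- | Keys via QuickCheck. The result depends on how QuickCheck implements
-- `arbitrary` for 'Word8', so it may change with the library.
keysViaQuickCheck :: Int -> [Key]
keysViaQuickCheck n = generatedWith 411 (vectorOf n (vectorOf 8 arbitrary))

-- | Keys via System.Random. The result depends only on the random package.
keysViaSystemRandom :: Int -> [Key]
keysViaSystemRandom n = take n (unfoldr (Just . key) (mkStdGen 411))
  where
    -- Draws one 8-byte key and returns the advanced generator.
    key :: StdGen -> (Key, StdGen)
    key g0 = go (8 :: Int) g0 []
      where
        go 0 g acc = (acc, g)
        go k g acc =
          let (b, g') = uniformR (minBound, maxBound) g
          in  go (k - 1) g' (b : acc)

main :: IO ()
main = do
  -- Both functions are pure, so repeated calls yield equal results.
  print (keysViaQuickCheck 3 == keysViaQuickCheck 3)
  print (keysViaSystemRandom 3 == keysViaSystemRandom 3)
```

Either variant gives reproducible benchmark inputs within one build; the concern raised in the review is about reproducibility across changes to the generators themselves.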

jorisdral · 2025-03-14

> That said, by using System.Random directly, we lose the ability to use existing utilities, like we use mkPages above.

That's not completely true. You do need to run the mkPages Gen with a seed, yes, but you can generate the keys with System.Random. See the Index.Compact benchmarks.

jeltsch · 2025-03-14

So you’re advocating a mixed approach, with some data being generated by using System.Random directly and other data by using QuickCheck?

In the compact-index benchmark module, I can only see one use of mkPages, the one in constructionEnv. However, there the generator constructed by mkPages is run via generate, which means that it uses a seed produced by the global random number generator, not a seed specified in the source file. That said, I could still use my generated function to run generators with fixed seeds.

Should I change the general index benchmarks to use that mixed approach, where just the page data is generated using QuickCheck?
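The mixed approach under discussion could look roughly like the sketch below. It is an illustration under stated assumptions, not the project's code: `pagesOf` is a hypothetical stand-in for mkPages, keys are plain `Int` values, and the seeds are arbitrary. It also shows the distinction drawn above between QuickCheck's `generate`, which takes its seed from a global source, and a runner with a seed fixed in the source file.

```haskell
import System.Random (mkStdGen, randomRs)
import Test.QuickCheck (Gen, choose, generate)
import Test.QuickCheck.Gen (unGen)
import Test.QuickCheck.Random (mkQCGen)

-- | Fixed-seed runner with the same shape as `generated` in the diff.
generated :: Gen a -> a
generated gen = unGen gen (mkQCGen 411) 30

-- | Hypothetical stand-in for mkPages: splits the keys into pages of
-- one to four keys each, with page sizes chosen by the generator.
pagesOf :: [k] -> Gen [[k]]
pagesOf [] = pure []
pagesOf ks = do
  n <- choose (1, 4)
  let (page, rest) = splitAt n ks
  (page :) <$> pagesOf rest

-- | Keys come from System.Random with a seed fixed in the source.
keys :: Int -> [Int]
keys n = take n (randomRs (0, 999) (mkStdGen 17))

main :: IO ()
main = do
  -- Page layout from QuickCheck, run with a fixed seed: reproducible.
  let fixed = generated (pagesOf (keys 10))
  print (concat fixed == keys 10)
  print (fixed == generated (pagesOf (keys 10)))
  -- `generate` draws a fresh seed, so the page layout may differ
  -- between runs, although the keys themselves stay the same.
  fresh <- generate (pagesOf (keys 10))
  print (concat fresh == keys 10)
```

In this arrangement a change to a QuickCheck generator can alter only the page layout, while the key data stays tied to System.Random.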

jorisdral · 2025-03-12T09:07:10Z

BTW, I think all commits can be squashed into one before merging.

jeltsch requested a review from dcoutts as a code owner February 27, 2025 17:24

jeltsch added the enhancement New feature or request label Feb 27, 2025

jeltsch requested review from jorisdral, mheinzel and recursion-ninja as code owners February 27, 2025 17:24

jeltsch self-assigned this Feb 27, 2025

jeltsch requested a review from wenkokke as a code owner February 27, 2025 17:24

jeltsch force-pushed the jeltsch/general-index-benchmarks branch from 9770b4c to 5081353 Compare March 11, 2025 18:41

jorisdral approved these changes Mar 12, 2025

jeltsch mentioned this pull request Mar 12, 2025

Eliminate some allocations from ordinary index search #264

Closed

Add general index benchmarks

b635b2c

jeltsch force-pushed the jeltsch/general-index-benchmarks branch from 4904374 to b635b2c Compare March 12, 2025 19:44

jeltsch enabled auto-merge March 12, 2025 19:45

jeltsch added this pull request to the merge queue Mar 12, 2025

Merged via the queue into main with commit 585b257 Mar 12, 2025
27 checks passed

jeltsch deleted the jeltsch/general-index-benchmarks branch March 12, 2025 20:40

jeltsch mentioned this pull request Jun 3, 2025

Generalize benchmarks to work with different index types #560

Closed
