Fix micro benchmarks #538

mheinzel · 2025-01-21T16:43:03Z

Some micro benchmarks don't run any more. I fixed two where the benchmark environment code got broken by recent changes.

Fixes #440 (this was one of them).

jorisdral · 2025-01-21T16:50:07Z

Related: #440

The minimum length of keys for the compact index increased from 6 to 8 bytes at some point. Also, releasing a run was changed to remove the files associated with it. The cleanup code was manually doing the same, which then became unnecessary and started causing issues.

Merge micro bench: release run during cleanup only

This previously tried creating the wbblobs file with an empty path, so there was an existing directory (the benchmark's root directory) already where the file was supposed to go.

With recent changes related to WriteBufferBlobs, flushing a write buffer started copying its blobs to a new file. This made the setup fail, since it generated random blob references, but no actual blobs they point at.

This fixes the file path errors for the write buffer blobs. It is also generally the right direction to go: if we ever want these tests to work with a write buffer and blobs, lookups must look at the original write buffer blobs, not an empty one.
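To illustrate why randomly generated blob references made the setup fail, here is a minimal sketch. The `Span` and `Store` types and the `addBlob` and `readBlob` functions are hypothetical stand-ins written for this example; they are not the lsm-tree API, and the in-memory `Store` only approximates a blob file. The point is that a reference is only meaningful if it was produced by writing a blob to the store that later resolves it.

```haskell
import qualified Data.ByteString as BS
import qualified Data.ByteString.Char8 as BC

-- | Hypothetical blob reference: offset and size inside a blob store.
data Span = Span { spanOffset :: Int, spanSize :: Int }
  deriving (Show, Eq)

-- | Hypothetical in-memory stand-in for a blob file.
newtype Store = Store BS.ByteString

emptyStore :: Store
emptyStore = Store BS.empty

-- | Append a blob and return the reference that points at it.
addBlob :: BS.ByteString -> Store -> (Span, Store)
addBlob blob (Store bytes) =
  (Span (BS.length bytes) (BS.length blob), Store (bytes <> blob))

-- | Resolve a reference; 'Nothing' if it points outside the store.
readBlob :: Store -> Span -> Maybe BS.ByteString
readBlob (Store bytes) (Span off len)
  | off < 0 || len < 0 || off + len > BS.length bytes = Nothing
  | otherwise = Just (BS.take len (BS.drop off bytes))

main :: IO ()
main = do
  let (ref, store) = addBlob (BC.pack "payload") emptyStore
  -- A reference created by writing the blob resolves against that store.
  print (readBlob store ref)               -- prints: Just "payload"
  -- A made-up reference has no blob behind it, so resolving it fails.
  print (readBlob emptyStore (Span 10 5))  -- prints: Nothing
```

The second lookup mirrors the failure mode described above: a reference that was never backed by a written blob cannot be copied or read, and resolving a valid reference against an empty store fails in the same way.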

dcoutts

LGTM

jorisdral

Very nice!

jorisdral · 2025-01-23T08:30:32Z

bench/micro/Bench/Database/LSMTree/Internal/Merge.hs

+            -- We make sure to immediately close resulting runs so we don't run
+            -- out of file handles or disk space. However, we don't want it to
+            -- be part of the measurement, as it includes deleting files.
+            -- Therefore, ... TODO


There is a dangling TODO here.

mheinzel

Oops, that slipped through. Thanks for pointing it out. #547

jorisdral · 2025-01-23T08:40:01Z

bench/micro/Bench/Database/LSMTree/Internal/Lookup.hs

 randomEntry g = frequency [
      (20, \g' -> let (!v, !g'') = uniform g' in (Insert v, g''))
    , (1,  \g' -> let (!v, !g'') = uniform g'
-                      (!b, !g''') = genBlobSpan g''
+                      (!b, !g''') = randomByteStringR (0, 2000) g''  -- < 2kB


You could generate much smaller blobs. That would speed up the benchmark setup, and the performance of the lookup code does not depend on the blob size. It's probably not very important, though, because inserts with blobs are generated only rarely.
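As a rough illustration of "only rarely", the sketch below applies the two weights visible in the snippet (20 and 1) to a simple weighted pick. `pickWeighted` and `Kind` are hypothetical names for this example, not the benchmark's `frequency` helper, and any further alternatives in the real list are not shown in the diff, so the ratio here covers only these two entries.

```haskell
-- | Pick from weighted alternatives, given a roll in [0, total weight).
-- Returns 'Nothing' if the roll is not covered by the weights.
pickWeighted :: [(Int, a)] -> Int -> Maybe a
pickWeighted [] _ = Nothing
pickWeighted ((w, x) : rest) roll
  | roll < w  = Just x
  | otherwise = pickWeighted rest (roll - w)

-- | Hypothetical labels for the two alternatives visible in the snippet.
data Kind = PlainInsert | InsertWithBlob
  deriving (Show, Eq)

main :: IO ()
main = do
  let weights = [(20, PlainInsert), (1, InsertWithBlob)]
      total   = sum (map fst weights)
      -- Enumerate every possible roll once instead of sampling randomly.
      picks   = [k | Just k <- map (pickWeighted weights) [0 .. total - 1]]
  -- Number of rolls that yield a blob, out of all possible rolls.
  print (length (filter (== InsertWithBlob) picks), total)  -- prints: (1,21)
```

With only these two weights, one roll in 21 produces an insert with a blob, so the blob size has limited influence on the overall setup time.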

mheinzel mentioned this pull request Jan 21, 2025

Implement MergeUnion merges #536

Merged

mheinzel added 4 commits January 22, 2025 14:49

Lookup micro bench: fix file paths in setup

73f9e37

This previously tried creating the wbblobs file with an empty path, so there was an existing directory (the benchmark's root directory) already where the file was supposed to go.

Lookup micro bench: actually generate blobs

8c06336

With recent changes related to WriteBufferBlobs, flushing a write buffer started copying its blobs to a new file. This made the setup fail, since it generated random blob references, but no actual blobs they point at.

mheinzel force-pushed the mheinzel/fix-benchmarks branch from eb7d34d to a7114d1 Compare January 22, 2025 13:57

mheinzel marked this pull request as ready for review January 22, 2025 14:00

mheinzel requested review from dcoutts, jorisdral, recursion-ninja and wenkokke as code owners January 22, 2025 14:00

dcoutts approved these changes Jan 22, 2025

dcoutts added this pull request to the merge queue Jan 22, 2025

Merged via the queue into main with commit a320d80 Jan 22, 2025
27 checks passed

dcoutts deleted the mheinzel/fix-benchmarks branch January 22, 2025 15:55

jorisdral reviewed Jan 23, 2025

mheinzel added a commit that referenced this pull request Jan 23, 2025

small tweaks for micro benchmarks

e61618b

A quick follow-up to #538

mheinzel added a commit that referenced this pull request Jan 23, 2025

small tweaks for micro benchmarks

71fe4bc

A quick follow-up to #538

github-merge-queue bot pushed a commit that referenced this pull request Jan 24, 2025

Merge pull request #547 from IntersectMBO/mheinzel/fix-benchmarks-2

9f6b334

Quick follow-up to address feedback for #538
