improve performance and memory usage in HashUtils.djb2 #28

skimbrel-figma · 2025-05-05T21:44:59Z

Runtime profiling indicates this method generates several many memory allocations.

Comparing to the JS implementation, we saw the intent of the hash &= hash line was to force the JS runtime to keep the number as a 32-bit integer. This is indeed the correct way to do it in JS, but not in Ruby; as a result, the hash local will grow ever larger, requiring more and more memory since Ruby supports unbounded integers.

Fix: truncate the hash value on each iteration with the same 32-bit 0xFFFFFFF constant used at the end instead.

Runtime profiling indicates this method generates several many memory allocations. Comparing to the JS implementation, we saw the intent of the `hash &= hash` line was to force the JS runtime to keep the number as a 32-bit integer. This is indeed the correct way to do it in JS, but not in Ruby; as a result, the `hash` local will grow ever larger, requiring more and more memory since Ruby supports unbounded integers. Fix: truncate the hash value on each iteration with the same 32-bit `0xFFFFFFF` constant used at the end instead.

skimbrel-figma · 2025-05-05T21:49:48Z

Benchmarking before/after with a 256-char string on my laptop:

irb(main):199:0> puts Benchmark.measure { n.times { djb2_fixed(short_str) }}
  0.003365   0.000043   0.003408 (  0.003415)
=> nil                                                                                       
irb(main):200:0> puts Benchmark.measure { n.times { djb2(short_str) }}
  0.024926   0.001164   0.026090 (  0.026234)
=> nil

A quick, imprecise comparison by inspecting GC.stat[:total_allocated_objects] before/after also shows 60% reduction in memory allocations.

I spot-checked a handful of input values to ensure the output hash value did not change.

tore-statsig · 2025-05-12T17:30:53Z

Nice find! Pulling this in to run tests on it now

skimbrel-figma · 2025-05-12T21:31:19Z

@tore-statsig actually i just realized we can do even better — the only thing inside the each loop is .ord, which is available as each_codepoint! i'll update.

#28 """ Runtime profiling indicates this method generates several many memory allocations. Comparing to the JS implementation, we saw the intent of the `hash &= hash` line was to force the JS runtime to keep the number as a 32-bit integer. This is indeed the correct way to do it in JS, but not in Ruby; as a result, the `hash` local will grow ever larger, requiring more and more memory since Ruby supports unbounded integers. Fix: truncate the hash value on each iteration with the same 32-bit `0xFFFFFFF` constant used at the end instead. """ Co-authored-by: Sam Kimbrel <[email protected]>

tore-statsig · 2025-05-14T23:31:25Z

The original version of this is released

https://github.com/statsig-io/ruby-sdk/releases/tag/2.4.2

skimbrel-figma · 2025-05-16T21:45:34Z

great! we've been running the second commit (switching to each_codepoint) as a monkeypatch for most of a week now and it seems totally fine + reduces memory still further 👍

the only thing inside the each loop is .ord, which is available as each_codepoint #28 --------- Co-authored-by: Sam Kimbrel <[email protected]>

use each_codepoint to save a string allocation

f780508

Merge branch 'main' into patch-1

d2c15f3

statsig-kong bot pushed a commit that referenced this pull request May 22, 2025

feat: optimize djb2 hash perf (#378)

c1ab29b

the only thing inside the each loop is .ord, which is available as each_codepoint #28 --------- Co-authored-by: Sam Kimbrel <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

improve performance and memory usage in HashUtils.djb2 #28

improve performance and memory usage in HashUtils.djb2 #28

Uh oh!

skimbrel-figma commented May 5, 2025

Uh oh!

skimbrel-figma commented May 5, 2025

Uh oh!

tore-statsig commented May 12, 2025

Uh oh!

skimbrel-figma commented May 12, 2025

Uh oh!

tore-statsig commented May 14, 2025

Uh oh!

skimbrel-figma commented May 16, 2025

Uh oh!

Uh oh!

improve performance and memory usage in HashUtils.djb2 #28

Are you sure you want to change the base?

improve performance and memory usage in HashUtils.djb2 #28

Uh oh!

Conversation

skimbrel-figma commented May 5, 2025

Uh oh!

skimbrel-figma commented May 5, 2025

Uh oh!

tore-statsig commented May 12, 2025

Uh oh!

skimbrel-figma commented May 12, 2025

Uh oh!

tore-statsig commented May 14, 2025

Uh oh!

skimbrel-figma commented May 16, 2025

Uh oh!

Uh oh!