Shrink bigchunk data structure #1649

bboreham · 2019-09-04T10:41:30Z

Remove pointer indirection on the underlying XORChunk, which will save memory and a few CPU cycles.
We have to be careful to create the Appender after the chunk is appended to our slice, since it has a pointer to the chunk memory.

And stop storing the end time of every small chunk - we can look at the start time of the next one since they don't overlap. This makes the data structure slightly smaller, and speeds up unmarshalling since we don't need to seek through every value.

We don't need to store the start time either, since it can be fetched from the underlying chunk, but that takes longer.

Estimate of the impact: Suppose chunks are avg 6 hours long then each one has 12 smallchunks and we save 16 bytes per = 192MB per million series, plus Go heap overheads.

pkg/chunk/encoding/bigchunk.go

This will save memory and a few CPU cycles. We have to be careful to create the Appender after the chunk is appended to our slice, since it has a pointer to the chunk memory. Signed-off-by: Bryan Boreham <[email protected]>

We can look at the start time of the next one since they don't overlap. This makes the data structure slightly smaller, and speeds up unmarshalling since we don't need to seek through every value. Signed-off-by: Bryan Boreham <[email protected]>

and add a test that covers that case. Signed-off-by: Bryan Boreham <[email protected]>

Check it works after slice and unmarshal, and check it fails when you seek off the end of the chunk. Signed-off-by: Bryan Boreham <[email protected]> fix test

It is only used to shortcut the case where FindAtOrAfter() is called with a target past the end of the chunk, and this never happens because we have from/through times on each chunk at a higher level. Signed-off-by: Bryan Boreham <[email protected]>

csmarchbanks

LGTM

pkg/chunk/encoding/bigchunk.go

bboreham force-pushed the bigchunk-smaller branch from 342a722 to 5a44db6 Compare September 5, 2019 09:44

gouthamve requested a review from tomwilkie September 5, 2019 11:46

bboreham force-pushed the bigchunk-smaller branch from 17ae6e5 to b01ef46 Compare September 7, 2019 14:06

bboreham added the type/performance label Sep 9, 2019

bboreham force-pushed the bigchunk-smaller branch 2 times, most recently from b07cc22 to e45e510 Compare September 13, 2019 16:45

gouthamve self-assigned this Sep 18, 2019

csmarchbanks reviewed Sep 20, 2019

View reviewed changes

pkg/chunk/encoding/bigchunk.go Show resolved Hide resolved

bboreham added 3 commits September 24, 2019 16:46

Fix FindAtOrAfter where seek position is between two smallchunks

87face4

and add a test that covers that case. Signed-off-by: Bryan Boreham <[email protected]>

bboreham force-pushed the bigchunk-smaller branch from e45e510 to 124bd82 Compare September 24, 2019 17:31

bboreham added 2 commits September 25, 2019 10:18

Add more test cases calling FindAtOrAfter()

b2e700e

Check it works after slice and unmarshal, and check it fails when you seek off the end of the chunk. Signed-off-by: Bryan Boreham <[email protected]> fix test

bboreham force-pushed the bigchunk-smaller branch from 1e64a3b to f0ba932 Compare September 25, 2019 10:21

csmarchbanks approved these changes Sep 25, 2019

View reviewed changes

pkg/chunk/encoding/bigchunk.go Show resolved Hide resolved

bboreham merged commit 53f2043 into master Sep 26, 2019

bboreham deleted the bigchunk-smaller branch September 26, 2019 12:31

bboreham mentioned this pull request Sep 27, 2019

Explicitly reallocate bigchunk slice to avoid up to 2x overhead #1702

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Shrink bigchunk data structure #1649

Shrink bigchunk data structure #1649

Uh oh!

bboreham commented Sep 4, 2019 •

edited

Loading

Uh oh!

Uh oh!

csmarchbanks left a comment

Uh oh!

Uh oh!

Uh oh!

Shrink bigchunk data structure #1649

Shrink bigchunk data structure #1649

Uh oh!

Conversation

bboreham commented Sep 4, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

csmarchbanks left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

bboreham commented Sep 4, 2019 •

edited

Loading