Shared in-memory index cache for queriers with blocks storage #2189

pracucci · 2020-02-27T05:42:59Z

What this PR does:
In this PR I propose to shift to a shared in-memory index cache for queriers with blocks storage. The problem with per-tenant caches is that the total max cache size linearly increase with the number of tenants, while with a single cache it's easier to keep the max memory used under control.

Which issue(s) this PR fixes:
Fixes #2069

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

pracucci · 2020-02-27T15:32:52Z

About this TODO:

// TODO: apparently Thanos has a bug which cause a block to not be considered if the
//       query timetamp matches the block max timestamp

This issue is fixed in Thanos master but will upgrade it in a separate PR.

pstibrany

LGTM

pkg/storage/tsdb/config.go

gouthamve

This is a nice PR, only one minor nit.

Though for me there is a bigger concern that we are not prefixing anything with the tenantID. I know its extremely rare for ULIDs to match across tenants, I'm afraid that if it did, we might be leaking some data.

Can we add a TODO to make sure that we propagate the tenantID into the cache somehow?

gouthamve · 2020-02-28T13:00:37Z

pkg/storage/tsdb/config.go

@@ -154,7 +154,7 @@ type BucketStoreConfig struct {
 func (cfg *BucketStoreConfig) RegisterFlags(f *flag.FlagSet) {
 	f.StringVar(&cfg.SyncDir, "experimental.tsdb.bucket-store.sync-dir", "tsdb-sync", "Directory to store synchronized TSDB index headers.")
 	f.DurationVar(&cfg.SyncInterval, "experimental.tsdb.bucket-store.sync-interval", 5*time.Minute, "How frequently scan the bucket to look for changes (new blocks shipped by ingesters and blocks removed by retention or compaction). 0 disables it.")
-	f.Uint64Var(&cfg.IndexCacheSizeBytes, "experimental.tsdb.bucket-store.index-cache-size-bytes", uint64(250*units.Mebibyte), "Size - in bytes - of a per-tenant in-memory index cache used to speed up blocks index lookups.")
+	f.Uint64Var(&cfg.IndexCacheSizeBytes, "experimental.tsdb.bucket-store.index-cache-size-bytes", uint64(1*units.Gibibyte), "Size in bytes of in-memory index cache used to speed up blocks index lookups (shared across multiple tenants).")


shared across all tenants is better, but only slightly :)

shared between ?

I've updated it to "shared between all tenants". I hope to have correctly merged both feedback 😉

Merges can be difficult :)

Signed-off-by: Marco Pracucci <[email protected]>

Signed-off-by: Marco Pracucci <[email protected]> Co-Authored-By: Peter Štibraný <[email protected]>

Signed-off-by: Marco Pracucci <[email protected]>

pracucci · 2020-02-28T13:34:18Z

Though for me there is a bigger concern that we are not prefixing anything with the tenantID. I know its extremely rare for ULIDs to match across tenants, I'm afraid that if it did, we might be leaking some data.

Can we add a TODO to make sure that we propagate the tenantID into the cache somehow?

This is a very good point. After some discussion in #2069 we decided to not propagate the tenantID but make the block ID entropy safer, which is what I did here:
prometheus/prometheus#6867

Feel free share further feedback. I'm willing to re-consider this decision if you think it's risky.

pull-request-size bot added the size/L label Feb 27, 2020

pracucci force-pushed the single-querier-in-memory-cache-for-blocks-storage branch 2 times, most recently from ecc6f61 to 7167a70 Compare February 28, 2020 09:17

pstibrany approved these changes Feb 28, 2020

View reviewed changes

pkg/storage/tsdb/config.go Outdated Show resolved Hide resolved

pracucci force-pushed the single-querier-in-memory-cache-for-blocks-storage branch 2 times, most recently from 0f40094 to 9ffb9d9 Compare February 28, 2020 10:20

gouthamve approved these changes Feb 28, 2020

View reviewed changes

pracucci and others added 6 commits February 28, 2020 14:25

Shift to a shared in-memory index cache for queriers with blocks storage

e8cc57b

Signed-off-by: Marco Pracucci <[email protected]>

Updated doc and CHANGELOG

bfd8daa

Signed-off-by: Marco Pracucci <[email protected]>

Fixed unit tests

68c5ddd

Signed-off-by: Marco Pracucci <[email protected]>

Added integration test on querier with blocks storage

5d27c67

Signed-off-by: Marco Pracucci <[email protected]>

Update pkg/storage/tsdb/config.go

9b8774b

Signed-off-by: Marco Pracucci <[email protected]> Co-Authored-By: Peter Štibraný <[email protected]>

Updated doc

26ab7cb

Signed-off-by: Marco Pracucci <[email protected]>

pracucci force-pushed the single-querier-in-memory-cache-for-blocks-storage branch from 9ffb9d9 to 8f19883 Compare February 28, 2020 13:29

Updated CLI flag desc

8f19883

Signed-off-by: Marco Pracucci <[email protected]>

pracucci merged commit 151c577 into cortexproject:master Feb 28, 2020

pracucci deleted the single-querier-in-memory-cache-for-blocks-storage branch February 28, 2020 13:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Shared in-memory index cache for queriers with blocks storage #2189

Shared in-memory index cache for queriers with blocks storage #2189

Uh oh!

pracucci commented Feb 27, 2020

Uh oh!

pracucci commented Feb 27, 2020

Uh oh!

pstibrany left a comment

Uh oh!

Uh oh!

gouthamve left a comment

Uh oh!

gouthamve Feb 28, 2020

Uh oh!

pstibrany Feb 28, 2020

Uh oh!

pracucci Feb 28, 2020

Uh oh!

pstibrany Feb 28, 2020

Uh oh!

pracucci commented Feb 28, 2020

Uh oh!

Uh oh!

Shared in-memory index cache for queriers with blocks storage #2189

Shared in-memory index cache for queriers with blocks storage #2189

Uh oh!

Conversation

pracucci commented Feb 27, 2020

Uh oh!

pracucci commented Feb 27, 2020

Uh oh!

pstibrany left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gouthamve left a comment

Choose a reason for hiding this comment

Uh oh!

gouthamve Feb 28, 2020

Choose a reason for hiding this comment

Uh oh!

pstibrany Feb 28, 2020

Choose a reason for hiding this comment

Uh oh!

pracucci Feb 28, 2020

Choose a reason for hiding this comment

Uh oh!

pstibrany Feb 28, 2020

Choose a reason for hiding this comment

Uh oh!

pracucci commented Feb 28, 2020

Uh oh!

Uh oh!