Shuffle-sharding of queriers may not work as intended #3571

@pstibrany

Description

Shuffle sharding is intended to protect the system when a user sends queries that can crash a querier: only the queriers in that user's shard should be affected.

However, when a querier crashes, the query-frontend (or query-scheduler) only sees that the querier has disconnected. It then recomputes the user's shard from the remaining queriers and simply picks different ones. If the user keeps sending crashing queries during that window, they can eventually crash all the queriers.
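To make the failure mode concrete, here is a minimal simulation sketch. It is not the Cortex implementation: `selectShard` and `simulateCrashes` are hypothetical helpers, the hash-ranking selection is an assumption chosen for illustration, and crashed queriers are assumed to never restart. It only models the behavior described above, where the shard is recomputed from whichever queriers are still connected.

```go
package main

import (
	"fmt"
	"hash/fnv"
	"sort"
)

// selectShard picks up to shardSize queriers for a tenant from the queriers
// that are currently connected, ranking them by a hash of tenant and querier.
// Hypothetical helper for illustration; not the actual Cortex selection logic.
func selectShard(tenant string, connected []string, shardSize int) []string {
	if shardSize < 1 {
		return nil
	}
	ranked := append([]string(nil), connected...)
	score := func(q string) uint64 {
		h := fnv.New64a()
		h.Write([]byte(tenant + "/" + q))
		return h.Sum64()
	}
	sort.Slice(ranked, func(i, j int) bool {
		si, sj := score(ranked[i]), score(ranked[j])
		if si != sj {
			return si < sj
		}
		return ranked[i] < ranked[j]
	})
	if shardSize < len(ranked) {
		ranked = ranked[:shardSize]
	}
	return ranked
}

// simulateCrashes models a tenant that keeps sending a query which crashes the
// querier that runs it. A crashed querier is treated as disconnected and, in
// this sketch, never restarts. It returns queriers in the order they crashed.
func simulateCrashes(tenant string, queriers []string, shardSize int) []string {
	connected := append([]string(nil), queriers...)
	var crashed []string
	for len(connected) > 0 {
		// The shard is recomputed from the queriers that are still connected.
		shard := selectShard(tenant, connected, shardSize)
		if len(shard) == 0 {
			break
		}
		victim := shard[0]
		crashed = append(crashed, victim)

		// Drop the crashed querier from the connected set.
		remaining := connected[:0]
		for _, q := range connected {
			if q != victim {
				remaining = append(remaining, q)
			}
		}
		connected = remaining
	}
	return crashed
}

func main() {
	queriers := []string{"q-1", "q-2", "q-3", "q-4", "q-5", "q-6"}
	crashed := simulateCrashes("tenant-a", queriers, 2)
	fmt.Printf("shard size 2, crashed %d of %d queriers\n", len(crashed), len(queriers))
}
```

Even with a shard size of 2, the crash list in this model grows until it contains every querier, because each disconnect pulls a fresh querier into the tenant's shard.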

In practice, crashed queriers should restart quickly, which makes them part of the user's shard again, but the potential problem still exists.
