-
Notifications
You must be signed in to change notification settings - Fork 832
Closed
Labels
keepaliveSkipped by stale botSkipped by stale bot
Description
Currently, querier.worker-parallelism
is the number of parallel queries that can be processed per querier per frontend. In an effort to simplify operating cortex, this could be changed to a max parallelism configuration per querier irrespective of the number of frontends. Changing this would alleviate the common practice of determining querier parallelism -> dividing by n_frontends -> setting querier.worker-parallelism
.
Additionally, the queriers already have a DNS watch loop running in order to add/drop ephemeral frontends, which can be used to readjust the derived per-frontend limits.
To maintain backwards compatibility, this change could be effected by adding a new flag and marking querier.worker-parallelism
deprecated if necessary.
Metadata
Metadata
Assignees
Labels
keepaliveSkipped by stale botSkipped by stale bot