
querier.worker-parallelism should be derivable #1883

@owen-d


Currently, querier.worker-parallelism is the number of queries each querier can process in parallel per frontend. To simplify operating Cortex, this could be changed to a maximum parallelism setting per querier, irrespective of the number of frontends. That would remove the common practice of deciding on a total querier parallelism, dividing it by n_frontends, and setting querier.worker-parallelism to the result.

Additionally, the queriers already run a DNS watch loop to add and drop ephemeral frontends; the same loop could readjust the derived per-frontend limits whenever the set of frontends changes.
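
As a rough illustration of the derivation, here is a minimal sketch of splitting a per-querier maximum across the frontends currently known from DNS. The function name perFrontendParallelism and its parameters are hypothetical and not part of the Cortex codebase; it only shows the arithmetic that the watch loop could rerun whenever a frontend is added or dropped.

```go
package main

import "fmt"

// perFrontendParallelism is a hypothetical helper (not a Cortex API).
// It splits maxParallelism workers across nFrontends as evenly as possible,
// so the total never exceeds maxParallelism. The first (maxParallelism %
// nFrontends) frontends receive one extra worker.
func perFrontendParallelism(maxParallelism, nFrontends int) []int {
	if maxParallelism <= 0 || nFrontends <= 0 {
		return nil
	}
	out := make([]int, nFrontends)
	base := maxParallelism / nFrontends
	rem := maxParallelism % nFrontends
	for i := range out {
		out[i] = base
		if i < rem {
			out[i]++
		}
	}
	return out
}

func main() {
	// A querier capped at 10 parallel queries, with 3 frontends resolved.
	fmt.Println(perFrontendParallelism(10, 3)) // [4 3 3]
	// A fourth frontend appears in DNS: the same cap is redistributed.
	fmt.Println(perFrontendParallelism(10, 4)) // [3 3 2 2]
}
```

One design question this sketch leaves open: when there are more frontends than maxParallelism, some frontends get zero workers. An implementation might prefer a floor of one worker per frontend, at the cost of exceeding the configured maximum.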

To maintain backwards compatibility, this change could be effected by adding a new flag and marking querier.worker-parallelism deprecated if necessary.

Labels: keepalive (skipped by stale bot)