Skip to content

Commit 435c753

Browse files
authored
Update CortexProvisioningTooManyActiveSeries to 3.2M series per ingester (#59)
* Update CortexProvisioningTooManyActiveSeries to 3.2M series per ingester Signed-off-by: Friedrich Gonzalez <[email protected]> * Adjust more things Signed-off-by: Friedrich Gonzalez <[email protected]> * Update CHANGELOG.md --------- Signed-off-by: Friedrich Gonzalez <[email protected]>
1 parent 0dad989 commit 435c753

File tree

4 files changed

+10
-9
lines changed

4 files changed

+10
-9
lines changed

CHANGELOG.md

+1
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@
44
* [CHANGE] Use cortex v1.17.1
55
* [CHANGE] Enable shuffle sharding in compactors
66
* [CHANGE] Remove chunks support for dashboards
7+
* [CHANGE] Target 3M memory series per ingester instead of 1.5M
78
* [CHANGE] Update jsonnet-libs to Fri Jul 19 12:51:49 2024 #57
89
* [ENHANCEMENT] Configure `-ingester.client.grpc-compression` to be `snappy-block`
910
* [ENHANCEMENT] Support Grafana 11 in Cortex Service Scaling Dashboard

cortex-mixin/alerts/alerts.libsonnet

+3-3
Original file line numberDiff line numberDiff line change
@@ -389,11 +389,11 @@
389389
rules: [
390390
{
391391
alert: 'CortexProvisioningTooManyActiveSeries',
392-
// We target each ingester to 1.5M in-memory series. This alert fires if the average
393-
// number of series / ingester in a Cortex cluster is > 1.6M for 2h (we compact
392+
// We target each ingester to 3.0M in-memory series. This alert fires if the average
393+
// number of series / ingester in a Cortex cluster is > 3.2M for 2h (we compact
394394
// the TSDB head every 2h).
395395
expr: |||
396-
avg by (%s) (cortex_ingester_memory_series) > 1.6e6
396+
avg by (%s) (cortex_ingester_memory_series) > 3.2e6
397397
||| % [$._config.alert_aggregation_labels],
398398
'for': '2h',
399399
labels: {

cortex-mixin/docs/playbooks.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -555,13 +555,13 @@ How to **investigate**:
555555
556556
### CortexProvisioningTooManyActiveSeries
557557
558-
This alert fires if the average number of in-memory series per ingester is above our target (1.5M).
558+
This alert fires if the average number of in-memory series per ingester is above our target (3.0M).
559559
560560
How to **fix**:
561561
- Scale up ingesters
562562
- To find out the Cortex clusters where ingesters should be scaled up and how many minimum replicas are expected:
563563
```
564-
ceil(sum by(cluster, namespace) (cortex_ingester_memory_series) / 1.5e6) >
564+
ceil(sum by(cluster, namespace) (cortex_ingester_memory_series) / 3.0e6) >
565565
count by(cluster, namespace) (cortex_ingester_memory_series)
566566
```
567567
- After the scale up, the in-memory series are expected to be reduced at the next TSDB head compaction (occurring every 2h)
@@ -595,7 +595,7 @@ How to **fix**:
595595
kubectl -n <namespace> delete pod ingester-XXX
596596
```
597597
- Restarting an ingester typically reduces the memory allocated by mmap-ed files. After the restart, ingester may allocate this memory again over time, but it may give more time while working on a longer term solution
598-
- Check the `Cortex / Writes Resources` dashboard to see if the number of series per ingester is above the target (1.5M). If so:
598+
- Check the `Cortex / Writes Resources` dashboard to see if the number of series per ingester is above the target (3.0M). If so:
599599
- Scale up ingesters
600600
- Memory is expected to be reclaimed at the next TSDB head compaction (occurring every 2h)
601601

cortex-mixin/recording_rules.libsonnet

+3-3
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ local utils = import 'mixin-utils/utils.libsonnet';
22

33
{
44
local _config = {
5-
max_series_per_ingester: 1.5e6,
5+
max_series_per_ingester: 3.0e6,
66
max_samples_per_sec_per_ingester: 80e3,
77
max_samples_per_sec_per_distributor: 240e3,
88
limit_utilisation_target: 0.6,
@@ -148,7 +148,7 @@ local utils = import 'mixin-utils/utils.libsonnet';
148148
||| % _config,
149149
},
150150
{
151-
// Ingester should have 1.5M series in memory
151+
// Ingester should have 3.0M series in memory
152152
record: 'cluster_namespace_deployment_reason:required_replicas:count',
153153
labels: {
154154
deployment: 'ingester',
@@ -167,7 +167,7 @@ local utils = import 'mixin-utils/utils.libsonnet';
167167
},
168168
{
169169
// We should be about to cover 60% of our limits,
170-
// and ingester can have 1.5M series in memory
170+
// and ingester can have 3.0M series in memory
171171
record: 'cluster_namespace_deployment_reason:required_replicas:count',
172172
labels: {
173173
deployment: 'ingester',

0 commit comments

Comments
 (0)