Ingestion latency spikes when discarding metrics due to limits #1301

gouthamve · 2019-03-25T08:06:19Z

We're seeing consistent spikes in ingester /cortex.Ingester/Push latency when there is an uptick in the samples dropped due to limits.

Needs to be investigated further.

The text was updated successfully, but these errors were encountered:

bboreham · 2019-08-07T10:21:53Z

@gouthamve do you see any improvement after #1497 ?

bboreham · 2019-12-13T21:08:02Z

I have a theory what causes this.

For each rejection, userState.getSeries() calls httpgrpc.ErrorFromHTTPResponse(), which does a lot of memory-allocating, then returns it to Ingester.Push() which calls httpgrpc.HTTPResponseFromError() which does even more memory-allocating. Then Push() throws it all away except for the last error.

I recommend we use an error object that just holds a reference to the labels until called upon to format itself.

This was referenced Jul 10, 2019

Reduce contention on discarded samples counters #1497

Merged

Ingester blowing up to tens of thousands of goroutines #858

Closed

bboreham mentioned this issue Dec 18, 2019

Reduce memory usage from ingester Push() errors #1922

Merged

bboreham closed this as completed in #1922 Dec 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ingestion latency spikes when discarding metrics due to limits #1301

Ingestion latency spikes when discarding metrics due to limits #1301

gouthamve commented Mar 25, 2019

bboreham commented Aug 7, 2019

bboreham commented Dec 13, 2019

Ingestion latency spikes when discarding metrics due to limits #1301

Ingestion latency spikes when discarding metrics due to limits #1301

Comments

gouthamve commented Mar 25, 2019

bboreham commented Aug 7, 2019

bboreham commented Dec 13, 2019