Skip to content

Conversation

alexqyle
Copy link
Contributor

@alexqyle alexqyle commented Jul 23, 2024

What this PR does:

Currently, Cortex would retry any error from compaction. However, some error would be for sure fail again on retry. This PR adds logic to differentiate halt and retriable error (from Thanos). Cortex compactor should only retry compaction on retriable errors. Also, added metrics to count halt and retriable errors separately

Which issue(s) this PR fixes:
Fixes #

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

Signed-off-by: Alex Le <[email protected]>
Signed-off-by: Alex Le <[email protected]>
Signed-off-by: Alex Le <[email protected]>
@yeya24 yeya24 merged commit 42d7327 into cortexproject:master Jul 24, 2024
@alexqyle alexqyle deleted the retriable-and-halt-error branch July 25, 2024 00:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants