You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* fix `MimirIngesterStuckProcessingRecordsFromKafka`
The alert `MimirIngesterStuckProcessingRecordsFromKafka` relied on the metric `cortex_ingest_storage_reader_buffered_fetch_records_total ` provided by the Kafka client to identify wether we had stuck buffers or not.
Now that we've implemented concurrent fetching from Kafka and bypass the client's polling function we needed an equivalent metric when using concurrent fetching. This PR does that; In addition to that - the metric also takes the client's buffered records In case we do use a mixture of non-concurrent fetching and concurrent fetching.
* Add changelog
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* reestrcture metric assignment
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* Remove the registry
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* Fix helm
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* Protected the fetchers
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* Change log to debug
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* make `BufferedRecords` int64 and remove debug logs
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* Move buffered records increment location
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* Use atomic functions for locking / unlocking the client and fetcher.
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* assert on buffered records
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* reset the records buffer when `stop()` is called.
Signed-off-by: gotjosh <josue.abreu@gmail.com>
* Fix test
Signed-off-by: Marco Pracucci <marco@pracucci.com>
* Changed how buffered records are tracked, improved unit tests and used atomic instead of a mutex to protect client/fetcher access
Signed-off-by: Marco Pracucci <marco@pracucci.com>
* Fix
Signed-off-by: Marco Pracucci <marco@pracucci.com>
* Fix comment
Signed-off-by: Marco Pracucci <marco@pracucci.com>
* Fix comment
Signed-off-by: Marco Pracucci <marco@pracucci.com>
* Get back to Josh implementation of buffered records tracking which has better coverage of all buffered records
Signed-off-by: Marco Pracucci <marco@pracucci.com>
* Use atomic for fetcher too
Signed-off-by: Marco Pracucci <marco@pracucci.com>
---------
Signed-off-by: gotjosh <josue.abreu@gmail.com>
Signed-off-by: Marco Pracucci <marco@pracucci.com>
Co-authored-by: Marco Pracucci <marco@pracucci.com>
Copy file name to clipboardExpand all lines: CHANGELOG.md
+1Lines changed: 1 addition & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -76,6 +76,7 @@
76
76
### Mixin
77
77
78
78
*[CHANGE] Remove backwards compatibility for `thanos_memcached_` prefixed metrics in dashboards and alerts removed in 2.12. #9674#9758
79
+
*[CHANGE] Reworked the alert `MimirIngesterStuckProcessingRecordsFromKafka` to also work when concurrent fetching is enabled. #9855
79
80
*[ENHANCEMENT] Unify ingester autoscaling panels on 'Mimir / Writes' dashboard to work for both ingest-storage and non-ingest-storage autoscaling. #9617
80
81
*[ENHANCEMENT] Alerts: Enable configuring job prefix for alerts to prevent clashes with metrics from Loki/Tempo. #9659
81
82
*[ENHANCEMENT] Dashboards: visualize the age of source blocks in the "Mimir / Compactor" dashboard. #9697
Copy file name to clipboardExpand all lines: operations/helm/tests/metamonitoring-values-generated/mimir-distributed/templates/metamonitoring/mixin-alerts.yaml
0 commit comments