fix MimirIngesterStuckProcessingRecordsFromKafka
#9855
Conversation
@@ -226,14 +229,8 @@ type concurrentFetchers struct {

	// trackCompressedBytes controls whether to calculate MaxBytes for fetch requests based on previous responses' compressed or uncompressed bytes.
	trackCompressedBytes bool
}

// Stop implements fetcher
This is just a move as-is - these methods seemed oddly placed above the constructor.
You improved a unit test to check that the value is 0 when done, but there's no unit test asserting that the value is actually tracked. Can we add a test, running for the 4 concurrency variants, asserting that the value matches the number of buffered records? I think we can implement a mocked "pusher" that doesn't process records (it just hangs) so the next records are buffered in the client, then we assert on the metric, and finally we release the pusher to end the test.
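A minimal sketch of what such a blocking "pusher" could look like, using a hypothetical record type and PushRecords signature rather than Mimir's actual ones: it hangs until released, so fetched records stay buffered and the metric can be asserted before the release.

```go
package ingest

import (
	"context"
	"sync/atomic"
)

// record stands in for whatever record type the pusher consumes.
type record struct {
	value []byte
}

// blockingPusher simulates a consumer that never finishes processing until
// released, so records fetched in the meantime stay buffered upstream.
type blockingPusher struct {
	received atomic.Int64
	release  chan struct{}
}

func newBlockingPusher() *blockingPusher {
	return &blockingPusher{release: make(chan struct{})}
}

// PushRecords counts the records it received and then hangs until the test
// calls Release, mimicking a stuck consumer.
func (p *blockingPusher) PushRecords(ctx context.Context, recs []record) error {
	p.received.Add(int64(len(recs)))
	select {
	case <-p.release:
		return nil
	case <-ctx.Done():
		return ctx.Err()
	}
}

// Release unblocks all pending PushRecords calls so the test can finish.
func (p *blockingPusher) Release() {
	close(p.release)
}
```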
pkg/storage/ingest/reader.go
Outdated

-	r.committer = newPartitionCommitter(r.kafkaCfg, kadm.NewClient(r.client), r.partitionID, r.consumerGroup, r.logger, r.reg)
+	r.committer = newPartitionCommitter(r.kafkaCfg, kadm.NewClient(r.getClient()), r.partitionID, r.consumerGroup, r.logger, r.reg)
I think I don't need to rework these arguments as closures here (e.g. func() *kgo.Client { return r.getClient() }) because this only happens within start, and from my understanding start, stop and run are guaranteed to never run concurrently - but please let me know if I'm wrong.
Nice work! LGTM
pkg/storage/ingest/fetcher.go
Outdated

@@ -496,6 +510,7 @@ func (r *concurrentFetchers) run(ctx context.Context, wants chan fetchWant, logg
			attemptSpan.SetTag("attempt", attempt)

			f := r.fetchSingle(ctx, w)
			r.bufferedFetchedRecords.Add(int64(len(f.FetchPartition.Records)))
If something calls Stop() before these records are pushed to orderedFetches, then the atomic counter will always be positive. Should we decrement the counter if we give up on sending these records? I'm thinking about the cases in this select which end up returning:
select {
case <-r.done:
	wantSpan.Finish()
	attemptSpan.Finish()
	close(w.result)
	return
case w.result <- f:
	previousResult = fetchResult{}
case <-ctx.Done():
default:
	if w.startOffset >= w.endOffset {
		// We've fetched all we were asked for; the whole batch is ready, and we definitely have to wait to send on the channel now.
		f.startWaitingForConsumption()
		select {
		case <-r.done:
			wantSpan.Finish()
			attemptSpan.Finish()
			close(w.result)
			return
		case w.result <- f:
			previousResult = fetchResult{}
		case <-ctx.Done():
		}
	}
}
At this point you can also try to unify all the places which do cleanup actions (like finishing spans, closing channels, and now also decrementing the atomic counter).
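As a rough illustration of that unified-cleanup idea (a self-contained sketch with simplified names, not the actual fetcher code): every branch that gives up on delivering the records calls the same closure, so the counter decrement can't be forgotten on one path.

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// sendOrGiveUp increments the buffered counter, then either delivers the
// records or runs a single cleanup closure that undoes the increment.
func sendOrGiveUp(buffered *atomic.Int64, result chan<- int, done <-chan struct{}, records int) {
	buffered.Add(int64(records))

	giveUp := func() {
		// The records never reached the consumer: undo the increment and
		// close the result channel, the same actions on every abort path.
		buffered.Add(-int64(records))
		close(result)
	}

	select {
	case <-done:
		giveUp()
	case result <- records:
		// Delivered: the consumer now owns the decrement.
	}
}

func main() {
	var buffered atomic.Int64
	done := make(chan struct{})
	close(done) // simulate Stop() arriving before the records are delivered

	result := make(chan int)
	sendOrGiveUp(&buffered, result, done, 5)
	fmt.Println("buffered after giving up:", buffered.Load()) // prints 0
}
```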
> if something calls Stop() before these records are pushed to orderedFetches then the atomic counter will always be positive

I thought about it too, but do we really care?
- If the stop is called at shutdown, we don't care.
- If the stop is called because we're moving from the concurrent fetcher at startup to the sequential fetcher when ongoing, we don't care because the fetcher reference is trashed.
- If the stop is called because we're updating the concurrent fetcher config (Update()), then we care... so what if we simply reset the buffer in Stop() after the r.wg.Wait() to keep it simple?
> If the stop is called because we're updating the concurrent fetcher config (Update()), then we care... so what if we simply reset the buffer in Stop() after the r.wg.Wait() to keep it simple?
yeah that's another option 👍
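A minimal sketch of that reset-in-Stop() option, using a stand-in struct rather than the real concurrentFetchers (field names are assumptions): once wg.Wait() returns, no goroutine can touch the counter anymore, so zeroing it is safe.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// fetchers is a stand-in for the real struct, only to show the idea.
type fetchers struct {
	wg                     sync.WaitGroup
	done                   chan struct{}
	bufferedFetchedRecords atomic.Int64
}

func (f *fetchers) Stop() {
	close(f.done)
	f.wg.Wait()
	// All fetch goroutines have exited, so nothing increments the counter
	// anymore: reset it so the buffered-records metric can't stay stuck
	// at a positive value after the fetchers are recreated.
	f.bufferedFetchedRecords.Store(0)
}

func main() {
	f := &fetchers{done: make(chan struct{})}
	f.bufferedFetchedRecords.Add(5) // pretend some fetched records were never delivered
	f.Stop()
	fmt.Println(f.bufferedFetchedRecords.Load()) // prints 0
}
```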
I'm adding some more unit tests and I found an issue. The way we currently increase the number of records is wrong because the same record may be concurrently fetched by multiple routines. I'm working on a fix.
> I'm adding some more unit tests and I found an issue. The way we currently increase the number of records is wrong because the same record may be concurrently fetched by multiple routines. I'm working on a fix.

Not true. It was my test that was not doing the assertion correctly. I'm getting back to Josh's implementation.
Can you explain why we need the mutex around the fetcher and the client?

The client is set in the starting function, but there's no guarantee metrics aren't scraped in the meantime, so that's a race condition. Same for the fetcher, which could also be updated later if we change it between startup and ongoing, so that's a race condition too.

😿 Go guarantees atomic pointer swaps, so technically we should be safe. Was the race detector complaining?

Does it? So why

Whole-word pointer operations are atomic, but their ordering with respect to surrounding code is not guaranteed without the atomic primitives. That's what the race detector complains about. Many of the whole-word atomic types boil down to using different assembly instructions that make sure dirty CPU cache entries are invalidated appropriately.
People, let's keep it simple and safe, following the Go advice. I really don't want to reason about whether unsynchronised concurrent access is safe or not across all architectures we support. From the Go memory model, first line:

I can use an atomic.Pointer though.
If we can solve this with atomic.Pointer, I'd prefer that to a mutex. An atomic forces you to use it in a synchronised way, whereas you have to remember to use a mutex. But I'm aware this might be on the bike-shedding side of the spectrum, so you can merge as-is too.
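A sketch of the atomic.Pointer approach under discussion, with a stand-in client type instead of the real kgo.Client: the metrics path can read the pointer concurrently with the starting function storing it, without a mutex.

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// client is a stand-in for the Kafka client; only a buffered-records count matters here.
type client struct {
	buffered int64
}

type reader struct {
	client atomic.Pointer[client]
}

// setClient is what the starting function would call once the client is created.
func (r *reader) setClient(c *client) {
	r.client.Store(c)
}

// bufferedRecords can be called from the metrics collector at any time,
// concurrently with setClient, without extra locking.
func (r *reader) bufferedRecords() int64 {
	c := r.client.Load()
	if c == nil {
		return 0 // not started yet
	}
	return c.buffered
}

func main() {
	r := &reader{}
	fmt.Println(r.bufferedRecords()) // 0: client not set yet
	r.setClient(&client{buffered: 42})
	fmt.Println(r.bufferedRecords()) // 42
}
```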
…d atomic instead of a mutex to protect client/fetcher access Signed-off-by: Marco Pracucci <marco@pracucci.com>
@dimitarvdimitrov I reworked this PR, doing the following changes:
	// We need to make sure we don't leak any goroutine given that start is called within a goroutine.
	defer r.wg.Done()
Note to reviewers: moved here just to group it with r.wg.Add().
	// Slowly produce more records while processing is slow too. This increases the chances
	// of progressive fetches done by the consumer.
Note to reviewers: debugging this test I realised it wasn't really simulating slow processing of fetches, because it was always fetching all 10 records in a single PollFetches() call.
	var wg sync.WaitGroup
	wg.Add(1)
	go func() {
		defer wg.Done()
Note to reviewers: removed the PollFetches() call from the goroutine because it was superfluous here.
…s better coverage of all buffered records Signed-off-by: Marco Pracucci <marco@pracucci.com>
thanks!
What this PR does
The alert `MimirIngesterStuckProcessingRecordsFromKafka` relied on the metric `cortex_ingest_storage_reader_buffered_fetch_records_total`, provided by the Kafka client, to identify whether we had stuck buffers or not. Now that we've implemented concurrent fetching from Kafka and bypass the client's polling function, we needed an equivalent metric when using concurrent fetching. This PR does that. In addition, the metric also takes into account the client's buffered records, in case we use a mixture of non-concurrent and concurrent fetching.
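A rough sketch of how such a combined metric can be computed (illustrative names, not the actual Mimir code): the gauge sums the client-side buffer with the concurrent-fetcher buffer, so it stays meaningful in both fetching modes.

```go
package main

import "fmt"

// recordBuffer is a stand-in for anything that can report how many records it
// has buffered, e.g. the Kafka client or the concurrent fetchers.
type recordBuffer interface {
	BufferedRecords() int64
}

type clientBuffer struct{ n int64 }

func (c clientBuffer) BufferedRecords() int64 { return c.n }

type fetcherBuffer struct{ n int64 }

func (f fetcherBuffer) BufferedRecords() int64 { return f.n }

// totalBufferedRecords sums every source, so the gauge covers records buffered
// by the client (sequential fetching) and by the concurrent fetchers.
func totalBufferedRecords(sources ...recordBuffer) int64 {
	var total int64
	for _, s := range sources {
		if s != nil {
			total += s.BufferedRecords()
		}
	}
	return total
}

func main() {
	fmt.Println(totalBufferedRecords(clientBuffer{n: 3}, fetcherBuffer{n: 7})) // 10
}
```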
Which issue(s) this PR fixes or relates to
N/A
Checklist
- `CHANGELOG.md` updated - the order of entries should be `[CHANGE]`, `[FEATURE]`, `[ENHANCEMENT]`, `[BUGFIX]`.
- `about-versioning.md` updated with experimental features.