ListOffsetsRequest should only be sent to the leader replica #4616

kphelps · 2024-02-12T15:09:51Z

When using fetch-from-follower, a consumer can currently get stuck in a loop sending ListOffsetsRequest when it goes through the rd_kafka_offset_reset path, because the request is sent to the preferred replica. Instead, always send it to the leader.
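
The intended routing can be sketched as follows. This is a minimal illustration, not librdkafka's actual code: the `partition_state_t` struct, its fields, and the function names are hypothetical, and a negative `preferred_replica_id` is assumed to mean that no preferred replica is set.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical per-partition view; not a librdkafka type. */
typedef struct {
    int32_t leader_id;            /* broker id of the partition leader */
    int32_t preferred_replica_id; /* < 0 when no preferred replica is set */
} partition_state_t;

/* Fetch requests may go to the preferred replica (fetch-from-follower). */
int32_t broker_for_fetch(const partition_state_t *p) {
    return p->preferred_replica_id >= 0 ? p->preferred_replica_id
                                        : p->leader_id;
}

/* ListOffsets requests always go to the leader, as proposed here. */
int32_t broker_for_list_offsets(const partition_state_t *p) {
    return p->leader_id;
}

/* Small self-check of the routing rule. */
void routing_demo(void) {
    partition_state_t p = {.leader_id = 1, .preferred_replica_id = 3};
    assert(broker_for_fetch(&p) == 3);        /* follower serves fetches */
    assert(broker_for_list_offsets(&p) == 1); /* leader serves ListOffsets */
}
```

With this split, an offset reset no longer depends on whether the preferred replica is willing to answer ListOffsets.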

emasab · 2024-02-20T13:14:44Z

It's correct to send the ListOffsets request to the preferred replica. The loop probably comes from this discovered bug:
#4620

When enabling debug logs, could you check if it's receiving FENCED_LEADER_EPOCH errors?

kphelps · 2024-02-21T17:46:54Z

Nope, I'm seeing NOT_LEADER_OR_FOLLOWER errors.

kphelps · 2024-02-21T17:53:28Z

Aha, from KIP-392:

The FetchRequest schema has field for the replica id. Consumers typically use the sentinel -1, which indicates that fetching is only allowed from the leader. A lesser known sentinel is -2, which was originally intended to be used for debugging and allows fetching from followers. We propose to let the consumer use this to indicate the intent to allow fetching from a follower. Similarly, when we need to send a ListOffsets request to a follower in order to find the log start offset, we will use the same sentinel for the replica id field.

Looks like we unconditionally set the replica id to -1 here

Looks like the Java client opts to just always send to the leader. WDYT?
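
The sentinel semantics quoted from KIP-392 can be modeled roughly as below. This is a sketch under stated assumptions: the enum and function names are made up for illustration, and the check only mirrors the behaviour described in this thread (a non-leader broker serves the request only when the replica id is -2).

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Sentinel replica ids as described in the KIP-392 excerpt. */
enum {
    REPLICA_ID_CONSUMER  = -1, /* ordinary client: leader only */
    REPLICA_ID_DEBUGGING = -2  /* may be served by a follower */
};

/* Hypothetical model of the broker-side check: can a broker that is
 * (or is not) the leader serve a request carrying this replica id? */
bool request_allowed(int32_t replica_id, bool broker_is_leader) {
    if (broker_is_leader)
        return true;
    return replica_id == REPLICA_ID_DEBUGGING;
}

/* Small self-check of the sentinel behaviour. */
void sentinel_demo(void) {
    /* A client using -1 is rejected by a follower ... */
    assert(!request_allowed(REPLICA_ID_CONSUMER, false));
    /* ... but accepted when it uses -2, or when it talks to the leader. */
    assert(request_allowed(REPLICA_ID_DEBUGGING, false));
    assert(request_allowed(REPLICA_ID_CONSUMER, true));
}
```

Under this model, a ListOffsets request sent to the preferred replica with replica id -1 fails, which would be consistent with the NOT_LEADER_OR_FOLLOWER errors reported above.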

emasab · 2024-03-05T19:20:53Z

@kphelps
The replica id should be set to -1 in clients, and to the broker id in followers, see the RPC definition
https://github.com/apache/kafka/blob/2f401ff4c85f6797391b8a3dd57d651f4de3d6ad/clients/src/main/resources/common/message/ListOffsetsRequest.json#L42

The error NOT_LEADER_OR_FOLLOWER happens when the broker isn't a replica for that partition.
In that case librdkafka refreshes metadata to get the leader again, here.

librdkafka/src/rdkafka_request.c, line 935 in a6d85bd:

rd_kafka_metadata_refresh_known_topics(rk, NULL,

Is it possible to reproduce the issue and send a log with "debug": "all" or "debug": "consumer,cgrp,topic,fetch,metadata,broker"?

kphelps · 2024-03-05T22:46:46Z

I'm working to reproduce this now, but have been having trouble in a controlled environment. Will share that when I get it.

The broker only allows fetching from the leader unless the replica id is set to -2 here, which propagates down to retrieving the local log and erroring here.

kphelps · 2024-03-21T18:51:20Z

Found a test that was silently failing due to this issue.

emasab · 2024-03-26T19:17:19Z

Thanks @kphelps. I checked this issue in more depth and understood the problem. It's different from what I linked and, as you said, could be solved in two ways: by sending the request to the follower with -2, or to the leader as Java does.

The downside of sending it to the leader is that, in case the follower is lagging behind, there could be further offset resets when fetching until the follower has caught up. I've checked the broker code and tried using -2 by changing the mock cluster implementation, and it works too.

Will ask for an opinion internally too before deciding on one of the two solutions.

emasab · 2024-04-10T11:11:20Z

We cannot fix it by sending the request to the follower, because there is a problem: if the replica id were different from CONSUMER_REPLICA_ID (-1), the isolation level parameter would be ignored. So I'm following @kphelps's proposal and using the same behaviour as Java: send the request to the leader only.

broker code:

            val fetchOnlyFromLeader = offsetRequest.replicaId != ListOffsetsRequest.DEBUGGING_REPLICA_ID
            val isClientRequest = offsetRequest.replicaId == ListOffsetsRequest.CONSUMER_REPLICA_ID
            val isolationLevelOpt = if (isClientRequest)
              Some(offsetRequest.isolationLevel)
            else
              None
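
To make the consequence concrete, here is a C transliteration of the Scala fragment above. It is only a sketch: the struct and function names are hypothetical, `has_isolation_level == false` stands in for Scala's `None`, and isolation level 1 is READ_COMMITTED in the Kafka protocol.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define CONSUMER_REPLICA_ID  (-1)
#define DEBUGGING_REPLICA_ID (-2)

/* Hypothetical result type modelling how the broker treats a request. */
typedef struct {
    bool fetch_only_from_leader;
    bool has_isolation_level; /* false models Scala's None */
    int8_t isolation_level;   /* meaningful only if has_isolation_level */
} list_offsets_handling_t;

/* Mirrors the three vals in the Scala fragment. */
list_offsets_handling_t classify_list_offsets(int32_t replica_id,
                                              int8_t isolation_level) {
    list_offsets_handling_t h;
    bool is_client_request = (replica_id == CONSUMER_REPLICA_ID);
    h.fetch_only_from_leader = (replica_id != DEBUGGING_REPLICA_ID);
    h.has_isolation_level = is_client_request;
    h.isolation_level = is_client_request ? isolation_level : 0;
    return h;
}

/* Small self-check: -2 unlocks followers but drops the isolation level. */
void isolation_demo(void) {
    list_offsets_handling_t client = classify_list_offsets(-1, 1);
    list_offsets_handling_t debug = classify_list_offsets(-2, 1);
    assert(client.fetch_only_from_leader && client.has_isolation_level);
    assert(!debug.fetch_only_from_leader && !debug.has_isolation_level);
}
```

In this model there is no replica id that both allows a follower to answer and keeps the requested isolation level, which is the reason given above for sending the request to the leader.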

emasab · 2024-04-10T12:08:24Z

/sem-approve

emasab · 2024-06-10T11:49:44Z

/sem-approve

emasab · 2024-06-12T12:00:22Z

@kphelps sorry, given that we're having an issue with the public CI, I've created this internal branch with your changes: #4754

davidblewett mentioned this pull request Mar 5, 2024

ListOffsets loop of failed requests on leader epoch change until timeout happens #4620

Closed

7 tasks

ListOffsetsRequest should only be sent to the leader replica

341c62d

kphelps force-pushed the kphelps/list-offsets-leader branch from dfb9e3e to 341c62d Compare March 21, 2024 18:49

emasab added the bug label Mar 26, 2024

Add CHANGELOG entry

90d269e

removed test import

emasab force-pushed the kphelps/list-offsets-leader branch from 0c86070 to 90d269e Compare April 10, 2024 12:06

Merge branch 'master' into kphelps/list-offsets-leader

238e533

Merge branch 'master' into kphelps/list-offsets-leader

2bb84dd

emasab requested a review from a team as a code owner June 10, 2024 11:48

emasab mentioned this pull request Jun 12, 2024

ListOffsetsRequest should only be sent to the leader replica #4753

Closed

emasab mentioned this pull request Jun 12, 2024

ListOffsetsRequest should only be sent to the leader replica (CI) #4754

Merged

kphelps closed this Jul 22, 2024
