Aggregate `memory_units_t` in `fetch` to reduce cross-shard calls #12092

ballard26 · 2023-07-14T05:55:13Z

By aggregating `memory_units_t` from the same shard, we reduce the number of cross-shard function calls from linear in the number of fetched partitions to 1.
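To illustrate the idea, here is a minimal sketch in plain C++ (no Seastar). It is not the code from this PR: `partition_units` and `aggregate_by_shard` are hypothetical names, and the real `memory_units_t` holds semaphore units rather than raw byte counts. The sketch only shows how per-partition units can be summed per owning shard, so that each shard needs one release call instead of one per partition.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <vector>

// Hypothetical stand-in for the units held by one partition read.
struct partition_units {
    std::uint32_t shard; // shard that owns the semaphore the units came from
    std::size_t bytes;   // number of units held against that semaphore
};

// Sum the held units per owning shard. Each entry in the result corresponds
// to one cross-shard release call, instead of one call per partition.
std::map<std::uint32_t, std::size_t>
aggregate_by_shard(const std::vector<partition_units>& units) {
    std::map<std::uint32_t, std::size_t> totals;
    for (const auto& u : units) {
        if (u.bytes == 0) {
            continue; // nothing held, so no call is needed for this entry
        }
        totals[u.shard] += u.bytes;
    }
    // Aggregation never needs more calls than the per-partition approach.
    assert(totals.size() <= units.size());
    return totals;
}
```

With units from partitions on shards 0, 1, and 0 again, the result has two entries, so two release calls replace three.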

Backports Required

Release Notes

none

src/v/kafka/server/handlers/fetch.cc

StephanDollberg · 2023-07-14T13:57:33Z

src/v/kafka/server/handlers/fetch.cc

@StephanDollberg

By aggregating `memory_units_t` from the same shard, we reduce the number of cross-shard function calls from linear in the number of fetched partitions to 1.

From @StephanDollberg: don't submit when empty, and check semaphore counts instead of `has_data`.

Co-authored-by: Stephan Dollberg <stephan@redpanda.com>

michael-redpanda · 2023-07-14T16:54:22Z

src/v/kafka/server/handlers/fetch.cc

@@ -263,7 +275,7 @@ static ss::future<read_result> do_read_from_ntp(
  ssx::semaphore& memory_sem,
  ssx::semaphore& memory_fetch_sem) {
    // control available memory
-    read_result::memory_units_t memory_units;
+    read_result::memory_units_t memory_units(memory_sem, memory_fetch_sem);


was this a latent bug?

Yeah, if for some reason `read_from_partition` reads data even though `skip_read` is set, a segfault will occur when we try to adjust the semaphore units in `memory_units_t`. This doesn't happen today, but one never knows what the future holds.

The change avoids this potential issue by initializing the semaphore units to a reasonable default state.
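The difference between the two constructors can be sketched without Seastar. Everything below is a simplified assumption: `simple_semaphore` and `bound_units` are hypothetical types, not `ssx::semaphore` or the real `memory_units_t`. The point is only that units bound to their semaphore at construction start in a valid "zero units held" state, so a later adjustment has something to act on.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical, simplified counting semaphore (not ssx::semaphore).
class simple_semaphore {
public:
    explicit simple_semaphore(std::size_t count) : _available(count) {}
    std::size_t available() const { return _available; }
    void consume(std::size_t n) {
        assert(n <= _available);
        _available -= n;
    }
    void signal(std::size_t n) { _available += n; }

private:
    std::size_t _available;
};

// Units that are always bound to a semaphore. A freshly constructed object
// holds zero units, so adjust() never touches an unbound semaphore.
class bound_units {
public:
    explicit bound_units(simple_semaphore& sem) : _sem(&sem) {}
    bound_units(const bound_units&) = delete;
    bound_units& operator=(const bound_units&) = delete;
    ~bound_units() { _sem->signal(_held); }

    // Grow or shrink the number of held units to exactly `wanted`.
    void adjust(std::size_t wanted) {
        if (wanted > _held) {
            _sem->consume(wanted - _held);
        } else {
            _sem->signal(_held - wanted);
        }
        _held = wanted;
    }
    std::size_t held() const { return _held; }

private:
    simple_semaphore* _sem;
    std::size_t _held = 0;
};
```

Because `bound_units` has no default constructor in this sketch, there is no way to create an instance that lacks a semaphore, which is the property the one-line change in the diff is after.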

travisdowns · 2023-07-14T18:24:30Z

Looks good to me. If this is still a burden after these optimizations, one avenue for investigation would be to check whether, in practice, we return a small number of units for each fetch request, say hundreds of KB. In that case, we might reasonably let each core hold, say, 10 MB of free but not-yet-returned units from another core, and buffer/batch the units until that threshold is hit, which might coalesce a lot of messages into 1.
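A rough sketch of that batching idea, in plain C++ and under stated assumptions: `batched_unit_returner` is a hypothetical name, the `send` callback stands in for whatever cross-core call would return units to the owning core, and the threshold is whatever limit is chosen (10 MB in the comment above). None of this exists in the PR.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <utility>

// Hypothetical helper: buffers freed units locally and forwards them to the
// owning core in one call once the buffered amount reaches a threshold.
class batched_unit_returner {
public:
    batched_unit_returner(
      std::size_t threshold, std::function<void(std::size_t)> send)
      : _threshold(threshold)
      , _send(std::move(send)) {
        assert(_send); // a callable is required
    }

    // Record freed units; forward them only when the threshold is reached.
    void release(std::size_t bytes) {
        _pending += bytes;
        if (_pending >= _threshold) {
            flush();
        }
    }

    // Forward everything buffered so far (for example, on shutdown).
    void flush() {
        if (_pending == 0) {
            return;
        }
        _send(std::exchange(_pending, 0));
    }

    std::size_t pending() const { return _pending; }

private:
    std::size_t _threshold;
    std::function<void(std::size_t)> _send;
    std::size_t _pending = 0;
};
```

The trade-off is that up to `threshold` bytes of units per remote core stay unavailable to the owning core until the next flush, so the threshold bounds how much memory accounting can lag.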

github-actions bot added the area/redpanda label Jul 14, 2023

StephanDollberg reviewed Jul 14, 2023

src/v/kafka/server/handlers/fetch.cc (outdated, resolved)

StephanDollberg reviewed Jul 14, 2023

src/v/kafka/server/handlers/fetch.cc (outdated, resolved)

StephanDollberg approved these changes Jul 14, 2023

ballard26 force-pushed the memory-units-agg branch from 7cd6844 to 4ca51c1 Compare July 14, 2023 16:22

github-actions bot added the area/k8s label Jul 14, 2023

ballard26 force-pushed the memory-units-agg branch from 4ca51c1 to 8a4af64 Compare July 14, 2023 16:30

github-actions bot removed the area/k8s label Jul 14, 2023

ballard26 changed the title ~~Draft: aggregate memory_units_t to reduce cross-shard calls~~ Aggregate memory_units_t in fetch to reduce cross-shard calls Jul 14, 2023

ballard26 marked this pull request as ready for review July 14, 2023 16:36

ballard26 requested a review from dotnwat July 14, 2023 16:36

ballard26 force-pushed the memory-units-agg branch from 8a4af64 to 65c8ad6 Compare July 14, 2023 16:40

StephanDollberg approved these changes Jul 14, 2023

michael-redpanda reviewed Jul 14, 2023

ballard26 merged commit 6e50790 into redpanda-data:dev Jul 14, 2023
