Optimize sort followed by limit #6941

jedelbo · 2023-09-01T09:29:28Z

If the limit is much smaller than the total size of the TableView, then it is faster to make a sorted insert into a vector that is kept at the limit size.

What, How & Why?

☑️ ToDos

📝 Changelog update
🚦 Tests (or not relevant)
C-API, if public C++ API changed.

If the limit is much smaller than the total size of the TableView, then it is faster to make a sorted insert into a vector that is kept at the limit size.

ironage · 2023-09-01T20:00:00Z

Once we do #6933 we should be able to optimize this further by only inserting the limit number of elements into the vector in the first place instead of having an extra buffer like you do here. Given that we expect future optimizations, could you please move the benchmark out of the test and into benchmark-common-tasks so that we can track this case over time. If you could post that benchmark's results compared to current master, I'd also be curious to see what kind of improvement this actually is 😄
The actual code changes seem fine to me though; this is a sorely needed optimization 👍

tgoyne · 2023-09-01T20:11:19Z

src/realm/sort_descriptor.cpp

+        IndexPairs buffer;
+        buffer.reserve(limit + 1);
+        for (auto& elem : v) {
+            auto it = std::lower_bound(buffer.begin(), buffer.end(), elem, predicate);


If std::ref() is doing something useful in the std::sort() case then it will here too (it makes it so that the predicate is not copied when it's passed to the algorithm).

jedelbo · 2023-09-04T08:45:21Z

@ironage I am not sure how we can avoid the buffer. We cannot clear the buffer that holds the elements we should sort.
I have added a benchmark. You should be aware the we can get any performance gain we want by carefully selecting the original table view size and limit .In the test I added, the size and limit is 10000 and 100 respectively. This is the result

Req runs:  387  SortThenLimit (MemOnly, EncryptionOff):       min   1.26ms (-36.01%)           max   1.33ms (-35.29%)           med   1.28ms (-35.62%)           avg   1.28ms (-35.59%)           stddev    14us (-17.69%)

ironage

👍

cla-bot bot added the cla: yes label Sep 1, 2023

github-actions bot assigned jedelbo Sep 1, 2023

jedelbo requested review from astigsen and ironage September 1, 2023 09:29

jedelbo force-pushed the je/sort-limit branch from aaf922e to 25e7ba8 Compare September 1, 2023 09:30

Optimize sort followed by limit

19cc731

If the limit is much smaller than the total size of the TableView, then it is faster to make a sorted insert into a vector that is kept at the limit size.

jedelbo force-pushed the je/sort-limit branch from 25e7ba8 to 19cc731 Compare September 1, 2023 10:42

tgoyne reviewed Sep 1, 2023

View reviewed changes

Update after review

3b1223a

ironage approved these changes Sep 5, 2023

View reviewed changes

jedelbo merged commit 14215e8 into master Sep 7, 2023
24 of 27 checks passed

jedelbo deleted the je/sort-limit branch September 7, 2023 09:37

jedelbo mentioned this pull request Oct 5, 2023

Optimize SORT + LIMIT #4402

Closed

github-actions bot locked as resolved and limited conversation to collaborators Mar 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize sort followed by limit #6941

Optimize sort followed by limit #6941

jedelbo commented Sep 1, 2023

ironage commented Sep 1, 2023

tgoyne Sep 1, 2023

jedelbo Sep 4, 2023

jedelbo commented Sep 4, 2023

ironage left a comment

Optimize sort followed by limit #6941

Optimize sort followed by limit #6941

Conversation

jedelbo commented Sep 1, 2023

What, How & Why?

☑️ ToDos

ironage commented Sep 1, 2023

tgoyne Sep 1, 2023

Choose a reason for hiding this comment

jedelbo Sep 4, 2023

Choose a reason for hiding this comment

jedelbo commented Sep 4, 2023

ironage left a comment

Choose a reason for hiding this comment