Implement chunk iterators that drop the GIL #106
Conversation
Force-pushed from 42d05eb to 5997114
I think it's better to provide a limited default chunk size rather than using None as the default; unlimited doesn't seem like a sensible default behaviour.
iter_chunk still seems significantly slower, although the chunk size is pretty large (25000).
That's because it's doing 4x more work in less than 4x the time.
```rust
    backwards: bool,
    py: Python,
) -> PyResult<Vec<(PyObject, PyObject)>> {
    let raw_items = py.allow_threads(|| -> PyResult<Vec<(Box<[u8]>, Box<[u8]>)>> {
```
Code scanning / clippy warning: very complex type used. Consider factoring parts into type definitions.
This is now done.
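For reference, the usual fix for clippy's `type_complexity` lint is to factor the nested type into named aliases. A minimal pure-Rust sketch of that pattern, with the PyO3 types left out and all names illustrative only:

```rust
// Aliases for the nested tuple/vector type that clippy flags as
// "very complex". Names here are hypothetical, not from the PR.
type RawKvPair = (Box<[u8]>, Box<[u8]>);
type RawKvPairs = Vec<RawKvPair>;

// Stand-in for the work that the real code runs inside
// `py.allow_threads(...)`: produce raw key/value byte pairs.
fn collect_pairs() -> RawKvPairs {
    vec![(
        b"key".to_vec().into_boxed_slice(),
        b"value".to_vec().into_boxed_slice(),
    )]
}

fn main() {
    let pairs = collect_pairs();
    assert_eq!(pairs.len(), 1);
    assert_eq!(&*pairs[0].0, b"key");
    assert_eq!(&*pairs[0].1, b"value");
}
```

With the alias in place, the closure signature can read `PyResult<RawKvPairs>` instead of the fully spelled-out nested type, which satisfies the lint without changing behavior.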
@Congyuwang iter_chunk is faster. It's probably more helpful to look at the …
I'm comparing iter vs. iter_chunk, not multithreaded vs. single-threaded. Based on the previous benchmark, multithreaded iter_chunk seems slower than multithreaded iter where the GIL is not released.
Looks like we have some kind of deadlock.
This allows callers to fetch a chunk of items at a time from the iterator, and in such scenarios it makes sense to drop the GIL. Dropping the GIL is useful not only when there are multiple iterators, but also when the calling code may itself call other GIL-dropping code.
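The chunking pattern itself can be sketched in plain Rust (no PyO3): each call drains up to `chunk_size` items from the underlying iterator, and that batch is the unit of work during which the real implementation can release the GIL. The helper name below is illustrative, not the PR's API:

```rust
/// Drain up to `chunk_size` items from `iter` into a Vec.
/// An empty Vec signals that the iterator is exhausted.
fn next_chunk<I: Iterator>(iter: &mut I, chunk_size: usize) -> Vec<I::Item> {
    iter.by_ref().take(chunk_size).collect()
}

fn main() {
    let mut it = 0..10;
    assert_eq!(next_chunk(&mut it, 4), vec![0, 1, 2, 3]);
    assert_eq!(next_chunk(&mut it, 4), vec![4, 5, 6, 7]);
    assert_eq!(next_chunk(&mut it, 4), vec![8, 9]);
    assert!(next_chunk(&mut it, 4).is_empty());
}
```

Batching amortizes the cost of reacquiring the GIL: it is released once per chunk rather than once per item, which is why a sensible default `chunk_size` matters.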