Make ordered as the default behavior #54

wookayin · 2020-07-16T08:44:08Z

map() preserving the order is much more intuitive behavior. Python's builtin Pool executor, ray, joblib, etc. all work in such a way.

I realized that one can still pipe to pl.process.ordered, but the documentation is limited and this is quite difficult to use.

def slow_identity(x):
   time.sleep(random.random())
   return x

s = list(range(100)) | pl.process.map(slow_identity, workers=N)
list(s)     # should be ordered by default

The text was updated successfully, but these errors were encountered:

cgarciae · 2020-07-16T13:15:01Z

Hey @wookayin,

Implementing ordering efficiently can get very tricky if you consider multi-stage pipelines containing transformations like filter and flat_map. The current implementation of ordered is pessimistic and has to wait for all the elements to come in before yielding.

I think the example from the ordered documentation should be able for people to get started, but would be happy to improve if you give some feedback.

cgarciae · 2020-07-16T13:29:22Z

I don't agree that stages should order by default since its a slower operation. We can optimize ordered for simple cases and add ordered shortcut flag to map.

What do you think?

gtadamson · 2020-09-17T15:53:40Z

I agree with @cgarciae that ordering by default would be undesirable if it slowed down the speed of the full pypeln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make ordered as the default behavior #54

Make ordered as the default behavior #54

wookayin commented Jul 16, 2020 •

edited

Loading

cgarciae commented Jul 16, 2020

cgarciae commented Jul 16, 2020 •

edited

Loading

gtadamson commented Sep 17, 2020

Make ordered as the default behavior #54

Make ordered as the default behavior #54

Comments

wookayin commented Jul 16, 2020 • edited Loading

cgarciae commented Jul 16, 2020

cgarciae commented Jul 16, 2020 • edited Loading

gtadamson commented Sep 17, 2020

wookayin commented Jul 16, 2020 •

edited

Loading

cgarciae commented Jul 16, 2020 •

edited

Loading