-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE] Spark and Dask Take 1 row without sorting optimization #269
Comments
@kvnkho @goodwanghan is this issue taken? Would love to jump on this. Any pointers would be awesome |
Hi @meehawk , good to see you in the Fugue repo this time! Thanks for the interest. I still remember you from the Prefect repo. This issue is not taken and you can take a stab at it. Here is some initial guidance. There is a You can find the implementations of
So this specific issue is an optimization for the Spark code. But to know the issue, you need to know the The important thing to know for This issue proposes special handling to use the native Spark This may be a lot to digest, feel free to ping us in the Slack channel at |
Is your feature request related to a problem? Please describe.
Look at here
If taking just one row with our sorting, we may use
GROUP BY
andFIRST
to solve this problem, it can be a lot faster. Let's add this special handling.The text was updated successfully, but these errors were encountered: