[BUG] Wide schema read with operations falling back to the CPU show serious query slowdown #12195
Labels
? - Needs Triage
Need team to review and classify
bug
Something isn't working
performance
A performance related task/issue
Describe the bug
Queries with wide schemas show slowdown for basic SQL operations. This is especially worse if there are fallbacks along the way.
Steps/Code to reproduce bug
Write the test data:
Run the query on this data:
On my local machine with single RTX A5000 and spark in local mode the runtimes are as follows:
Expected behavior
Have a comparable if not better run time. Support performance with wide schemas . This case covers a non -shuffle case but also relates to #11127 which deals with wide schemas during shuffle.
Environment details (please complete the following information)
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: