@micheal-o
Contributor

What changes were proposed in this pull request?

This PR extracts the partitioning key from each operator's state store key, so that operator state can be repartitioned with the same key that the shuffle uses to partition streaming state. The state store key typically includes the partition key, so the extraction depends on the operator's state key schema: each operator determines how it stores data in the state store.

This ensures that repartitioning is consistent with how the state is partitioned during query execution.
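
As a rough illustration of the idea (not the code in this PR), the sketch below models a state store key as a tuple and uses a per-operator extractor to recover the partition-key fields before choosing a target partition. The operator names, the key layouts, `partition_key`, and `target_partition` are all hypothetical, and the CRC32 hash is only a stand-in for whatever hash the shuffle actually uses.

```python
import zlib
from typing import Callable, Dict, Tuple

StateKey = Tuple  # a state store key, modeled here as a tuple of fields

# Hypothetical per-operator extractors. Each one returns the partition-key
# fields of that operator's state key. The layouts are illustrative
# assumptions, not the real operator schemas.
EXTRACTORS: Dict[str, Callable[[StateKey], Tuple]] = {
    # Assumed layout: the state key is exactly the grouping key.
    "aggregate": lambda key: key,
    # Assumed layout: (partition key fields..., index); drop the trailing index.
    "join_key_with_index": lambda key: key[:-1],
}


def partition_key(operator: str, state_key: StateKey) -> Tuple:
    """Extract the partition key according to the operator's key layout."""
    return EXTRACTORS[operator](state_key)


def target_partition(pkey: Tuple, num_partitions: int) -> int:
    """Map a partition key to a partition with a deterministic stand-in hash."""
    # Stand-in only: this is NOT the hash used by the real shuffle.
    return zlib.crc32(repr(pkey).encode("utf-8")) % num_partitions
```

Under these assumptions, two state keys that differ only in their non-partition fields, such as `("user-1", 0)` and `("user-1", 5)` for the hypothetical `join_key_with_index` layout, land in the same target partition, which is the consistency property described above.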

Why are the changes needed?

To support offline state repartitioning.

Does this PR introduce any user-facing change?

No

How was this patch tested?

New tests for each operator

Was this patch authored or co-authored using generative AI tooling?

No

Contributor

@anishshri-db anishshri-db left a comment

lgtm pending nits
