Incremental OPTIMIZE #256

@hamersaw

Description

Currently, calling OPTIMIZE on a table starts a background process that identifies all undersized fragments, computes a compaction plan that combines them, and submits separate compaction tasks to workers. When a table has a massive number of compactable fragments, this process can be quite expensive and the OPTIMIZE can take a long time.

To improve this, we could bound the number of fragments that are rewritten by a single OPTIMIZE command. Each job would then be smaller and complete faster, incrementally "checkpointing" the overall compaction.
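
To make the proposal concrete, here is a rough sketch of what a bounded planning step could look like. Everything in it is hypothetical and for illustration only: the `Fragment` type, the `plan_incremental_compaction` function, and the `target_size` and `max_fragments` parameters are assumptions, not existing APIs, and the grouping is a simple greedy pass rather than whatever the current planner does.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    # Hypothetical stand-in for a table fragment; not a real API type.
    id: int
    size_bytes: int


def plan_incremental_compaction(fragments, target_size, max_fragments):
    """Plan one bounded compaction pass.

    Returns (groups, remaining), where `groups` is a list of fragment lists
    to rewrite together and `remaining` is the number of undersized fragments
    that were not considered in this pass because of the bound.
    """
    # Only fragments smaller than the target are candidates for compaction.
    undersized = [f for f in fragments if f.size_bytes < target_size]

    # Bound the work: consider at most `max_fragments` candidates per call.
    selected = undersized[:max_fragments]

    # Greedily pack consecutive candidates until adding one more would
    # push the group past the target size.
    groups, current, current_size = [], [], 0
    for frag in selected:
        if current and current_size + frag.size_bytes > target_size:
            groups.append(current)
            current, current_size = [], 0
        current.append(frag)
        current_size += frag.size_bytes
    if current:
        groups.append(current)

    # A group with a single fragment would be rewritten for no benefit.
    groups = [g for g in groups if len(g) > 1]

    remaining = len(undersized) - len(selected)
    return groups, remaining


fragments = [Fragment(i, s) for i, s in enumerate([10, 20, 30, 100, 40, 5])]
groups, remaining = plan_incremental_compaction(
    fragments, target_size=64, max_fragments=3
)
print([[f.id for f in g] for g in groups], remaining)
```

A caller could invoke OPTIMIZE repeatedly until `remaining` reaches zero, with each call committing its own rewrite so that progress is preserved if a later call fails.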
