Skip to content

Scalar Quantization (SQ) & Product Quantization (PQ) #2

@swar09

Description

@swar09

Uses full f32 . Memory usage is high (4 bytes per dimension).

Fix: Compress vectors to u8 or i8 (Quantization) to reduce memory usage by 4x and speed up distance commputations.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions