Quantize before import and conversion to Linalg during CompiledModule.__new__() #211

Closed
renxida wants to merge 1 commit

Conversation

renxida (Contributor) commented Nov 30, 2023

Passes that must run on torch IR before conversion to linalg can now be supplied as pre_import_passes to the constructor of StateUpdateModule, or to any other class that inherits from CompiledModule.

For example:

    pre_import_passes = []
    if quantization == "int4" and compile_to != "linalg":
        # Rewrite group-quantized matmuls while the module is still torch IR.
        from shark_turbine.transforms.quantization import mm_group_quant
        pre_import_passes.append(mm_group_quant.MMGroupQuantRewriterPass)
    inst = StateUpdateModule(
        context=Context(), import_to=import_to, pre_import_passes=pre_import_passes
    )
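
For context, here is a minimal, self-contained sketch of the pattern this enables: the constructor applies user-supplied passes to torch-level IR before any lowering to linalg. Every name below (ToyModuleOp, ToyQuantPass, build_module) is a hypothetical stand-in for illustration, not a shark_turbine API.

    class ToyModuleOp:
        """Hypothetical stand-in for a module holding torch-dialect IR."""
        def __init__(self):
            self.ops = ["torch.aten.mm"]

    class ToyQuantPass:
        """Hypothetical pre-import pass: rewrites torch IR in place."""
        def __init__(self, module_op):
            self.module_op = module_op
        def run(self):
            self.module_op.ops = [op + ".group_quant" for op in self.module_op.ops]

    def build_module(pre_import_passes=(), import_to="linalg"):
        module_op = ToyModuleOp()            # 1. import the model to torch IR
        for pass_cls in pre_import_passes:   # 2. run each pass on torch IR
            pass_cls(module_op).run()
        if import_to == "linalg":            # 3. only then lower to linalg
            module_op.ops = ["linalg<" + op + ">" for op in module_op.ops]
        return module_op

    print(build_module(pre_import_passes=[ToyQuantPass]).ops)
    # prints: ['linalg<torch.aten.mm.group_quant>']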

renxida changed the title from "makes quantization happen before import and conversion to linalg to fix #183" to "Quantize before import and conversion to Linalg during CompiledModule.__new__()" on Nov 30, 2023
renxida (Contributor, Author) commented Nov 30, 2023

This fixes #183

IanNod (Contributor) commented Nov 30, 2023

Please run black on the files changed

IanNod linked an issue Nov 30, 2023 that may be closed by this pull request
renxida closed this May 1, 2024
Development

Successfully merging this pull request may close these issues.

Integrate quantization pass with CompiledModule