feat: _do_annotate_conv_bn + _annotate_softmax #1

Merged
merged 2 commits into from
Feb 17, 2025
Conversation


@Dayof Dayof commented Feb 7, 2025

Motivation

Work around some torch limitations

_get_aten_graph_module_for_pattern receives the _conv_bn function and calls torch's capture_pre_autograd_graph with it, but inside that torch function there is the following assert (torch/_export/__init__.py:147):

```python
assert isinstance(f, torch.nn.Module), "Expected an nn.Module instance."
```

This PR wraps the callable in a torch.nn.Module so that the assert no longer fires.
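
For illustration, a minimal sketch of this kind of workaround, assuming the pattern callable is wrapped before being handed to capture_pre_autograd_graph; the wrapper class name here is hypothetical, not necessarily what the PR uses:

```python
import torch


class _CallableWrapper(torch.nn.Module):
    # Hypothetical wrapper: adapts a plain callable (e.g. _conv_bn) so it
    # passes the isinstance(f, torch.nn.Module) assert inside
    # capture_pre_autograd_graph.
    def __init__(self, fn):
        super().__init__()
        self.fn = fn

    def forward(self, *args, **kwargs):
        return self.fn(*args, **kwargs)


# Usage sketch: pass the wrapped pattern instead of the bare function.
# pattern_gm = capture_pre_autograd_graph(_CallableWrapper(_conv_bn), example_inputs)
```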

Add softmax quantized op

ai-edge-torch currently does not include softmax among its statically quantized ops, so this PR adds _annotate_softmax, making it possible to convert a fully int8 model that has a softmax as its output layer.
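
As a rough sketch of what such an annotation pass can look like, here is a hedged example in the style of the _annotate_* helpers in torch's XNNPACK quantizer utilities. The specific op overload (torch.ops.aten.softmax.int) and the qspec helper imports are assumptions; the actual implementation in this PR may differ:

```python
from typing import Callable, List, Optional

import torch
from torch.ao.quantization.quantizer import QuantizationAnnotation
from torch.ao.quantization.quantizer.xnnpack_quantizer_utils import (
    QuantizationConfig,
    get_input_act_qspec,
    get_output_act_qspec,
)
from torch.fx import Node


def _annotate_softmax(
    gm: torch.fx.GraphModule,
    quantization_config: Optional[QuantizationConfig],
    filter_fn: Optional[Callable[[Node], bool]] = None,
) -> Optional[List[List[Node]]]:
    # Sketch: tag every aten softmax node with input/output quantization
    # specs so the converter can emit an int8 softmax instead of falling
    # back to float.
    annotated_partitions = []
    for node in gm.graph.nodes:
        # Assumed overload; decomposed graphs may use a different target.
        if node.op != "call_function" or node.target != torch.ops.aten.softmax.int:
            continue
        if filter_fn is not None and not filter_fn(node):
            continue
        input_act = node.args[0]
        node.meta["quantization_annotation"] = QuantizationAnnotation(
            input_qspec_map={input_act: get_input_act_qspec(quantization_config)},
            output_qspec=get_output_act_qspec(quantization_config),
            _annotated=True,
        )
        annotated_partitions.append([node])
    return annotated_partitions
```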

@Dayof Dayof changed the title fix: _do_annotate_conv_bn fix: _do_annotate_conv_bn + _annotate_softmax Feb 14, 2025
@Dayof Dayof changed the title fix: _do_annotate_conv_bn + _annotate_softmax feat: _do_annotate_conv_bn + _annotate_softmax Feb 14, 2025
@Dayof Dayof merged commit a6a828a into main Feb 17, 2025
@Dayof Dayof deleted the fix/qat branch February 17, 2025 16:43