Support Dynamically Quantized Convolutions

We wish to have some initial support for Dynamically Quantized Convolutions. 

Let's first write a test to drive development of the feature. Let's add a test here first:
https://github.com/pytorch/executorch/blob/e9c2315aacaa36fbd5aa8fe060d798fdc3f08695/backends/xnnpack/test/ops/test_conv2d.py#L4

let's just do 2d convolutions for now. Take a look at how we test dqlinears in general: https://github.com/pytorch/executorch/blob/e9c2315aacaa36fbd5aa8fe060d798fdc3f08695/backends/xnnpack/test/ops/test_linear.py#L327

And let's try to add a test here. Since we're adding quantizer support, we should make sure that after 
```
.quantize()
.export()
```
we should check that a choose_q_param node is in the graph. Now, after we've added this test, when we run it with 
`python -m unittest backends.xnnpack.test.ops.test_conv2d....` It should fail because it can't find the choose_q_params node. Let's first start by enabling the quantizer to properly annotate convolutions.

https://github.com/pytorch/executorch/blob/e9c2315aacaa36fbd5aa8fe060d798fdc3f08695/backends/xnnpack/quantizer/xnnpack_quantizer.py#L266

Since we're first starting with conv2d, we should only annotate dynamically quantized convs if they are 2d. We can add a check that the len(outputpadding) == 2 somewhere here:
https://github.com/pytorch/executorch/blob/main/backends/xnnpack/quantizer/xnnpack_quantizer_utils.py#L295

Now that we have it annotated, it should bass through the test that's checking for the choose_q_param. Now we just need to update our partitioner to allow DynamicallyQuantizedConvolutions:

https://github.com/pytorch/executorch/blob/e9c2315aacaa36fbd5aa8fe060d798fdc3f08695/backends/xnnpack/partition/config/gemm_configs.py#L396

Again it would be nice to check in our constraints that if we detect a dynamically quantized convolution, and it is 1d, then we don't partition. After this the test should be passing. There may be some more lingering issues with the wiring, if that's the case feel free to reach out in the discord group:

https://discord.com/channels/1334270993966825602/1336777807509979188



cc @digantdesai @cbilgin

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Dynamically Quantized Convolutions #9021

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Support Dynamically Quantized Convolutions #9021

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions