[Quantization]: Update simulated_quantize to infer correct layout by f2013519 · Pull Request #14875 · apache/tvm

f2013519 · 2023-05-17T22:07:16Z

In our BYOC flow, we invoke the Convert Layout pass after simulated_quantize annotations are inserted in the IR.

However, after Convert Layout runs, we observe that layout transforms back to the original layout are inserted before the simulated_quantize annotations, because the compiler is currently unable to infer the layout of simulated_quantize from its input.

This PR allows propagation of the input layouts to avoid the insertion of additional layout transforms.
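To illustrate the effect described above, here is a minimal toy model in plain Python. It is not TVM code and does not use the TVM API: the `convert_layout` function, the `rules` table, and the op chain are all hypothetical names made up for this sketch. It assumes that an op with no layout inference rule falls back to the original layout, and that simulated_quantize can simply pass its input layout through.

```python
from typing import Callable, Dict, List

ORIGINAL = "NCHW"  # layout the model was written in (assumed for this sketch)
DESIRED = "NHWC"   # layout requested from the conversion (assumed for this sketch)

# A rule maps the incoming layout to the layout the op will accept.
InferRule = Callable[[str], str]


def convert_layout(ops: List[str], rules: Dict[str, InferRule]) -> List[str]:
    """Toy layout conversion over a linear chain of op names (hypothetical)."""
    out: List[str] = []
    current = ORIGINAL  # layout of the tensor flowing through the chain
    for op in ops:
        rule = rules.get(op)
        # An op without a rule is assumed to require the original layout.
        wanted = rule(current) if rule else ORIGINAL
        if wanted != current:
            out.append(f"layout_transform({current}->{wanted})")
            current = wanted
        out.append(op)
    # Restore the original layout at the end of the chain.
    if current != ORIGINAL:
        out.append(f"layout_transform({current}->{ORIGINAL})")
    return out


def conv_rule(_incoming: str) -> str:
    # conv2d is the op being converted to the desired layout.
    return DESIRED


def passthrough(incoming: str) -> str:
    # Layout-preserving op: accept whatever layout the input already has.
    return incoming


chain = ["conv2d", "simulated_quantize", "conv2d"]

# Without a rule for simulated_quantize, extra transforms surround it.
before = convert_layout(chain, {"conv2d": conv_rule})

# With a pass-through rule, the input layout propagates and no extra
# transforms are needed around simulated_quantize.
after = convert_layout(
    chain, {"conv2d": conv_rule, "simulated_quantize": passthrough}
)

print(before)
print(after)
```

In this toy model, `before` contains a transform back to the original layout immediately ahead of `simulated_quantize`, while `after` keeps only the transforms at the two ends of the chain.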

tvm-bot · 2023-05-17T22:07:19Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

No users to tag found in teams: quantization (see #10317 for details)

Generated by tvm-bot

f2013519 · 2023-05-18T04:49:08Z

cc: @junrushao @masahi @vinx13

[Quantization]: Update simulated_quantize to infer correct layout

9854f3f

masahi approved these changes May 18, 2023

masahi merged commit 28e9801 into apache:main May 18, 2023

ysh329 mentioned this pull request Jul 12, 2023

[Release] v0.13.0 Release Candidate Notes #15295

Closed

masahi merged 1 commit into apache:main from f2013519:main