[TFLite] Enable int64 biases for int16 quantized operators #12042
Mousius merged 1 commit into apache:main
Conversation
Hello @leandron, I'm working along similar lines and have a model with conv2d_transpose; all of its other ops are already supported by your merged commit. I've made the same changes you made for conv2d_transpose in this patch, but the dequantize layer at the end is receiving int64 input, which isn't right. Am I missing something that needs to be changed? Thanks in advance!
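One way to confirm which dtype that dequantize input actually carries is to dump the tensor dtypes straight from the TFLite interpreter. A minimal sketch, assuming a 16x8-quantized model file (the path is hypothetical):

```python
import tensorflow as tf

# Hypothetical path; any 16x8-quantized .tflite file works here.
interpreter = tf.lite.Interpreter(model_path="model_int16.tflite")
interpreter.allocate_tensors()

# Print every tensor's name and storage dtype; under the 16x8 scheme,
# activations should be int16, weights int8, and biases int64.
for detail in interpreter.get_tensor_details():
    print(detail["name"], detail["dtype"])
```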
Force-pushed from 7eb64a3 to a940412
In TFLite, as of now, biases default to int64 when int16 quantisation is used. I have this model, which was created using the default int16 flow and can be used to check these internal data types with e.g. Netron.
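For reference, the "default int16 flow" here is TFLite's 16x8 post-training quantization mode. A minimal sketch of producing such a model with the TFLite converter, assuming a hypothetical saved model directory and input shape:

```python
import tensorflow as tf

def representative_dataset():
    # Hypothetical calibration data; the shape must match the model input.
    for _ in range(100):
        yield [tf.random.normal((1, 49, 10, 1))]

converter = tf.lite.TFLiteConverter.from_saved_model("saved_model_dir")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
# 16x8 mode: int16 activations, int8 weights; biases are emitted as int64.
converter.target_spec.supported_ops = [
    tf.lite.OpsSet.EXPERIMENTAL_TFLITE_BUILTINS_ACTIVATIONS_INT16_WEIGHTS_INT8
]

with open("model_int16.tflite", "wb") as f:
    f.write(converter.convert())
```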
Force-pushed from 5b67c87 to 1846d00
Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment. Generated by tvm-bot
This enables int64 biases for quantized fully connected, requantize and transpose convolution in TFLite networks. It builds on the existing int16 support in the TFLite frontend. A test case is added using an int16-quantized DS_CNN model.

Change-Id: I3006ee76f5037fb6f915818358c9aada2faf40bf
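For context, the shape of the frontend change is roughly the following. This is a simplified sketch, not the actual TVM code, and the helper name is made up:

```python
def _expected_bias_dtype(input_dtype):
    # TFLite stores biases as int64 when the 16x8 quantization scheme
    # (int16 activations, int8 weights) is in use, and int32 otherwise,
    # so the frontend must accept both when typing the bias tensor.
    return "int64" if input_dtype == "int16" else "int32"
```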
Force-pushed from 1846d00 to 98568b2
Please have another look.
asparkhi left a comment
Overall looks good to me. Do you know of any links to int16 specs similar to https://www.tensorflow.org/lite/performance/quantization_spec (int8 only)?
Sorry for the delay - thanks @leandron 😸
cc @areusch for reviews