
Questions about int4 quantization #179

Open

gaikwadrahul8 opened this issue Nov 28, 2024 · 2 comments

gaikwadrahul8 (Contributor) commented:
Hi all.
Since this is an inquiry rather than a bug report, I have not used the issue template.

Looking at the kernel-side code of TFLite, I saw that int4 filters are supported in several op kernels: conv2d, depthwise-conv2d, and fully-connected.

Could you tell me whether there are plans to support int4 quantization in the TFLite converter, or to support int4 for each op's inputs as well as its filters? If so, what milestones do you have?

Thank you :)
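
For reference, a minimal sketch of the existing int8 post-training full-integer quantization path through the converter, which is the flow an int4 option would presumably extend. The saved-model path, input shape, and calibration generator below are placeholders, not part of any real model:

```python
import numpy as np
import tensorflow as tf

# Standard post-training full-integer (int8) quantization via the
# TFLite converter. "saved_model_dir" is a placeholder path.
converter = tf.lite.TFLiteConverter.from_saved_model("saved_model_dir")
converter.optimizations = [tf.lite.Optimize.DEFAULT]

def representative_dataset():
    # Placeholder calibration data; the shape must match the model's input.
    for _ in range(100):
        yield [np.random.rand(1, 224, 224, 3).astype(np.float32)]

converter.representative_dataset = representative_dataset

# Restrict conversion to the int8 builtin kernels. As of this issue there
# is no analogous public OpsSet or converter flag that selects int4.
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8

tflite_model = converter.convert()
```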

gaikwadrahul8 (Contributor, Author) commented:

This issue, originally reported by @0-chan-kor, has been moved to this dedicated LiteRT repository to improve issue tracking and prioritization. To ensure continuity, we have created this new issue on your behalf.

We appreciate your understanding and look forward to your continued involvement.

pkgoogle commented Dec 2, 2024:

Original Issue: tensorflow/tensorflow#60125
