about QAT model #42

Open · lxzatwowone1 opened this issue Sep 25, 2024 · 1 comment

Comments

@lxzatwowone1
I have a question: why does converting to a DLA model require a QAT model? Can I use the original ONNX model instead?
Thanks! I am a DL beginner.

@liuanqi-libra7 (Collaborator) commented Nov 29, 2024

Hi @lxzatwowone1

  • DLA's INT8 compute throughput is much higher than its floating-point throughput, so we recommend running the model at INT8 precision on DLA.
  • Implicit (post-training) quantization in TensorRT typically gives lower accuracy than quantization-aware training with PyTorch Quantization, so we recommend quantizing the model with PyTorch Quantization; a sketch of the typical flow is below.
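
For reference, a minimal sketch of the usual NVIDIA pytorch-quantization QAT-to-ONNX flow (this is not this repo's exact script; the model class, weight path, and input shape are placeholders):

```python
import torch
from pytorch_quantization import quant_modules
from pytorch_quantization import nn as quant_nn

# Monkey-patch torch.nn layers with quantized equivalents that insert
# fake-quantization (Q/DQ) nodes. Must run before the model is created.
quant_modules.initialize()

model = MyModel()  # placeholder: your own network definition
model.load_state_dict(torch.load("fp32_weights.pth"))  # placeholder path

# ... calibrate the quantizer ranges on a small dataset, then fine-tune
# (QAT) with your normal training loop for a few epochs ...

# Export Q/DQ nodes as ONNX QuantizeLinear/DequantizeLinear ops so that
# TensorRT performs explicit quantization when building the engine.
quant_nn.TensorQuantizer.use_fb_fake_quant = True
dummy = torch.randn(1, 3, 224, 224)  # placeholder input shape
torch.onnx.export(model.eval(), dummy, "model_qat.onnx", opset_version=13)
```

The exported ONNX already carries the learned scales, so the INT8 DLA engine can be built directly, e.g. with `trtexec --onnx=model_qat.onnx --int8 --useDLACore=0 --allowGPUFallback`.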
