
[QST] int8 Conv2D for volta V100 #1964

Open
sycz00 opened this issue Nov 26, 2024 · 0 comments
sycz00 commented Nov 26, 2024

What is your question?

Hey, I use the following script to emit the CUDA kernel for the NVIDIA V100:

import torch
import random

import cutlass

dtype = torch.int32
type_A = torch.int8
type_B = torch.int8
type_C = torch.int32
type_D = torch.int32


plan = cutlass.Conv2dFprop(
    element=dtype,
    element_input=type_A,
    element_weight=type_B,
    element_C=type_C,
    element_output=type_D,
    element_accumulator=type_D,
)
op = plan.construct()
conv_layer = cutlass.emit.pytorch(op, name='conv_layer', cc=plan.cc, sourcedir='conv', jit=True)

I tried a couple of datatype configurations. Float works just fine, but I want the inputs and weights to be int8. That should be possible on Volta via the dp4a instruction, right?
Any ideas? Thanks in advance!
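For context, dp4a (the CUDA `__dp4a` intrinsic) computes a 4-element int8 dot product accumulated into an int32 value, which is what makes int8 GEMM/conv feasible on pre-Tensor-Core GPUs like the V100. A minimal pure-Python model of its semantics (the function `dp4a_model` below is an illustrative sketch, not the actual hardware instruction or any CUTLASS API):

```python
import numpy as np

def dp4a_model(a_bytes, b_bytes, c):
    """Model of the CUDA __dp4a intrinsic: 4-way int8 dot product
    accumulated into an int32 value. Illustrative only."""
    assert len(a_bytes) == 4 and len(b_bytes) == 4
    acc = np.int32(c)
    for a, b in zip(a_bytes, b_bytes):
        # Each int8 pair is multiplied and accumulated in int32,
        # so no intermediate int8 overflow occurs.
        acc += np.int32(a) * np.int32(b)
    return int(acc)

# Example: 1*5 + (-2)*6 + 3*7 + 4*(-8) + 10 = -8
print(dp4a_model([1, -2, 3, 4], [5, 6, 7, -8], 10))  # -8
```

This is why the accumulator/output type in the script above is int32 while A and B are int8.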
