We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
4a3156d
CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938) Co-authored-by: Iwan Kawrakow <[email protected]>