Skip to content

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #6908

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #6908

Annotations

1 error

build (latest, ubuntu-20.04)

failed Jan 7, 2025 in 44m 4s