Skip to content

b3150

Compare
Choose a tag to compare
@github-actions github-actions released this 14 Jun 17:37
76d66ee
CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (#7921)

* CUDA: faster q2_K, q3_K MMQ + int8 tensor cores

* try CI fix

* try CI fix

* try CI fix

* fix data race

* rever q2_K precision related changes