
disable_exllamav2_for_quantization #1482

Merged
1 commit merged into huggingface:main on Oct 25, 2023

Conversation

@SunMarc (Member) commented on Oct 24, 2023

What does this do?

This PR removes the possibility to quantize a model with exllamav2, as we are not able to save the weights: they are somehow reordered.

@SunMarc requested a review from @fxmarty on Oct 24, 2023, 20:01
@SunMarc changed the title from disable_exllamav2 to disable_exllamav2_for_quantization on Oct 24, 2023
@fxmarty (Contributor) left a comment


LGTM

optimum/gptq/quantizer.py
@SunMarc merged commit 313e1bd into huggingface:main on Oct 25, 2023
48 of 52 checks passed
Labels: None yet
Projects: None yet
2 participants