Skip to content

Commit

Permalink
save q4 quantization
Browse files Browse the repository at this point in the history
  • Loading branch information
xenova committed Jul 4, 2024
1 parent 9128651 commit 83cbb21
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion scripts/quantize.py
Original file line number Diff line number Diff line change
Expand Up @@ -169,7 +169,7 @@ def main():

q4_model = quantize_q4(
model,
save_path=None,
save_path=None if mode == QuantMode.Q4F16 else save_path,
block_size=block_size,
is_symmetric=quantization_args.is_symmetric,
accuracy_level=quantization_args.accuracy_level,
Expand Down

0 comments on commit 83cbb21

Please sign in to comment.