
Commit

Update docs/source/inference.mdx
Co-authored-by: Helena Kloosterman <[email protected]>
echarlaix and helena-intel authored Mar 13, 2024
1 parent 027c370 commit afc23d0
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/source/inference.mdx
@@ -99,7 +99,7 @@ tokenizer.save_pretrained(save_directory)

### Weight-only quantization

- You can also apply fp16, 8-bit or 4-bit weight quantization on the linear and embedding layers when exporting your model with the CLI by setting `--weight-format` to respectively `fp16`, `int8` or `int4`:
+ You can also apply fp16, 8-bit or 4-bit weight compression on the linear and embedding layers when exporting your model with the CLI by setting `--weight-format` to respectively `fp16`, `int8` or `int4`:

```bash
optimum-cli export openvino --model gpt2 --weight-format int8 ov_model
```
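
As a usage note for the command shown in the diff: the exported directory can be loaded back for inference with optimum-intel's `OVModelForCausalLM`. A minimal sketch, assuming the `ov_model` output directory produced by the command above, and loading the tokenizer separately from the Hub in case the export did not save it alongside the model:

```python
# Minimal sketch: run the int8 weight-compressed model exported by the
# `optimum-cli export openvino` command above. The `ov_model` directory
# name comes from that command; the tokenizer is fetched from the Hub.
from transformers import AutoTokenizer
from optimum.intel import OVModelForCausalLM

model = OVModelForCausalLM.from_pretrained("ov_model")
tokenizer = AutoTokenizer.from_pretrained("gpt2")

inputs = tokenizer("The weather today is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```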
