Commit

change device to cpu for quantization
HRashidi committed Oct 30, 2024
1 parent 464e9d9 commit 57565d4
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions aana/deployments/hqq_text_generation_deployment.py
@@ -118,6 +118,7 @@ async def apply_config(self, config: dict[str, Any]):
self.dtype = Dtype.BFLOAT16

if config_obj.quantize_on_fly:
    self.model_kwargs.pop("device_map", None)
    self.model = AutoModelForCausalLM.from_pretrained(
        self.model_id, torch_dtype=self.dtype.to_torch(), **self.model_kwargs
    )
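
Note on the change: the hunk drops any "device_map" entry from the model kwargs before calling from_pretrained when quantize_on_fly is set. With no device_map passed, Transformers normally leaves the weights on the CPU, which matches the commit title. The sketch below isolates that step so it can be run without loading a model. The helper name strip_device_map is hypothetical and not part of the repository, and unlike the commit (which mutates self.model_kwargs in place) it works on a copy.

```python
from typing import Any


def strip_device_map(model_kwargs: dict[str, Any], quantize_on_fly: bool) -> dict[str, Any]:
    """Return kwargs for from_pretrained, minus device_map when quantizing on the fly.

    Hypothetical helper for illustration only; the commit pops the key
    directly from self.model_kwargs instead of copying.
    """
    kwargs = dict(model_kwargs)  # shallow copy so the caller's dict is untouched
    if quantize_on_fly:
        # The None default makes pop a no-op when "device_map" is absent,
        # so configs that never set it do not raise KeyError.
        kwargs.pop("device_map", None)
    return kwargs


if __name__ == "__main__":
    cfg = {"device_map": "auto", "trust_remote_code": True}
    print(strip_device_map(cfg, quantize_on_fly=True))
    print(strip_device_map(cfg, quantize_on_fly=False))
```

Passing None as the second argument to pop is what keeps the one-line change safe for configurations that never specified a device_map.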