Commit
Update base for Update on "Update QAT READMEs using new APIs"
Add references to new QAT APIs including `quantize_`,
`FakeQuantizedX`, and the new embedding Quantizers and
ComposableQATQuantizer. Also link to new QAT + LoRA recipe
in torchtune.

[ghstack-poisoned]
andrewor14 committed Jan 10, 2025
1 parent ec94797 commit 507a9e6
Showing 1 changed file with 1 addition and 4 deletions.
5 changes: 1 addition & 4 deletions torchao/quantization/qat/linear.py
@@ -107,10 +107,7 @@ def forward(self, x: torch.Tensor) -> torch.Tensor:
 
     def to_linear(self) -> torch.nn.Linear:
         new_linear = torch.nn.Linear(
-            self.in_features,
-            self.out_features,
-            self.bias,
-            device=self.weight.device
+            self.in_features, self.out_features, self.bias, device=self.weight.device
         )
         # In distributed training, the model may be instantiated
         # on the meta device, in which case there is no need to