[bug] Constevaluating trainable nn.Parameter #929

pmarkovicTT · 2024-12-18T13:47:23Z

Description:
Implementing a LoRA layer with trainable `nn.Parameter`s raises `KeyError: 'gradient_lora1.b.consteval_graph.output'`.

Repro branch:
https://github.com/tenstorrent/tt-forge-fe/tree/pmarkovic/parameter-consteval-issue


Add test to make sure LoRA layer can be implemented and used in a full training pipeline.

Problem encountered

Hitting `KeyError: 'gradient_lora1.b.consteval_graph.output'` when implementing LoRA layer with nn.Parameters.

```
class LoraLayer(nn.Module):
    def __init__(self, input_size, output_size, rank=8, alpha=4, dtype=torch.float32):
        super(LoraLayer, self).__init__()
        self.a = nn.Parameter(torch.empty(input_size, rank, dtype=dtype), requires_grad=True)
        self.b = nn.Parameter(torch.zeros(rank, output_size, dtype=dtype), requires_grad=True)
        self.alpha = alpha / rank
        nn.init.normal_(self.a, mean=0, std=1)

    def forward(self, x):
        return self.alpha * (x @ self.a @ self.b)
```

Raised issue: #929
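For comparison, here is a minimal sketch of what the same layer does in plain eager PyTorch, with no Forge compilation involved. It does not reproduce or diagnose the `KeyError`; it only shows the baseline a compiled run would be expected to match. The helper name `eager_gradient_check` and the sizes used are assumptions made for illustration and are not taken from the repro branch.

```python
import torch
import torch.nn as nn


class LoraLayer(nn.Module):
    # Same layer as in the report, repeated so the sketch is self-contained.
    def __init__(self, input_size, output_size, rank=8, alpha=4, dtype=torch.float32):
        super().__init__()
        self.a = nn.Parameter(torch.empty(input_size, rank, dtype=dtype), requires_grad=True)
        self.b = nn.Parameter(torch.zeros(rank, output_size, dtype=dtype), requires_grad=True)
        self.alpha = alpha / rank
        nn.init.normal_(self.a, mean=0, std=1)

    def forward(self, x):
        return self.alpha * (x @ self.a @ self.b)


def eager_gradient_check(input_size=16, output_size=4, batch=2, seed=0):
    # Hypothetical helper: one forward/backward pass in eager mode,
    # returning the gradient tensor of every trainable parameter.
    torch.manual_seed(seed)
    layer = LoraLayer(input_size, output_size)
    x = torch.randn(batch, input_size)
    layer(x).sum().backward()
    return {name: p.grad for name, p in layer.named_parameters()}


grads = eager_gradient_check()
for name, grad in grads.items():
    # Report the shape and whether any entry of the gradient is nonzero.
    print(name, tuple(grad.shape), bool(torch.count_nonzero(grad) > 0))
```

In eager mode both `a` and `b` receive a gradient tensor. Because `b` is zero-initialized, the gradient of `a` is all zeros on the first step while the gradient of `b` is not, so `b` is the parameter that actually moves first. That is the same parameter named in the failing key `gradient_lora1.b.consteval_graph.output`.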

pmarkovicTT mentioned this issue Dec 18, 2024

MNIST lora layer test #913

Merged
