
lora : error message if new token is added in the adapter #9948

Merged
merged 1 commit into ggerganov:master on Oct 22, 2024

Conversation

ngxson
Collaborator

@ngxson ngxson commented Oct 18, 2024

Related to #9778

Explanation from TRL team member:

There can be situations where PEFT will automatically save the embedding layer when it detects that it has been changed (e.g. new tokens being added). However, this should save the full embedding layer, not lora_A and lora_B.

In llama.cpp, we simply can't support this case because it would break adapter hot-reload (we cannot switch lm_head or embd_tokens at runtime). Also, if multiple adapters each have their own set of additional tokens, we can't mix and match these adapters.
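
For reference, one rough way to tell whether an adapter ended up in this state is to look at the tensor names it saved: a clean LoRA contains only lora_A / lora_B (and lora_embedding_A / lora_embedding_B) weights, while an adapter with added tokens also carries full embed_tokens / lm_head tensors via modules_to_save. The sketch below is only an illustration, not the converter's actual code; the file name adapter_model.safetensors and the exact key patterns are assumptions that may vary across PEFT versions.

```python
# Rough sketch: detect whether a PEFT adapter contains full (non-LoRA) tensors,
# which is the case this PR now rejects with an explicit error message.
# Assumes the adapter is stored as adapter_model.safetensors; exact key names
# can vary between PEFT versions.
from safetensors import safe_open

def find_full_tensors(adapter_path: str) -> list[str]:
    suspicious = []
    with safe_open(adapter_path, framework="pt") as f:
        for name in f.keys():
            # A LoRA-only adapter should contain nothing but lora_A / lora_B
            # (plus optional lora_embedding_A / lora_embedding_B) weights.
            if ".lora_A." in name or ".lora_B." in name:
                continue
            if ".lora_embedding_A" in name or ".lora_embedding_B" in name:
                continue
            # Anything else (e.g. embed_tokens / lm_head saved via
            # modules_to_save) is a full tensor the converter cannot handle.
            suspicious.append(name)
    return suspicious

if __name__ == "__main__":
    bad = find_full_tensors("adapter_model.safetensors")
    if bad:
        print("Adapter contains full tensors (likely new tokens were added):")
        for n in bad:
            print("  ", n)
```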

Solution

When fine-tuning your model using TRL, please make sure (a minimal setup sketch follows this list):

  • Not to add any new tokens
  • Not to call setup_chat_format()
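
Below is a minimal sketch of a TRL fine-tuning setup that respects these constraints, so the resulting adapter contains only lora_A / lora_B tensors. The model name, dataset, and hyperparameters are placeholders, and some parameter names (e.g. processing_class vs. tokenizer) differ between TRL releases, so treat this as an assumption-laden example rather than a definitive recipe.

```python
# Sketch of an SFT setup compatible with llama.cpp LoRA export: the tokenizer
# is left untouched (no new tokens, no setup_chat_format), so PEFT saves only
# lora_A / lora_B tensors. Model name and dataset are placeholders.
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import SFTConfig, SFTTrainer

model_name = "meta-llama/Llama-3.2-1B"                      # placeholder
dataset = load_dataset("trl-lib/Capybara", split="train")   # placeholder

model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)
# Do NOT call setup_chat_format(model, tokenizer) and do NOT call
# tokenizer.add_tokens(...) / model.resize_token_embeddings(...): either one
# makes PEFT save the full embed_tokens / lm_head tensors into the adapter.

peft_config = LoraConfig(r=16, lora_alpha=32, task_type="CAUSAL_LM")

trainer = SFTTrainer(
    model=model,
    args=SFTConfig(output_dir="lora-out"),
    train_dataset=dataset,
    processing_class=tokenizer,   # older TRL versions call this `tokenizer=`
    peft_config=peft_config,
)
trainer.train()
trainer.save_model("lora-out")    # adapter can then go through convert_lora_to_gguf.py
```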

@github-actions github-actions bot added the python python script changes label Oct 18, 2024
@ngxson ngxson changed the title lora : warn user if new token is added in the adapter lora : error message if new token is added in the adapter Oct 18, 2024
@slaren
Collaborator

slaren commented Oct 18, 2024

In llama.cpp, we simply can't support this case because it will break adapter hot-reload

I think we can support this by loading the full tensor from the LoRA and replacing it in the model; it shouldn't be a problem. Keep a copy of the previous tensor pointer to restore it when unloading the LoRA, and reject applying another LoRA if it also has a full copy of the same tensor. Or better, just modify the LoRA-applying functions like llm_build_lora_mm to use the replacement tensor. Dealing with the changes to the tokenizer may be a more difficult problem, however.
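
To make the replace-and-restore idea concrete, here is a conceptual sketch in plain Python (not llama.cpp's actual C++ internals, and the class and method names are invented for illustration): an adapter that ships a full tensor replaces the model's tensor, the original is kept so it can be restored on unload, and a second adapter that wants to replace the same tensor is rejected.

```python
# Conceptual sketch of the scheme described above, using Python objects as
# stand-ins for ggml tensors. All names here are hypothetical.
class Model:
    def __init__(self, tensors: dict[str, object]):
        self.tensors = tensors  # name -> tensor
        # name -> (original tensor, id of the adapter that replaced it)
        self.replaced: dict[str, tuple[object, str]] = {}

    def apply_adapter(self, adapter_id: str, full_tensors: dict[str, object]) -> None:
        for name, tensor in full_tensors.items():
            if name in self.replaced:
                raise RuntimeError(
                    f"{name} is already replaced by adapter {self.replaced[name][1]}"
                )
            self.replaced[name] = (self.tensors[name], adapter_id)
            self.tensors[name] = tensor  # hot-swap in the adapter's full tensor

    def remove_adapter(self, adapter_id: str) -> None:
        for name in [n for n, (_, a) in self.replaced.items() if a == adapter_id]:
            original, _ = self.replaced.pop(name)
            self.tensors[name] = original  # restore the original tensor on unload
```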

@ngxson
Collaborator Author

ngxson commented Oct 18, 2024

In fact, I'm thinking of the case where each adapter has its own set of added tokens. This would be impossible to support, because each adapter would have its own definition of the added tokens.

In any case, I don't think users should add new tokens to a LoRA, since their embeddings will not be trained. In fact, these embeddings will be initialized to random vectors.
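
As a small illustration of that point (placeholder model name; how the new rows are initialized — randomly or from the mean of existing rows — depends on the transformers version), resizing the embedding matrix after adding a token only allocates and initializes the new rows; nothing trains them, and they exist only in that adapter's copy of embed_tokens / lm_head.

```python
# Adding a token and resizing the embeddings creates rows that are merely
# initialized, not trained, and not covered by the LoRA update.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-3.2-1B"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

old_vocab = model.get_input_embeddings().weight.shape[0]
tokenizer.add_tokens(["<my_new_token>"])
model.resize_token_embeddings(len(tokenizer))
new_vocab = model.get_input_embeddings().weight.shape[0]

print(f"embedding rows: {old_vocab} -> {new_vocab}")
```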

(I'm merging this PR once @Victoran0 confirms that removing setup_chat_format fixes the problem)

@Victoran0

Victoran0 commented Oct 19, 2024

Yes @ngxson, removing setup_chat_format resolved the error; the LoRA adapter's size is now less than 100 MB.
This commit is optimal as it tells the user where the problem is coming from and what to fix. Great job!

@ngxson ngxson merged commit c421ac0 into ggerganov:master Oct 22, 2024
9 checks passed
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024