You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for the work, I reproduced the training, but merging lora modules got an error.
if q_len % group_size > 0:
raise ValueError("q_len %d should be divisible by group size %d."%(q_len, group_size))
I find modeling_llama.py in LongQLoRA-Llama2-7b-8k you released is different from the configuration in training. So which base model needs to be selected to merge.
Thanks
The text was updated successfully, but these errors were encountered:
Thanks for the work, I reproduced the training, but merging lora modules got an error.
if q_len % group_size > 0:
raise ValueError("q_len %d should be divisible by group size %d."%(q_len, group_size))
I find modeling_llama.py in LongQLoRA-Llama2-7b-8k you released is different from the configuration in training. So which base model needs to be selected to merge.
Thanks
The text was updated successfully, but these errors were encountered: