Merge changes from randaller/llama-chat #4

Open · wants to merge 1 commit into main
Conversation

Honigmelone

Hey,

I noticed the default prompt in example-chat.py was quite different between your two repos. I have merged some more recent changes from https://github.com/randaller/llama-chat to get the interactive chat working in the CPU-only version.

I have not merged the model and the tokenizer yet. You might want to consider building on this and merging those as well, so the two repositories stay consistent.

@randaller (Owner)

@Honigmelone this will break all the other examples; llama-chat is now the primary repo, and this repo is deprecated

@Honigmelone (Author)

I see. Is it somehow possible to run llama-chat in CPU-only mode, or are you dropping that functionality?

@alaestor commented Mar 19, 2023

I haven't a clue what I'm doing and am just quickly messing around, but regarding llama-chat/llama/model.py: I changed use_gpu in def forward to False, and then changed all occurrences of .cuda() to .cpu() in Transformer's and Attention's inits. It just sorta... worked. Kind of. I assume it's tailored for GPU use, because it's slow as heck on CPU (going from llama-cpu at ~1 it/s to the bodged llama-chat at 6–8 s/it with 7B on my 7950X).
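
For reference, here's a minimal sketch of the kind of edit I mean; the class and argument names below are simplified stand-ins, not the actual llama/model.py code:

```python
# Sketch of moving a pre-allocated KV cache off the GPU.
# Simplified stand-in for the Attention init in llama/model.py.
import torch
import torch.nn as nn

class Attention(nn.Module):
    def __init__(self, dim: int, n_heads: int, max_batch: int, max_seq: int):
        super().__init__()
        self.head_dim = dim // n_heads
        self.wq = nn.Linear(dim, dim, bias=False)
        # Before (GPU-only): the caches were allocated with .cuda()
        # After (CPU bodge): allocate them on the CPU instead
        self.cache_k = torch.zeros(max_batch, max_seq, n_heads, self.head_dim).cpu()
        self.cache_v = torch.zeros(max_batch, max_seq, n_heads, self.head_dim).cpu()
```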

Hopefully proper CPU support will come to the main repo someday. For now I guess I'll just base my own personal experiments on this deprecated repo, or Frankenstein together some hybrid of the two.
