0.2.9 broke the bos/eos/sys handling for chat sequences #800

earonesty · 2023-10-07T13:15:15Z

recent changes (0.2.7 -> 0.2.9.) to the chat framework broke things (this is vicuna 7b)

first call always returns the prompt:

[INST] <<SYS>> You are a helpful assistant. <|endoftext|>

second call works:

Once upon a time in a faraway land, there lived a silly frog named Freddy who loved to hop around on lily pads and croak loudly.

third and continuing calls get odder, and show the bos tokens, etc:

[INST] <<SYS>>
You are a helpful assistant.
<</SYS>>



The silly frog hops around on the lily pad, croaking loudly.

trying different versions.

0.2.7 fixes it
0.2.9 breaks it
0.2.11 still broken

its a token handling issue. gonna try to figure out what broke between 0.2.7 and 0.2.9

looks like it was this:

"+- Add configurable chat formats by @abetlen in #711"

The text was updated successfully, but these errors were encountered:

earonesty · 2023-10-23T15:33:39Z

verified that this PR does not fully fix the issue, leaving most users locked to version 0.2.7

#808

earonesty · 2023-11-02T16:01:06Z

easy to reproduce, just specify a longer chat conversation. at least 3 user and 2 assistant messages result in weird results

abetlen · 2023-11-02T21:25:56Z

@earonesty just confirming that #850 didn't resolve this?

earonesty · 2023-11-03T13:50:02Z

correct I tried that branch and while it fixes the first request it does not fix subsequent requests

It's really about subsequent requests with a persistent running instance

earonesty · 2023-11-03T13:50:13Z

I suspect something about the vectors not getting cleared between requests or something is going on

earonesty · 2023-11-03T19:41:57Z

tried again today with v 0.2.13:

$ python example.py
This is an instruction to write a sentence about a silly frog in the next message.
(venv)
erik@LAPTOP-1K8B9NQ8 MINGW64 ~/chainer (main)
$ python example.py
You have been asked to write a description of an absurd scenario, and the only mental image you can conjure is that of a silly-looking frog. It jumps around in your mind, its skin green and
bumpy like a piece of broccoli, and it hops along with a goofy grin on its face. You wonder how such a creature would behave if it were to come to life. Maybe it would croak out a silly joke
 or do a funny dance. Or perhaps it would just sit there, looking completely bewildered by the world around it.
(venv)
erik@LAPTOP-1K8B9NQ8 MINGW64 ~/chainer (main)
$ python example.py
[INST]Write one paragraph about why the frog is silly. [/INST]

you can see that it starts hallucinating system messages and prompts, instead of generating responses to the "write one sentence about a silly frog". this is vicuna. version 0.2.7 just works every time.

earonesty · 2023-11-03T19:51:45Z

hmm, looks like maybe this is a problem a little deeper. i can see the engine is formatting the input correctly, and each request looks normal

<<SYS>>
You are a helpful assistant.
<</SYS>>



[INST]Write one sentence about a silly frog. [/INST]

it's only in subsequent requests that it starts getting odd. tried disabling caching, didn't help. eventually it starts spitting out system prompts as completions.

it's like it's chaining my requests, instead of starting over.

look at the output of the 3rd request.... two prompts. and each is progressive.

[INST]Write another sentence that is completely unrelated to the first sentence. [/INST]
[INST]Write a third sentence that connects the two sentences in a strange way. [/INST]

earonesty · 2023-11-03T20:34:42Z

yes, that PR and the other fixed it.

you now need to specify the chat_format correctly ... it won't guess anymore!

earonesty changed the title ~~used use vicuna and server.py with no problem, but with a recent update here's a series of chats~~ used use vicuna and server.py with no problem, but with a recent update here's a series of chats 0.2.11. Oct 7, 2023

earonesty changed the title ~~used use vicuna and server.py with no problem, but with a recent update here's a series of chats 0.2.11.~~ used use vicuna and server.py with no problem, but with a recent update here's a series of chats Oct 7, 2023

earonesty changed the title ~~used use vicuna and server.py with no problem, but with a recent update here's a series of chats~~ used use vicuna and server.py with no problem, but with a recent update it broke the bos/eos/sys handling Oct 9, 2023

teleprint-me mentioned this issue Oct 9, 2023

Repeated greeting in same chat session #801

Closed

earonesty changed the title ~~used use vicuna and server.py with no problem, but with a recent update it broke the bos/eos/sys handling~~ 0.2.9 broke the bos/eos/sys handling for chat sequences Oct 9, 2023

antoine-lizee mentioned this issue Oct 30, 2023

fix: tokenization of special characters: #850

Merged

earonesty closed this as completed Nov 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.2.9 broke the bos/eos/sys handling for chat sequences #800

0.2.9 broke the bos/eos/sys handling for chat sequences #800

earonesty commented Oct 7, 2023 •

edited

Loading

earonesty commented Oct 23, 2023 •

edited

Loading

earonesty commented Nov 2, 2023

abetlen commented Nov 2, 2023

earonesty commented Nov 3, 2023

earonesty commented Nov 3, 2023

earonesty commented Nov 3, 2023

earonesty commented Nov 3, 2023 •

edited

Loading

earonesty commented Nov 3, 2023 •

edited

Loading

0.2.9 broke the bos/eos/sys handling for chat sequences #800

0.2.9 broke the bos/eos/sys handling for chat sequences #800

Comments

earonesty commented Oct 7, 2023 • edited Loading

earonesty commented Oct 23, 2023 • edited Loading

earonesty commented Nov 2, 2023

abetlen commented Nov 2, 2023

earonesty commented Nov 3, 2023

earonesty commented Nov 3, 2023

earonesty commented Nov 3, 2023

earonesty commented Nov 3, 2023 • edited Loading

earonesty commented Nov 3, 2023 • edited Loading

earonesty commented Oct 7, 2023 •

edited

Loading

earonesty commented Oct 23, 2023 •

edited

Loading

earonesty commented Nov 3, 2023 •

edited

Loading

earonesty commented Nov 3, 2023 •

edited

Loading