The README shows that when sending a request to the API server, you should use this format:
```
curl http://localhost:18888/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openchat_3.5",
    "messages": [{"role": "user", "content": "You are a large language model named OpenChat. Write a poem to describe yourself"}]
  }'
```
It also says that when using transformers for inference, the conversation should be formatted like this:
```
GPT4 Correct User: Hello<|end_of_turn|>GPT4 Correct Assistant: Hi<|end_of_turn|>GPT4 Correct User: How are you today?<|end_of_turn|>GPT4 Correct Assistant:
```
It seems that the `tokenize_conversations` method in `conversation_template.py` only appends the `<|end_of_turn|>` token to messages, but does not add the "GPT4 Correct User" and "GPT4 Correct Assistant" prefixes (it only uses the `user` and `assistant` roles from the messages). As a result, is the same input processed differently by the API server and by inference with transformers? If so, would a model that I tested via inference with transformers behave differently once it is deployed with the API server?
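One way to compare the two paths is to print the prompt string that transformers builds from the role-based messages and check whether the "GPT4 Correct" prefixes appear. A minimal sketch, assuming the Hugging Face model id `openchat/openchat_3.5` and that its tokenizer config ships a chat template (both are assumptions, not taken from this issue):

```python
# Minimal sketch: inspect the prompt that transformers constructs from
# role-based messages. Assumes the model id "openchat/openchat_3.5" and
# that its tokenizer includes a chat template (assumptions, not confirmed
# by this issue).
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("openchat/openchat_3.5")

messages = [
    {"role": "user", "content": "Hello"},
    {"role": "assistant", "content": "Hi"},
    {"role": "user", "content": "How are you today?"},
]

# tokenize=False returns the raw prompt string instead of token ids;
# add_generation_prompt=True appends the trailing assistant prefix.
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
# If the template matches the README, this should print:
# GPT4 Correct User: Hello<|end_of_turn|>GPT4 Correct Assistant: Hi<|end_of_turn|>GPT4 Correct User: How are you today?<|end_of_turn|>GPT4 Correct Assistant:
```

If this output matches the README format, the role prefixes are being applied by the chat template rather than inside `tokenize_conversations` itself, which would explain why the method appears to add only the end-of-turn token.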