Support chat response format #2046

drbh · 2024-06-10T19:04:02Z

This PR adds support for the response_format parameter in the chat endpoint. This callers users to constrain chat results to a specified grammar.

Important

This feature is mutually exclusive with tools, as they both constrain the output via a grammar internally.

reference in openai api: https://platform.openai.com/docs/api-reference/chat/create#chat-create-response_format

example usage

from openai import OpenAI
from pydantic import BaseModel, conint
from typing import List
import json


class Weather(BaseModel):
    location: str
    days: List[str]
    weather: List[str]
    temperature: List[str]


client = OpenAI(
    base_url="http://localhost:3000/v1",
    api_key="_",
)


chat_completion = client.chat.completions.create(
    model="tgi",
    messages=[
        {
            "role": "system",
            "content": f"[Date: Mon Jun 10th]\nRespond to the users questions and answer them in the following format: {Weather.schema()}",
        },
        {
            "role": "user",
            "content": "What's the weather like the next 3 days in Brooklyn, NY?",
        },
    ],
    seed=42,
    max_tokens=500,
    response_format={"type": "json_object", "value": Weather.schema()},
)

json_response = chat_completion.choices[0].message.content
json_response = json.loads(json_response)
print(json.dumps(json_response, indent=2))
# {
#   "days": [
#     "Today",
#     "Tomorrow",
#     "Day After Tomorrow"
#   ],
#   "location": "Brooklyn, NY",
#   "temperature": [
#     "65",
#     "72",
#     "78"
#   ],
#   "weather": [
#     "Mostly Cloudy",
#     "Partly Cloudy",
#     "Sunny"
#   ]
# }

aymeric-roucher · 2024-06-11T07:44:17Z

Thank you @drbh , this will be a great addition! 🔥

Is it possible to use a regex as the grammar, like response_format={"type": "regex", "value": "..."} ?

drbh · 2024-06-11T13:50:05Z

@aymeric-roucher yes regex should work as well! The response_format is using the same mechanics as grammar in the generate endpoint (so both will accept the "type"s even if we add more going forward)

* feat: support response_format in chat * fix: adjust typos * fix: add trufflehog lint

drbh added 3 commits June 10, 2024 18:33

feat: support response_format in chat

bcf2b29

fix: adjust typos

8c24b12

fix: add trufflehog lint

4ce8494

drbh mentioned this pull request Jun 10, 2024

Add response_format to chat/completions #1966

Closed

OlivierDehaene approved these changes Jun 11, 2024

View reviewed changes

drbh merged commit 376a0b7 into main Jun 11, 2024
6 checks passed

drbh deleted the support-chat-response-format branch June 11, 2024 14:44

Narsil mentioned this pull request Jun 24, 2024

Fixing AMD CI #2109

Closed

5 tasks

This was referenced Jul 1, 2024

Add grammar to chat/completions endpoint / Messages API #1858

Closed

Agents use grammar huggingface/transformers#31735

Merged

yuanwu2017 pushed a commit to yuanwu2017/tgi-gaudi that referenced this pull request Sep 26, 2024

Support chat response format (huggingface#2046)

99c9474

* feat: support response_format in chat * fix: adjust typos * fix: add trufflehog lint

sidharthrajaram mentioned this pull request Oct 23, 2024

Support OpenAI Structured Output by adding json_schema as an alias for JSON Grammar #2680

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support chat response format #2046

Support chat response format #2046

drbh commented Jun 10, 2024

aymeric-roucher commented Jun 11, 2024

drbh commented Jun 11, 2024

Support chat response format #2046

Support chat response format #2046

Conversation

drbh commented Jun 10, 2024

aymeric-roucher commented Jun 11, 2024

drbh commented Jun 11, 2024