Add support for chat templates #408

xenova · 2023-11-21T01:02:18Z

This PR adds support for chat templates, making it possible to generate well-formatted prompts in a generic fashion. This will also be useful to users of other libraries, like TGI, as this means they can generating prompts on the client-side (link to some discussion).

Since the python library's implementation uses Jinja, it required a minimalistic JavaScript reimplementation of the templating engine. Special thanks to Tyler Laceby for his amazing "Guide to Interpreters" tutorial series, which provided the basis for this implementation.

Example: Applying a chat template to a chat history

import { AutoTokenizer } from '@xenova/transformers';

const tokenizer = await AutoTokenizer.from_pretrained("mistralai/Mistral-7B-Instruct-v0.1");

const chat = [
  { "role": "user", "content": "Hello, how are you?" },
  { "role": "assistant", "content": "I'm doing great. How can I help you today?" },
  { "role": "user", "content": "I'd like to show off how chat templating works!" },
]

const text = await tokenizer.apply_chat_template(chat, { tokenize: false });
// "<s>[INST] Hello, how are you? [/INST]I'm doing great. How can I help you today?</s> [INST] I'd like to show off how chat templating works! [/INST]"

const input_ids = await tokenizer.apply_chat_template(chat, { tokenize: true, return_tensor: false });
// [1, 733, 16289, 28793, 22557, 28725, 910, 460, 368, 28804, 733, 28748, 16289, 28793, 28737, 28742, 28719, 2548, 1598, 28723, 1602, 541, 315, 1316, 368, 3154, 28804, 2, 28705, 733, 16289, 28793, 315, 28742, 28715, 737, 298, 1347, 805, 910, 10706, 5752, 1077, 3791, 28808, 733, 28748, 16289, 28793]

cc @OlivierDehaene (original requester of the feature)

TODOs

Add support for built-in string functions (e.g., .strip())
Add support for logical operators
Add all loop variables (like loop.last)
Add full support for nested for-loops (currently, the loop index is reused)
Auto-generate unit tests
Add support for array slicing (e.g., messages[1:])
Do trim_blocks=True, lstrip_blocks=True (needed by HuggingFaceH4/zephyr-7b-beta)
Add test cases for invalid templates

i.e., non Jinja statements/expressions

…lates (#352) This PR introduces the `@huggingface/jinja` library, which is a minimalistic JavaScript implementation of the Jinja templating engine, specifically designed for parsing ML chat templates. Although it was [originally created](huggingface/transformers.js#408) for (and integrated into) transformers.js, it became clear that others can use this functionality too, without the overhead of the transformers.js library. **Example usage:** Loading a `tokenizer_config.json` from the HF hub and render a list of messages ```js import { Template } from "@huggingface/templates"; import { downloadFile } from "@huggingface/hub"; const config = await (await downloadFile({ repo: "mistralai/Mistral-7B-Instruct-v0.1", path: "tokenizer_config.json" })).json(); const chat = [ { "role": "user", "content": "Hello, how are you?" }, { "role": "assistant", "content": "I'm doing great. How can I help you today?" }, { "role": "user", "content": "I'd like to show off how chat templating works!" }, ]; const template = new Template(config.chat_template); const result = template.render({ messages: chat, bos_token: config.bos_token, eos_token: config.eos_token, }); // "<s>[INST] Hello, how are you? [/INST]I'm doing great. How can I help you today?</s> [INST] I'd like to show off how chat templating works! [/INST]" ``` --------- Co-authored-by: coyotte508 <[email protected]>

HuggingFaceDocBuilderDev · 2023-12-15T21:14:13Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Example from https://discuss.huggingface.co/t/issue-with-llama-2-chat-template-and-out-of-date-documentation/61645/3

xenova added 30 commits November 20, 2023 23:07

Add basic support for chat templates

1697c3d

Cleanup

7878ded

JSDoc improvements

03389ac

Support conversion of user-defined functions

d91e022

Cleanup

466d1e4

Fix function creation

449a027

Add unit tests for templates

d7700a0

Cleanup

ee5af8d

Merge branch 'main' into chat-templates

5f37eeb

Improve JSDoc

6927136

Add missing return types

3b08827

Add chat templates docs to table of contents

ffab125

Add support for logical negation

c5629b4

Fix nested logical negation

aabe4be

Add unit tests for logical operators

5f4d7af

Add loop variables

c3a6f08

Add support for RuntimeValue built-in functions

1bbf882

Add unit tests for string instance methods

a5fafe8

Fix conversion of normal function to FunctionValue

6559f81

Update object method unit tests

468e7df

Save chat template to tokenizer_config.json during conversion

abaf579

Fix raise_exception error

be49ef6

Add != operator for booleans

40dfca4

Remember to increment loop index

4445766

Cleanup for loop evaluator

6be2463

Use is helper function

d4a37ae

Add support for text nodes

6d82622

i.e., non Jinja statements/expressions

Add auto-generated templating tests

eb88df1

Update unit tests

7fc877a

Remove unused function

c41550e

xenova added 6 commits December 14, 2023 21:32

Delete templates.test.js

68ac91c

Move Jinja functionality to @huggingface/jinja

e650835

Fix template cache type

0e6b945

Update chat template unit tests

27762f1

Update @huggingface/jinja version

4ca26ff

Merge branch 'main' into chat-templates

cd8ab6f

xenova added 21 commits December 16, 2023 02:28

Fix default llama2 system prompt usage

d9b63ae

Add unit test for llama2 w/o chat template set

ec4d0c6

Update jinja version

6560d68

Update jinja version

d9a9171

Add unit test for user-defined chat templates

147dc78

Example from https://discuss.huggingface.co/t/issue-with-llama-2-chat-template-and-out-of-date-documentation/61645/3

Add AddedToken for improved tokenization

3ee65c6

Add example usage for chat templates

149194b

Add 'first' Metaspace pretokenizer prepend scheme

bf9ec96

Formatting

f5c6edb

Update wav2vec2 converter special tokens whitespace split

ec5e287

Fix Metaspace pretokenizer split criteria

7a3b3aa

Update inputs of PreTokenizerSequence

073ec93

Improve Metaspace pretokenizer

6b5e064

Update llama tokenizer tests

a231620

Improve handling of legacy llama tokenizer

30a45ac

Re-enable SPM tests

c2c10d3

Add static tokenizer test cases

4ddd1c8

Add llama2 static tests

2958984

Allow user to override legacy tokenizer behaviour in .from_pretrained

e4f5cb1

Add legacy tokenizer unit tests

56af977

Bump jinja version to 0.1.0

56820b3

xenova merged commit d4f7cd5 into main Dec 18, 2023
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for chat templates #408

Add support for chat templates #408

xenova commented Nov 21, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Dec 15, 2023

Add support for chat templates #408

Add support for chat templates #408

Conversation

xenova commented Nov 21, 2023 • edited Loading

TODOs

HuggingFaceDocBuilderDev commented Dec 15, 2023

xenova commented Nov 21, 2023 •

edited

Loading