🚨🚨 TextGenerationPipeline: rely on the tokenizer default kwargs #31747

gante · 2024-07-02T13:10:51Z

What does this PR do?

Issue

TextGenerationPipeline.preprocess, where tokenization happens, has defined a few optional kwargs whose default value does not match the tokenizer defaults. This was causing:

A mismatch between the tokens resulting from tokenizer calls and the tokens pipeline was seeing, using default arguments on both
The issue seen in this thread -- add_special_tokens had to be manually set to True in the pipeline, despite it already being the default in the tokenizer.

(2 is a consequence of 1 :) )

This PR

🚨🚨 This PR changes the code to rely on the tokenizer's defaults when these flags are unset. This means some models using TextGenerationPipeline previously did not add a <bos> by default, which (negatively) impacted their performance. In practice, this is a breaking change.

Example of a script changed as a result of this PR:

from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
import torch

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-9b-it")
model = AutoModelForCausalLM.from_pretrained("google/gemma-2-9b-it", torch_dtype=torch.bfloat16, device_map="auto")
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
print(pipe("Foo bar"))

HuggingFaceDocBuilderDev · 2024-07-02T13:29:46Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

LysandreJik

Thanks a lot @gante ! Let's keep it a bit on main to ensure it doesn't break too much the users' setups before releasing it.

pcuenca · 2024-07-02T16:11:12Z

Very nice, I agree it makes sense to make the behaviours consistent!

Corresponding PR to microsoft/DeepSpeed-MII#510 that is made due to changes from transformers introduced in huggingface/transformers#31747

rely on the tokenizer default kwargs

c672cd7

gante requested a review from LysandreJik July 2, 2024 13:11

fix a few tests

cad6a40

gante changed the title ~~Pipeline: rely on the tokenizer default kwargs~~ 🚨🚨 TextGenerationPipeline: rely on the tokenizer default kwargs Jul 2, 2024

LysandreJik approved these changes Jul 2, 2024

View reviewed changes

LysandreJik merged commit 82486e5 into huggingface:main Jul 2, 2024
21 checks passed

gante mentioned this pull request Jul 2, 2024

Gemma 2: Update slow tests #31759

Merged

This was referenced Jul 25, 2024

Support latest changes in transformers microsoft/DeepSpeed-MII#512

Open

Pin transformers version for MII tests microsoft/DeepSpeed#5807

Merged

loadams added a commit to microsoft/DeepSpeed that referenced this pull request Jul 30, 2024

Pin transformers version for MII tests (#5807)

5e8a27a

Corresponding PR to microsoft/DeepSpeed-MII#510 that is made due to changes from transformers introduced in huggingface/transformers#31747

loadams mentioned this pull request Oct 29, 2024

Update MII tests to support transformers latest microsoft/DeepSpeed#6686

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🚨🚨 TextGenerationPipeline: rely on the tokenizer default kwargs #31747

🚨🚨 TextGenerationPipeline: rely on the tokenizer default kwargs #31747

gante commented Jul 2, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Jul 2, 2024

LysandreJik left a comment

pcuenca commented Jul 2, 2024

🚨🚨 TextGenerationPipeline: rely on the tokenizer default kwargs #31747

🚨🚨 TextGenerationPipeline: rely on the tokenizer default kwargs #31747

Conversation

gante commented Jul 2, 2024 • edited Loading

What does this PR do?

Issue

This PR

HuggingFaceDocBuilderDev commented Jul 2, 2024

LysandreJik left a comment

Choose a reason for hiding this comment

pcuenca commented Jul 2, 2024

gante commented Jul 2, 2024 •

edited

Loading