Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

daniel-bogdoll · 2024-11-22T16:32:08Z

Option to set 'non_blocking' for to(device) operation in BatchEncoding for performance improvements. Defaults to 'false', thus no behavioral changes.

What does this PR do?

This minor PR adds the non_blocking option to the to() function.

Previous: def to(self, device: Union[str, "torch.device"]) -> "BatchEncoding":
New: def to(self, device: Union[str, "torch.device"], non_blocking: bool = False) -> "BatchEncoding":

Since non_blocking defaults to 'False', this PR does not introduce behavioral changes.

I realized, when utilizing Zero Shot Object Detection models, that it was not possible to set this option, leading to sub-optimal performance during inference.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

vision models: @amyeroberts, @qubvel

… improvements. Defaults to 'false', thus no behavioral changes.

qubvel

Hi @daniel-bogdoll, thanks for adding this! It looks great to me. Do you think it might be worth extending the same option to BatchFeature to ensure consistent capabilities?

daniel-bogdoll · 2024-11-22T18:20:21Z

Thanks @qubvel, sure thing! Which tests would I need to run to make sure modifications in the to() function of BatchFeature get tested?

Just to make sure, I assume you refer to

transformers/src/transformers/feature_extraction_utils.py

Line 206 in 54be2d7

def to(self, *args, **kwargs) -> "BatchFeature":

?

qubvel · 2024-11-22T18:42:40Z

Yes, I refer to this one, but not sure it's properly tested anywhere, I was able to find only SequenceFeatureExtractionTestMixin

qubvel · 2024-11-22T18:44:35Z

Maybe we can do it as simple as

non_blocking = kwargs.get("non_blocking", False)
...
elif isinstance(v, torch.Tensor) and device is not None:
      new_data[k] = v.to(device=device, non_blocking=non_blocking)
...

daniel-bogdoll · 2024-11-22T18:48:06Z

That's how I would have tried it as well. But what about this block?

# Check if the args are a device or a dtype
        if device is None and len(args) > 0:
            # device should be always the first argument
            arg = args[0]
            if is_torch_dtype(arg):
                # The first argument is a dtype
                pass
            elif isinstance(arg, str) or is_torch_device(arg) or isinstance(arg, int):
                device = arg
            else:
                # it's something else
                raise ValueError(f"Attempting to cast a BatchFeature to type {str(arg)}. This is not supported.")

Here device is derived from args rather than kwargs. Should this be extended in some way to also consider deriving non_blocking? Not sure where or how this is used.

qubvel · 2024-11-22T18:57:15Z

Here device is derived from args rather than kwargs. Should this be extended in some way to also consider deriving non_blocking? Not sure where or how this is used.

I don't think so, maybe at some moment, it is worth refactoring this method for more explicit args and kwargs. For now, we can add a note in docstring that non_blocking should be passed as a keyword argument.

daniel-bogdoll · 2024-11-22T19:01:11Z

@qubvel Done! Thanks for the super-fast replies, was a pleasure! Tests fail now, though:

For the first one, as you stated here (#34826 (comment)), it does not seem to be related.

https://app.circleci.com/pipelines/github/huggingface/transformers/111324/workflows/3351b194-4b9e-4a17-876b-85360fc7ff01/jobs/1482124?utm_campaign=vcs-integration-link&utm_medium=referral&utm_source=github-checks-link&utm_content=summary

FAILED
tests/models/xlm_roberta_xl/test_modeling_xlm_roberta_xl.py::XLMRobertaXLModelTest::test_assisted_decoding_matches_greedy_search_1_same 
- AssertionError: False is not true

As the second one is a timeout issue, it also seems unrelated:

https://app.circleci.com/pipelines/github/huggingface/transformers/111324/workflows/3351b194-4b9e-4a17-876b-85360fc7ff01/jobs/1482127?utm_campaign=vcs-integration-link&utm_medium=referral&utm_source=github-checks-link&utm_content=summary

FAILED
tests/models/convbert/test_modeling_convbert.py::ConvBertModelTest::test_pipeline_fill_mask -
requests.exceptions.ReadTimeout: (ReadTimeoutError("HTTPSConnectionPool(host='huggingface.co', port=443):
Read timed out. (read timeout=10)"), '(Request ID: 04e3d1b8-11fc-4791-ba74-3d7d67a5f3f2)')

qubvel

Thanks for updates! Looks great, just a small suggestion

src/transformers/tokenization_utils_base.py

Co-authored-by: Pavel Iakubovskii <[email protected]>

qubvel · 2024-12-03T10:45:25Z

@ArthurZucker or @LysandreJik please review when you have bandwidth

ArthurZucker

Yeah sound super good!

ArthurZucker · 2024-12-05T15:09:26Z

src/transformers/tokenization_utils_base.py

@@ -799,12 +799,13 @@ def as_tensor(value, dtype=None):

        return self

-    def to(self, device: Union[str, "torch.device"]) -> "BatchEncoding":
+    def to(self, device: Union[str, "torch.device"], *, non_blocking: bool = False) -> "BatchEncoding":


why do we need * ?

@qubvel suggested this to enforce it as a keyword argument for future backwards compatability. All arguments after the * are forced to be passed as keyword arguments: #34883 (comment)

Yes, only device can be passed as a positional argument with * introduced. This way, we will prevent anyone from using batch_feature.to("cuda", True) instead of batch_feature.to("cuda", non_blocking=True). This would be useful in case we introduce more positional arguments in the future or need to change order, for example, with adding dtype.

Thanks for explaining, good decision @qubvel ! 🤗

Option to set 'non_blocking' for to(device) operation for performance…

88eddf8

… improvements. Defaults to 'false', thus no behavioral changes.

qubvel reviewed Nov 22, 2024

View reviewed changes

Enabling non_blocking in to() operation of BatchFeature.

08f5d4b

daniel-bogdoll changed the title ~~Option to set 'non_blocking' for to(device) operation in BatchEncoding~~ Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature Nov 22, 2024

Improved docstring on utilization of non_blocking

9f465d7

qubvel approved these changes Nov 25, 2024

View reviewed changes

src/transformers/tokenization_utils_base.py Outdated Show resolved Hide resolved

Force non_blocking as keyword argument

4e8e786

Co-authored-by: Pavel Iakubovskii <[email protected]>

qubvel requested a review from LysandreJik November 25, 2024 14:41

Merge branch 'main' into batch-encoding-with-non_blocking-option

2d2fe86

qubvel mentioned this pull request Dec 3, 2024

Fix case of nested tensors in BatchMixFeature #35063

Closed

1 task

ArthurZucker reviewed Dec 5, 2024

View reviewed changes

ArthurZucker approved these changes Dec 9, 2024

View reviewed changes

ArthurZucker merged commit de8a0b7 into huggingface:main Dec 9, 2024
20 of 22 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

daniel-bogdoll commented Nov 22, 2024

qubvel left a comment

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading

qubvel commented Nov 22, 2024

qubvel commented Nov 22, 2024 •

edited

Loading

daniel-bogdoll commented Nov 22, 2024

qubvel commented Nov 22, 2024

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading

qubvel left a comment

qubvel commented Dec 3, 2024

ArthurZucker left a comment

ArthurZucker Dec 5, 2024

daniel-bogdoll Dec 5, 2024 •

edited

Loading

qubvel Dec 5, 2024 •

edited

Loading

ArthurZucker Dec 9, 2024

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

Conversation

daniel-bogdoll commented Nov 22, 2024

What does this PR do?

Before submitting

Who can review?

qubvel left a comment

Choose a reason for hiding this comment

daniel-bogdoll commented Nov 22, 2024 • edited Loading

qubvel commented Nov 22, 2024

qubvel commented Nov 22, 2024 • edited Loading

daniel-bogdoll commented Nov 22, 2024

qubvel commented Nov 22, 2024

daniel-bogdoll commented Nov 22, 2024 • edited Loading

qubvel left a comment

Choose a reason for hiding this comment

qubvel commented Dec 3, 2024

ArthurZucker left a comment

Choose a reason for hiding this comment

ArthurZucker Dec 5, 2024

Choose a reason for hiding this comment

daniel-bogdoll Dec 5, 2024 • edited Loading

Choose a reason for hiding this comment

qubvel Dec 5, 2024 • edited Loading

Choose a reason for hiding this comment

ArthurZucker Dec 9, 2024

Choose a reason for hiding this comment

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading

qubvel commented Nov 22, 2024 •

edited

Loading

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading

daniel-bogdoll Dec 5, 2024 •

edited

Loading

qubvel Dec 5, 2024 •

edited

Loading