Add sdpa for Beit #34941
Conversation
Glad to see you in another sdpa-PR @OmarManzoor! Thanks for working on the implementation 👍 Overall looking great, just a few notes:
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Hey, I'm getting the following values for the benchmark:
BeitForImageClassification
| Image batch size | Eager (s/iter) | Eager CI, % | SDPA (s/iter) | SDPA CI, % | SDPA speedup |
|---|---|---|---|---|---|
| 1 | 0.013 | ±0.2% | 0.011 | ±0.5% | 1.122 |
| 4 | 0.013 | ±0.2% | 0.011 | ±0.1% | 1.122 |
| 16 | 0.026 | ±0.1% | 0.021 | ±0.5% | 1.232 |
| 32 | 0.048 | ±0.8% | 0.039 | ±0.3% | 1.252 |
with the following env:
Python version: 3.10.12 (main, Nov 6 2024, 20:22:13) [GCC 11.4.0]
Transformers version: 4.47.0.dev0
Torch version: 2.5.0+cu118
GPU: NVIDIA A10G
Here is the code, can you run it with your env to check? Also, my code might not be the best way to benchmark, so I'd appreciate it if you shared yours!
Your script might be better than what I am using, but nevertheless here is the script I used for inference.
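(The script itself is not reproduced in this thread. A minimal sketch of the kind of eager-vs-SDPA inference benchmark being discussed might look like the following; the checkpoint name, batch sizes, and iteration counts are illustrative assumptions, not the exact script used here.)

```python
# Hypothetical sketch of an eager vs. SDPA inference benchmark for BEiT.
# The checkpoint, batch sizes, and iteration counts are assumptions.
import time

import torch
from transformers import BeitForImageClassification

CHECKPOINT = "microsoft/beit-base-patch16-224"
DEVICE = "cuda" if torch.cuda.is_available() else "cpu"


def benchmark(attn_implementation: str, batch_size: int, n_iters: int = 50) -> float:
    model = BeitForImageClassification.from_pretrained(
        CHECKPOINT, attn_implementation=attn_implementation
    ).to(DEVICE).eval()
    pixel_values = torch.randn(batch_size, 3, 224, 224, device=DEVICE)
    with torch.no_grad():
        for _ in range(5):  # warmup
            model(pixel_values=pixel_values)
        if DEVICE == "cuda":
            torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(n_iters):
            model(pixel_values=pixel_values)
        if DEVICE == "cuda":
            torch.cuda.synchronize()
    return (time.perf_counter() - start) / n_iters


for batch_size in (1, 4, 16, 32):
    eager = benchmark("eager", batch_size)
    sdpa = benchmark("sdpa", batch_size)
    print(f"batch={batch_size}: eager={eager:.4f}s sdpa={sdpa:.4f}s speedup={eager / sdpa:.3f}x")
```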
Here are the results using your script:

BeitForImageClassification

Environment:
Python version: 3.10.14 (main, Jul 23 2024, 15:53:02) [GCC 9.4.0]
Transformers version: 4.47.0.dev0
Torch version: 2.5.1+cu124
GPU: NVIDIA GeForce RTX 2060 SUPER
Ok, looks good. Can you update the documentation benchmarks then? Maybe it's worth adding memory stats to my script and rerunning it.
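(One hedged way to add the memory stats suggested above, not taken from the actual script, is to reset and read PyTorch's CUDA peak-memory counters around each timed run.)

```python
# Illustrative addition for tracking peak GPU memory per configuration;
# assumes a CUDA device and the timed inference loop from the benchmark above.
import torch

torch.cuda.reset_peak_memory_stats()
# ... run the timed inference loop here ...
torch.cuda.synchronize()
peak_mb = torch.cuda.max_memory_allocated() / 1024**2
print(f"peak memory: {peak_mb:.1f} MB")
```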
Okay, I think we can use your inference benchmarks, but for the training part I think we can keep the current benchmarks? Also, did you have a look at the script I shared - is there anything that is not quite correct in it?
@qubvel Do the slow tests still need to be run?
Thanks for the ping! Looking great to me, just one comment
@ArthurZucker please review when you have bandwidth!
@OmarManzoor: thank you for the rebase and for adding the thresholds path for XPU. I verified it on my side against upstream PyTorch XPU - both added tests work for me.
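(For readers unfamiliar with the thresholds mentioned above: the general pattern is to key numeric tolerances by device type so XPU can use its own bounds. The sketch below is purely illustrative; the dictionary name, values, and helper are made up and are not the actual test code in this PR.)

```python
# Hypothetical illustration of per-device test tolerances; values are made up.
import torch

ATOL_BY_DEVICE = {"cuda": 1e-4, "xpu": 1e-3, "cpu": 1e-4}


def assert_close_for_device(actual, expected, device_type: str):
    # Pick a looser tolerance on devices that need it (e.g. XPU), default otherwise.
    atol = ATOL_BY_DEVICE.get(device_type, 1e-4)
    torch.testing.assert_close(actual, expected, atol=atol, rtol=atol)
```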
@dvrogozh Thank you for verifying.
@ArthurZucker Can this be merged?
Hey all! Sorry for the delay, we are in the middle of a huge refactoring in #35235, which is why I wanted to wait a bit, but good work should be rewarded, so let's merge this! Thanks for being patient 🤗
Thank you!
What does this PR do?
Related to #28005
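(For context, once merged the SDPA path should be selectable the same way as for other models that already support it. A minimal usage sketch follows; the checkpoint name is an assumption.)

```python
# Minimal sketch: load BEiT with the SDPA attention implementation added in this PR.
import torch
from transformers import BeitForImageClassification

model = BeitForImageClassification.from_pretrained(
    "microsoft/beit-base-patch16-224", attn_implementation="sdpa"
)
model.eval()
with torch.no_grad():
    logits = model(pixel_values=torch.randn(1, 3, 224, 224)).logits
```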
Who can review?
@qubvel