Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suport sdpa for RoBERTa and XLM-RoBERTa models #31752

Open
kiszk opened this issue Jul 2, 2024 · 0 comments
Open

Suport sdpa for RoBERTa and XLM-RoBERTa models #31752

kiszk opened this issue Jul 2, 2024 · 0 comments
Labels
Feature request Request for a new feature

Comments

@kiszk
Copy link
Contributor

kiszk commented Jul 2, 2024

Feature request

Enable sdpa for RoBERTa and XLM-RoBERTa models

Motivation

While BERT, which is similar to RoBERTa and XLM-RoBERTa, support sdpa, RoBERTa and XLM-RoBERTa do not support sdpa yet. This enablement is straight-forward.

Our applications need latency reduction by sdpa. The performance advantage in BERT is already shown at #28802.

Your contribution

I will submit a PR.

@kiszk kiszk added the Feature request Request for a new feature label Jul 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature request Request for a new feature
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant