Feature request
Enable sdpa for RoBERTa and XLM-RoBERTa models
Motivation
BERT, which is architecturally similar to RoBERTa and XLM-RoBERTa, already supports sdpa, but RoBERTa and XLM-RoBERTa do not yet. Enabling it for these models should be straightforward.
Our applications need the latency reduction that sdpa provides. The performance advantage for BERT has already been shown in #28802.
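For context, sdpa here means PyTorch's fused `torch.nn.functional.scaled_dot_product_attention`. The snippet below is only a minimal sketch, assuming PyTorch 2.0 or later: it checks that the fused call matches a hand-written "eager" attention on random tensors. The helper name `eager_attention` and the tensor shapes are made up for illustration and are not taken from the Transformers code.

```python
import math

import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical shapes: (batch, heads, sequence length, head dimension).
batch, heads, seq_len, head_dim = 2, 4, 8, 16
q = torch.randn(batch, heads, seq_len, head_dim)
k = torch.randn(batch, heads, seq_len, head_dim)
v = torch.randn(batch, heads, seq_len, head_dim)


def eager_attention(q, k, v):
    """Hand-written attention: softmax(Q K^T / sqrt(d)) V."""
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    weights = torch.softmax(scores, dim=-1)
    return weights @ v


eager_out = eager_attention(q, k, v)

# Fused kernel; no mask, no dropout, non-causal, default 1/sqrt(d) scaling.
sdpa_out = F.scaled_dot_product_attention(q, k, v)

# Both paths should agree up to floating-point tolerance.
print(torch.allclose(eager_out, sdpa_out, atol=1e-5))
```

The two paths compute the same result; the fused call can dispatch to optimized kernels, which is where the latency reduction would come from. In Transformers, models that support it are selected by passing `attn_implementation="sdpa"` to `from_pretrained`, and the request is for RoBERTa and XLM-RoBERTa to accept the same option.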
Your contribution
I will submit a PR.