Fix arg in bettertransformer llama attention #1421
Merged
What does this PR do?
This PR fixes the integration of llama in bettertransformer. Since the PR adding flashattention2 to llama models was merged, a new argument `padding_mask` was silently introduced in the forward of the attention module. This breaks the `llama_forward` in bettertransformer.

@younesbelkada We may have to apply the same fix to every model that supports flashattention2. LMK if I should just do one PR covering all supported and planned-to-support models?
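
For context, here is a minimal, self-contained sketch of the breakage and the fix. Only the `llama_forward` name and the `padding_mask` keyword come from this PR; the function bodies and the call site below are hypothetical stand-ins, not the actual optimum/transformers code:

```python
# Pre-fix override: its signature is unaware of the new keyword that the
# flashattention2 PR added to the upstream attention forward.
def llama_forward_old(hidden_states, attention_mask=None):
    return hidden_states

# Post-fix override: it accepts (and can simply ignore) `padding_mask`,
# plus any future keywords, so an upstream signature change no longer
# breaks the bettertransformer path.
def llama_forward_fixed(hidden_states, attention_mask=None, padding_mask=None, **kwargs):
    return hidden_states

# transformers now calls the attention forward with the extra keyword:
try:
    llama_forward_old(hidden_states=0, padding_mask=None)
except TypeError as e:
    print("old override breaks:", e)  # unexpected keyword argument 'padding_mask'

llama_forward_fixed(hidden_states=0, padding_mask=None)  # works
```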