forked from Dao-AILab/flash-attention
Pull requests: ROCm/flash-attention
Added Benchmark for Rotary Decode Kernel + Performance Speed Up for Rotary Kernel (#102), opened Nov 22, 2024 by alexkranias-amd
GPUAI-1250 - Flash Attention v2.04 two modules layer_norm cannot be used fixed (#52), opened Apr 3, 2024 by xiaoxiangAMD
GPUAI-1250 - Flash Attention v2.04 module rotary cannot be used code fixed (#47), opened Mar 1, 2024 by xiaoxiangAMD