Skip to content

[Neuron][Kernel] NKI-based flash-attention kernel with paged KV cache#11277

Open
liangfu wants to merge 1 commit intovllm-project:mainfrom liangfu:nki-flash-attn

Commits

Commits on Jan 9, 2025