Skip to content

Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu#2815

Open
sywangyi wants to merge 8 commits intohuggingface:mainfrom sywangyi:flash_decoding

Commits

Commits on Nov 25, 2024

Commits on Dec 2, 2024

Commits on Dec 10, 2024

Commits on Dec 19, 2024

Commits on Dec 20, 2024

Commits on Jan 9, 2025