Repro here: pytorch/pytorch#134560
The original flex attention blog post mentions that dynamic shapes work, so should I be implementing this differently? My use case is document packing, so the batch size varies and the block mask is recomputed at each step (a sketch of the pattern is below). Unfortunately this is blocking me from switching to flex attention; thanks in advance.
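For reference, a minimal sketch of the pattern I'm describing, using `torch.nn.attention.flex_attention` (PyTorch 2.5+). The shapes and the `doc_ids` tensor are illustrative only, not the exact code from the linked repro:

```python
import torch
from torch.nn.attention.flex_attention import flex_attention, create_block_mask

# Compile with dynamic shapes, since batch size / sequence length vary per step.
flex_attention = torch.compile(flex_attention, dynamic=True)

B, H, S, D = 2, 8, 1024, 64  # illustrative batch, heads, seq len, head dim
q = torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)
k = torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)
v = torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)

# Document packing: each token carries a document id, and attention is
# restricted to tokens belonging to the same document.
doc_ids = torch.randint(0, 4, (S,), device="cuda").sort().values

def document_mask(b, h, q_idx, kv_idx):
    return doc_ids[q_idx] == doc_ids[kv_idx]

# The block mask is rebuilt whenever the packing (and hence doc_ids) changes,
# i.e. at every step.
block_mask = create_block_mask(document_mask, B=None, H=None,
                               Q_LEN=S, KV_LEN=S, device="cuda")

out = flex_attention(q, k, v, block_mask=block_mask)
```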