Skip to content

b2830

Compare
Choose a tag to compare
@github-actions github-actions released this 09 May 13:55
a743d76
CUDA: generalize FP16 fattn vec kernel (#7061)

* CUDA: generalize FP16 fattn vec kernel

* disable unsupported head sizes for AMD in test

* try AMD fix

* fix batch size 2-8

* partially revert changes