Allows interweaving of arbitrary kinds of 'attention' layers, like sliding window, reuse prev layer kv cache etc. #6535
pr-cpu.yaml
on: pull_request
Matrix: pytest-cpu
Coverage Results / coverage
Annotations
4 errors
cpu-2.3.1 / pytest-cpu
Canceling since a higher priority waiting request for 'PR CPU tests-1299' exists
cpu-2.3.1 / pytest-cpu
The operation was canceled.
cpu-2.3.0 / pytest-cpu
Canceling since a higher priority waiting request for 'PR CPU tests-1299' exists
cpu-2.3.0 / pytest-cpu
The operation was canceled.