Skip to content

Llama: device/type-invariant RoPE sin/cos computation, eager attention matches original implementation #35625

Llama: device/type-invariant RoPE sin/cos computation, eager attention matches original implementation

Llama: device/type-invariant RoPE sin/cos computation, eager attention matches original implementation #35625