Skip to content

Add support for Grouped Query Attention on Llama Model #80

Add support for Grouped Query Attention on Llama Model

Add support for Grouped Query Attention on Llama Model #80