This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

[Neural Speed] Support continuous batching + beam search inference in LLAMA #57


Triggered via pull request March 1, 2024 02:37
Status Success
Total duration 2m 31s

windows-test.yml

on: pull_request
Windows-Binary-Test
2m 21s