This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

[Neural Speed] Support continuous batching + beam search inference in LLAMA #84

Triggered via pull request March 4, 2024 06:11
Status Success
Total duration 2m 35s

windows-test.yml

on: pull_request
Windows-Binary-Test
2m 24s