This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

Python Unit Test

[Neural Speed] Support continuous batching + beam search inference in LLAMA #321



Triggered via pull request February 29, 2024 09:02

zhentaoyu opened #145 (yzt/llama-batching)

Status: Cancelled

Total duration: 1m 6s


unit-test-llmruntime.yml

on: pull_request

Annotations

2 errors

Canceling since a higher priority waiting request for 'Python Unit Test-145' exists

The operation was canceled.