This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

CPP Graph Test

[Neural Speed] Support continuous batching + beam search inference in LLAMA #428

Triggered via pull request March 1, 2024 07:48

zhentaoyu

synchronize #145

yzt/llama-batching

Status Cancelled

Total duration 30s

Artifacts –

cpp-graph-test.yml

on: pull_request

Matrix: CPP-Graph-Workflow

Genreate-Report

Annotations

2 errors

CPP-Graph-Workflow (llama-2-7b-chat)

Canceling since a higher priority waiting request for 'CPP Graph Test-145' exists

CPP-Graph-Workflow (gptj-6b)

Canceling since a higher priority waiting request for 'CPP Graph Test-145' exists
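Both annotations are the message GitHub Actions emits when a run is cancelled by a workflow-level concurrency rule: a newer push to the same pull request queued a run in the same group, so this one was stopped after 30s. The contents of cpp-graph-test.yml are not shown here, so the fragment below is only a sketch of a configuration that would produce a group named 'CPP Graph Test-145'. It assumes the group is built from the workflow name and the pull request number; the real file may build it differently.

```yaml
# Hypothetical excerpt; the actual cpp-graph-test.yml may differ.
name: CPP Graph Test

on: pull_request

concurrency:
  # Assumed group key: workflow name + PR number, e.g. "CPP Graph Test-145".
  group: ${{ github.workflow }}-${{ github.event.pull_request.number }}
  # Cancel any in-progress run in the same group when a newer one is queued.
  cancel-in-progress: true
```

With a setting like this, each `synchronize` event on pull request #145 supersedes the previous run, which would explain why both matrix jobs (llama-2-7b-chat and gptj-6b) report the same cancellation rather than a test failure.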