Add performance statistics for speculative decoding #6147

Triggered via pull request December 26, 2024 06:04
Status: Failure
Total duration: 44m 45s

causal_lm_cpp.yml

on: pull_request
Matrix: cpp-beam_search_causal_lm-ubuntu
cpp-multinomial-greedy_causal_lm-ubuntu (13m 50s)
cpp-greedy_causal_lm-windows (44m 8s)
cpp-greedy_causal_lm-Qwen-7B-Chat (12m 9s)
cpp-beam_search_causal_lm-Qwen1_5-7B-Chat (33m 50s)
cpp-beam_search_causal_lm-Phi-2 (16m 20s)
cpp-beam_search_causal_lm-notus-7b-v1 (32m 16s)
cpp-speculative_decoding_lm-ubuntu (24m 15s)
cpp-prompt_lookup_decoding_lm-ubuntu (12m 23s)
cpp-Phi-1_5 (8m 18s)
cpp-greedy_causal_lm-redpajama-3b-chat (11m 36s)
cpp-chat_sample-ubuntu (16m 44s)
visual_language_chat_sample-ubuntu-minicpm_v2_6 (8m 24s)
visual_language_chat_sample-ubuntu-llava_1_5 / visual_language_chat_sample-ubuntu-llava (14m 42s)
visual_language_chat_sample-ubuntu-llava_next / visual_language_chat_sample-ubuntu-llava (35m 54s)
visual_language_chat_sample-ubuntu-internvl2 (24m 2s)
cpp-continuous-batching-ubuntu (16m 4s)
cpp-continuous-batching-windows (25m 29s)
cpp-continuous-batching-macos (21m 34s)
ci/gha_overall_status_causal_lm (0s)

Annotations

6 errors and 1 warning
cpp-Phi-1_5: Process completed with exit code 124.
cpp-greedy_causal_lm-redpajama-3b-chat: Process completed with exit code 124.
cpp-greedy_causal_lm-Qwen-7B-Chat: Process completed with exit code 1.
cpp-multinomial-greedy_causal_lm-ubuntu: Process completed with exit code 1.
cpp-greedy_causal_lm-windows: Process completed with exit code 1.
ci/gha_overall_status_causal_lm: Process completed with exit code 1.
ci/gha_overall_status_causal_lm: ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636