Actions: huggingface/text-generation-inference

CI build

1,971 workflow runs

Using new cache.
CI build #98: Pull request #2125 opened by Narsil
June 26, 2024 13:22 1h 6m 29s ci2
Ci test
CI build #97: Pull request #2124 opened by glegendre01
June 26, 2024 12:25 14m 58s ci-test
change push registry
CI build #95: Commit 1bd1dfc pushed by glegendre01
June 26, 2024 09:11 4m 32s ci-test
fix: simplify kserve endpoint and fix imports (#2119)
CI build #93: Commit be2d380 pushed by drbh
June 25, 2024 23:30 41m 13s main
Add support for Marlin 2:4 sparsity (#2102)
CI build #91: Commit f1f98e3 pushed by danieldk
June 25, 2024 19:09 1h 27m 38s main
Support AWQ quantization with bias (#2117)
CI build #90: Commit 14980df pushed by danieldk
June 25, 2024 19:09 58m 8s main
Enable multiple LoRa adapters (#2010)
CI build #89: Commit 04e1af9 pushed by drbh
June 25, 2024 18:46 41m 24s main
Enable multiple LoRa adapters
CI build #88: Pull request #2010 synchronize by drbh
June 25, 2024 16:23 57m 31s lora-internal
Fix CI . (#2118)
CI build #87: Commit a2a97b0 pushed by Narsil
June 25, 2024 15:53 42m 3s main
Fix CI .
CI build #86: Pull request #2118 opened by Narsil
June 25, 2024 15:28 1h 7m 17s fix_ci
first test with registry mirror
CI build #85: Commit 5d60daa pushed by glegendre01
June 25, 2024 15:13 1h 16m 18s ci-test
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
CI build #84: Pull request #1940 synchronize by Narsil
June 25, 2024 15:07 1h 4m 11s flashdecoding
Support AWQ quantization with bias
CI build #83: Pull request #2117 synchronize by danieldk
June 25, 2024 15:02 57m 54s bugfix/awq-with-bias
Support AWQ quantization with bias
CI build #82: Pull request #2117 opened by danieldk
June 25, 2024 14:59 3m 28s bugfix/awq-with-bias
Add pytest release marker (#2114)
CI build #81: Commit fc9c315 pushed by danieldk
June 25, 2024 14:53 1h 0m 54s main
fix cpu and xpu issue (#2116)
CI build #79: Commit e563983 pushed by Narsil
June 25, 2024 14:47 57m 40s main
Fix nccl regression on PyTorch 2.3 upgrade
CI build #77: Pull request #2099 synchronize by fxmarty
June 25, 2024 14:28 1h 0m 41s fix-nccl-regression
Add pytest release marker
CI build #75: Pull request #2114 synchronize by danieldk
June 25, 2024 13:32 1h 1m 31s ci/release-tests
[Major Change][Undecided yet] Move to FlashDecoding instead of PagedAttention kernel.
CI build #73: Pull request #1940 synchronize by Narsil
June 25, 2024 13:10 1h 21m 48s flashdecoding