Actions: l3utterfly/llama.cpp

Publish Docker image

117 workflow runs

readme : remove -ins (#7759)
Publish Docker image #68: Commit 9973e81 pushed by l3utterfly
June 5, 2024 07:22 8m 39s master
merge from upstream
Publish Docker image #67: Pull request #21 synchronize by l3utterfly
May 27, 2024 07:44 57m 51s merge
metal : disable FA kernel for HS=256 (#7556)
Publish Docker image #66: Commit 62bfef5 pushed by l3utterfly
May 27, 2024 07:42 15m 36s master
merge from upstream
Publish Docker image #65: Pull request #20 opened by l3utterfly
May 22, 2024 06:36 29m 34s master
llama : add phi3 128K model support (#7225)
Publish Docker image #64: Commit 201cc11 pushed by l3utterfly
May 22, 2024 06:35 15m 13s master
merge from upstream
Publish Docker image #63: Pull request #19 synchronize by l3utterfly
May 16, 2024 12:37 51m 25s merge
rpc : get available mem for the CPU backend
Publish Docker image #62: Commit 9afdffe pushed by l3utterfly
May 16, 2024 12:36 10m 17s master
merged from upstream
Publish Docker image #61: Pull request #18 synchronize by l3utterfly
May 3, 2024 04:13 46m 46s merge
Remove .attention from skipped tensors to match more accurately (#7051)
Publish Docker image #60: Commit 60325fa pushed by l3utterfly
May 3, 2024 04:08 7m 27s master
Flash attn update
Publish Docker image #59: Pull request #17 opened by l3utterfly
April 30, 2024 09:11 30m 4s flash-attn
convert : use utf8 encoding (#7000)
Publish Docker image #58: Commit 952d03d pushed by l3utterfly
April 30, 2024 08:38 7m 12s master
merged from upstream
Publish Docker image #57: Pull request #15 opened by l3utterfly
April 30, 2024 07:41 47m 21s master
Improve usability of --model-url & related flags (#6930)
Publish Docker image #56: Commit 8843a98 pushed by l3utterfly
April 30, 2024 07:41 7m 12s master
merge from upstream
Publish Docker image #55: Pull request #14 opened by l3utterfly
April 29, 2024 01:29 19m 0s master
Fix more int overflow during quant (PPL/CUDA). (#6563)
Publish Docker image #54: Commit e00b4a8 pushed by l3utterfly
April 29, 2024 01:29 6m 53s master
Merge from upstream
Publish Docker image #53: Pull request #12 synchronize by l3utterfly
April 25, 2024 07:03 4m 44s merge-master-layla
Dry sampler
Publish Docker image #52: Pull request #11 synchronize by l3utterfly
April 25, 2024 06:36 20m 20s dry-sampler
Test flash attn
Publish Docker image #51: Pull request #10 opened by l3utterfly
April 25, 2024 06:30 17m 44s test-flash-attn
README: add graphic for matrix multiplication (#6881)
Publish Docker image #50: Commit 784e11d pushed by l3utterfly
April 25, 2024 06:29 10m 29s master
[SYCL] Windows default build instructions without -DLLAMA_SYCL_F16 fl…
Publish Docker image #49: Commit 4e96a81 pushed by l3utterfly
April 23, 2024 02:40 7m 23s master
Gg/flash attn
Publish Docker image #48: Pull request #9 opened by l3utterfly
April 19, 2024 07:45 8m 27s ggerganov:gg/flash-attn
merged from upstream
Publish Docker image #47: Pull request #8 opened by l3utterfly
April 16, 2024 01:57 49m 1s master
main: add --json-schema / -j flag (#6659)
Publish Docker image #46: Commit 7593639 pushed by l3utterfly
April 16, 2024 01:56 6m 58s master
merge from upstream
Publish Docker image #45: Pull request #7 opened by l3utterfly
April 6, 2024 01:15 45m 26s master
gguf.py : add licence and version to gguf writer (#6504)
Publish Docker image #44: Commit a8bd14d pushed by l3utterfly
April 6, 2024 01:14 7m 21s master