Actions: ggerganov/llama.cpp

Pull Request Labeler

4,069 workflow run results

Update convert_hf_to_gguf.py
Pull Request Labeler #3969: Pull request #9542 opened by blap
September 18, 2024 23:02 16s
add solar pro support
Pull Request Labeler #3968: Pull request #9541 opened by mxyng
September 18, 2024 22:38 18s
llama : add reranking support
Pull Request Labeler #3967: Pull request #9510 synchronize by ggerganov
September 18, 2024 18:20 16s
ggml : fix n_threads_cur initialization with one thread
Pull Request Labeler #3966: Pull request #9538 synchronize by max-krasnyansky
September 18, 2024 16:00 19s
ggml : fix n_threads_cur initialization with one thread
Pull Request Labeler #3965: Pull request #9538 opened by slaren
September 18, 2024 12:59 16s
Update clip.cpp
Pull Request Labeler #3964: Pull request #9482 synchronize by Tejaakshaykumar
September 18, 2024 10:27 20m 34s
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80)
Pull Request Labeler #3963: Pull request #9526 synchronize by yeahdongcn
September 18, 2024 09:54 38m 30s
vocab: refactor tokenizer to reduce the overhead of creating multi times tokenizer
Pull Request Labeler #3962: Pull request #9449 synchronize by kylo5aby
September 18, 2024 09:52 26m 9s
llama : use reserve/emplace_back in sampler_sample
Pull Request Labeler #3961: Pull request #9534 opened by danbev
September 18, 2024 09:51 5m 2s
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80)
Pull Request Labeler #3960: Pull request #9526 synchronize by yeahdongcn
September 18, 2024 09:43 20s
Implementations for Q4_0_8_8 quantization based functions - AVX512 version of ggml_gemm_q4_0_8x8_q8_0
Pull Request Labeler #3959: Pull request #9532 opened by Srihari-mcw
September 18, 2024 08:34 17s
Add Intel Advanced Matrix Extensions (AMX) support to ggml
Pull Request Labeler #3958: Pull request #8998 synchronize by mingfeima
September 18, 2024 07:31 35m 47s
server : clean-up completed tasks from waiting list
Pull Request Labeler #3957: Pull request #9531 opened by ggerganov
September 18, 2024 07:22 32m 31s
Add Intel Advanced Matrix Extensions (AMX) support to ggml
Pull Request Labeler #3956: Pull request #8998 synchronize by mingfeima
September 18, 2024 07:12 42m 13s
Add Intel Advanced Matrix Extensions (AMX) support to ggml
Pull Request Labeler #3955: Pull request #8998 synchronize by mingfeima
September 18, 2024 06:20 21m 33s
Add Intel Advanced Matrix Extensions (AMX) support to ggml
Pull Request Labeler #3954: Pull request #8998 synchronize by mingfeima
September 18, 2024 06:16 24m 35s
ggml: Add run-time detection of neon, i8mm and sve
Pull Request Labeler #3953: Pull request #9331 synchronize by eddnjjn
September 18, 2024 06:12 19s
server : fix OpenSSL build by removing invalid LOG_INFO references
Pull Request Labeler #3952: Pull request #9529 opened by EZForever
September 18, 2024 02:59 16s
bugfix: structured output response_format does not match openai
Pull Request Labeler #3951: Pull request #9527 opened by VJHack
September 18, 2024 02:14 19s
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80)
Pull Request Labeler #3950: Pull request #9526 opened by yeahdongcn
September 18, 2024 01:59 18s
llama: (proposal) propagating the results of graph_compute to the user interface
Pull Request Labeler #3949: Pull request #9525 opened by Xarbirus
September 17, 2024 20:33 7m 48s
llama-bench: correct argument parsing error message
Pull Request Labeler #3948: Pull request #9524 opened by Xarbirus
September 17, 2024 20:22 19s
llama : add reranking support
Pull Request Labeler #3947: Pull request #9510 synchronize by ggerganov
September 17, 2024 13:38 27m 54s
IBM Granite MoE Architecture
Pull Request Labeler #3946: Pull request #9438 synchronize by gabe-l-hart
September 17, 2024 12:46 16s
llama : add reranking support
Pull Request Labeler #3945: Pull request #9510 synchronize by ggerganov
September 17, 2024 10:53 11m 8s