Releases · teleprint-me/llama.cpp

09 Aug 02:45

3a14e00

b3550

gguf-py : simplify support for quant types (#8838)

* gguf-py : use classes for quants

* convert_hf : simplify internal quantization type selection

* gguf-py : fix flake8 lint

* gguf-py : fix BF16 numpy view type

* gguf-py : remove LlamaFileTypeMap

Too specific to 'llama.cpp', and would be a maintenance burden
to keep up to date.

* gguf-py : add generic quantize and dequantize functions

The quant classes no longer need to be known,
only the target or the source type,
for 'quantize' and 'dequantize', respectively.

Assets 20

07 Aug 20:46

github-actions

b3542

15fa07a

b3542

make : use C compiler to build metal embed object (#8899)

* make : use C compiler to build metal embed object

* use rm + rmdir to avoid -r flag in rm

Assets 20

04 Aug 01:26

github-actions

b3506

76614f3

b3506

ggml : reading the runtime sve config of the cpu (#8709)

* ggml : reading the runtime sve config of the cpu

* change to one time init to prevent performance drop

* prefix variable to avoid possible conflicts

* revert xxhash fix and add brackets

---------

Co-authored-by: domke <[email protected]>

Assets 20

02 Aug 05:01

github-actions

b3503

0fbbd88

b3503

[SYCL] Fixing wrong VDR iq4nl value (#8812)

Assets 20

30 Jul 16:39

github-actions

b3493

7e72aa7

b3493

py: add_array() will not add to kv store if value is an empty array (…

Assets 20

28 Jul 08:50

github-actions

b3484

4730fac

b3484

chore : Fix vulkan related compiler warnings, add help text, improve …

Assets 20

27 Jul 03:40

github-actions

b3468

2b1f616

b3468

ggml : reduce hash table reset cost (#8698)

* ggml : reduce hash table reset cost

* fix unreachable code warnings after GGML_ASSERT(false)

* GGML_ASSERT(false) -> GGML_ABORT("fatal error")

* GGML_ABORT use format string

Assets 20

26 Jul 21:53

github-actions

b3467

01245f5

b3467

llama : fix order of parameters (#8706)

usage of `aclrtGetMemInfo` is correct:

https://www.hiascend.com/doc_center/source/zh/canncommercial/63RC2/inferapplicationdev/aclcppdevg/aclcppdevg_03_0103.html

Co-authored-by: Judd <[email protected]>

Assets 20

25 Jul 23:55

github-actions

b3466

01aec4a

b3466

server : add Speech Recognition & Synthesis to UI (#8679)

* server : add Speech Recognition & Synthesis to UI

* server : add Speech Recognition & Synthesis to UI (fixes)

Assets 20

23 Jul 17:12

github-actions

b3448

b841d07

b3448

server : fix URL.parse in the UI (#8646)

Assets 20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: teleprint-me/llama.cpp

b3550

b3542

b3506

b3503

b3493

b3484

b3468

b3467

b3466

b3448