Releases · ggml-org/llama.cpp
b4733
server : fix divide-by-zero in metrics reporting (#11915)
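As generic context for the fix above: guarding a metrics ratio so an empty measurement interval cannot divide by zero usually looks like the sketch below. The variable names are illustrative assumptions, not the actual patch.

```cpp
// Illustrative only: report 0 instead of inf/nan when the denominator
// is zero. Names below are assumptions, not the server's actual code.
#include <cstdio>

static double safe_ratio(double num, double den) {
    return den > 0.0 ? num / den : 0.0; // avoid divide-by-zero
}

int main() {
    double t_prompt_processing_ms = 0.0; // no prompts processed yet
    double n_prompt_tokens        = 0.0;
    std::printf("prompt tokens/sec: %f\n",
                safe_ratio(1e3 * n_prompt_tokens, t_prompt_processing_ms));
    return 0;
}
```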
b4732
vulkan: implement several ops relevant for ggml_opt (#11769)

* vulkan: support memset_tensor
* vulkan: support GGML_OP_SUM
* vulkan: implement GGML_OP_ARGMAX
* vulkan: implement GGML_OP_SUB
* vulkan: implement GGML_OP_COUNT_EQUAL
* vulkan: implement GGML_OP_OPT_STEP_ADAMW
* vulkan: fix check_results RWKV_WKV6 crash and memory leaks
* vulkan: implement GGML_OP_REPEAT_BACK
* tests: remove invalid test-backend-ops REPEAT_BACK tests
* vulkan: fix COUNT_EQUAL memset using a fillBuffer command
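Among the ops listed above, GGML_OP_OPT_STEP_ADAMW corresponds to the standard AdamW optimizer update. Below is a minimal elementwise sketch of that textbook math, assuming plain arrays and illustrative parameter names; the real op operates on ggml tensors and its exact fusion is not shown here.

```cpp
// Sketch of the elementwise AdamW update. Only the textbook math is
// asserted; the array-based interface is an assumption for clarity.
#include <cmath>
#include <cstdio>

void adamw_step(float *x, float *m, float *v, const float *g, int n,
                float alpha, float beta1, float beta2, float eps,
                float wd, int t) {
    const float bc1 = 1.0f - std::pow(beta1, (float) t); // bias corrections
    const float bc2 = 1.0f - std::pow(beta2, (float) t);
    for (int i = 0; i < n; ++i) {
        m[i] = beta1 * m[i] + (1.0f - beta1) * g[i];          // 1st moment
        v[i] = beta2 * v[i] + (1.0f - beta2) * g[i] * g[i];   // 2nd moment
        const float mh = m[i] / bc1;
        const float vh = v[i] / bc2;
        // decoupled weight decay, applied outside the gradient term
        x[i] -= alpha * (mh / (std::sqrt(vh) + eps) + wd * x[i]);
    }
}

int main() {
    float x[2] = {1.0f, -2.0f}, m[2] = {0}, v[2] = {0}, g[2] = {0.1f, -0.3f};
    adamw_step(x, m, v, g, 2, 1e-3f, 0.9f, 0.999f, 1e-8f, 0.01f, 1);
    std::printf("x = %f %f\n", x[0], x[1]);
    return 0;
}
```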
b4731
server : bump httplib to 0.19.0 (#11908)
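For context on the dependency being bumped: cpp-httplib is the header-only HTTP library behind the server. A minimal standalone use of its API is sketched below; the route and port are illustrative, not the server's actual configuration.

```cpp
// Minimal cpp-httplib server. Requires httplib.h from yhirose/cpp-httplib.
#include "httplib.h"

int main() {
    httplib::Server svr;
    svr.Get("/health", [](const httplib::Request &, httplib::Response &res) {
        res.set_content("ok", "text/plain");
    });
    svr.listen("127.0.0.1", 8080); // blocks, serving requests
    return 0;
}
```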
b4730
common : Fix a typo in help (#11899)

This patch fixes a typo in the command help: prefx -> prefix.

Signed-off-by: Masanari Iida <standby24x7@gmail.com>
b4728
vulkan: support multi/vision rope, and noncontiguous rope (#11902)
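All RoPE variants share the same core rotation; broadly, the multi/vision and noncontiguous paths change which positions and elements feed it, not the math. A minimal sketch of that rotation for one channel pair follows, with illustrative names (the real kernels run as Vulkan shaders):

```cpp
// Rotary position embedding: rotate a channel pair by a position-dependent
// angle theta = pos * freq_base^(-2*i/n_dims). Names are illustrative.
#include <cmath>
#include <cstdio>

void rope_pair(float &x0, float &x1, int pos, int dim_pair, int n_dims,
               float freq_base) {
    const float theta = (float) pos *
        std::pow(freq_base, -2.0f * (float) dim_pair / (float) n_dims);
    const float c = std::cos(theta), s = std::sin(theta);
    const float t0 = x0, t1 = x1;
    x0 = t0 * c - t1 * s;
    x1 = t0 * s + t1 * c;
}

int main() {
    float x0 = 1.0f, x1 = 0.0f;
    rope_pair(x0, x1, /*pos=*/3, /*dim_pair=*/0, /*n_dims=*/128, 10000.0f);
    std::printf("rotated: %f %f\n", x0, x1);
    return 0;
}
```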
b4727
metal : fix the crash caused by the lack of residency set support on …
b4724
metal : optimize dequant q6_K kernel (#11892)
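For context, q6_K stores 6-bit quants in 256-element super-blocks with per-group int8 scales and an fp16 super-block scale d. The simplified sketch below shows what dequantization computes; the real ggml layout splits each value into low-4-bit and high-2-bit planes, which is deliberately omitted here, so treat the packing as an assumption for clarity.

```cpp
// Simplified q6_K-style dequantization: each 6-bit quant q in [0, 63] is
// recentred to [-32, 31], then scaled by a per-group scale and the
// super-block scale d. NOT the actual ggml bit packing.
#include <cstdint>
#include <cstdio>

void dequant_6bit(const uint8_t *q, const int8_t *scales, float d,
                  float *out, int n, int group) {
    for (int i = 0; i < n; ++i) {
        out[i] = d * (float) scales[i / group] * (float) ((int) q[i] - 32);
    }
}

int main() {
    uint8_t q[4]      = {0, 31, 32, 63}; // one 6-bit value per output
    int8_t  scales[1] = {2};
    float   out[4];
    dequant_6bit(q, scales, /*d=*/1.0f, out, 4, /*group=*/16);
    for (float v : out) std::printf("%f ", v);
    std::printf("\n");
    return 0;
}
```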
b4722
repo : update links to new url (#11886)

* repo : update links to new url ggml-ci
* cont : more urls ggml-ci
b4721
server: fix type promotion typo causing crashes w/ --jinja w/o tools …
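The entry above is truncated here. As generic, illustrative context only: crashes in this general class often come from an implicit signed/unsigned conversion in integer arithmetic, as in the sketch below. This is not the actual server bug.

```cpp
// Classic unsigned pitfall: with an empty vector, size() - 1 underflows
// to SIZE_MAX, so a loop bound of that form reads far out of bounds.
#include <cstdio>
#include <vector>

int main() {
    std::vector<int> tools; // empty, e.g. no tools supplied

    // BUG pattern (do not use): underflows when tools is empty.
    // for (size_t i = 0; i <= tools.size() - 1; ++i) { use(tools[i]); }

    // FIX pattern: keep the bound in a form that cannot underflow.
    for (size_t i = 0; i < tools.size(); ++i) {
        std::printf("%d\n", tools[i]);
    }
    std::printf("handled empty tools safely\n");
    return 0;
}
```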
b4720
vulkan: initial support for IQ1_S and IQ1_M quantizations (#11528)

* vulkan: initial support for IQ1_S and IQ1_M quantizations
* vulkan: define MMV kernels for IQ1 quantizations
* devops: increase timeout of Vulkan tests again
* vulkan: simplify ifdef for init_iq_shmem
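A usage note: backend changes like these are typically verified with the repository's test-backend-ops tool, along the lines of the commands below. The backend and op names are examples; check test-backend-ops --help for the exact flags in your build.

```sh
# from the llama.cpp build directory, with a Vulkan-enabled build
./bin/test-backend-ops test -b Vulkan0 -o MUL_MAT   # correctness vs CPU
./bin/test-backend-ops perf -b Vulkan0 -o MUL_MAT   # performance
```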