-
Notifications
You must be signed in to change notification settings - Fork 12.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ggml : support broadcast for ggml_soft_max_ext and ggml_flash_attn_ext
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14435
opened Jun 28, 2025 by
ggerganov
Loading…
2 of 5 tasks
fix(hybrid cache): Only call apply on child caches in the success state
#14428
opened Jun 27, 2025 by
gabe-l-hart
Loading…
ggml : add ggml_scale_bias
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
[CANN]update aclnnGroupedMatmulV2 to aclnnGroupedMatmulV3
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14411
opened Jun 27, 2025 by
noemotiovon
Loading…
[CANN] weight format to nz for Ascend310P3
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14407
opened Jun 27, 2025 by
tqgy6
Loading…
OpenCL: add conv2d kernel
ggml
changes relating to the ggml tensor library for machine learning
#14403
opened Jun 26, 2025 by
rmatif
Loading…
Add explanation to --no-mmap in llama server
examples
server
#14399
opened Jun 26, 2025 by
malte-j
Loading…
ggml : add pointer to attach user data
ggml
changes relating to the ggml tensor library for machine learning
#14397
opened Jun 26, 2025 by
koush
Loading…
SYCL: Take improvements from GLU branch and disable faulty fp16 exp kernel
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14395
opened Jun 26, 2025 by
qnixsynapse
Loading…
compare-commits.sh: support both llama-bench and test-backend-ops
python
python script changes
script
Script related
#14392
opened Jun 26, 2025 by
yeahdongcn
Loading…
Add Conv2d for CPU
ggml
changes relating to the ggml tensor library for machine learning
#14388
opened Jun 26, 2025 by
am17an
Loading…
ggml-cpu: Build variant targeting Neoverse-V2
ggml
changes relating to the ggml tensor library for machine learning
#14380
opened Jun 25, 2025 by
ckastner
Loading…
webui: preserve partial content when streaming errors occur
examples
server
#14374
opened Jun 25, 2025 by
Aaryan-549
Loading…
5 of 8 tasks
Q2k interleaving implementation - x86/x64 SIMD
ggml
changes relating to the ggml tensor library for machine learning
#14373
opened Jun 25, 2025 by
Srihari-mcw
Loading…
test-backend-ops: add support for specifying output format
testing
Everything test related
#14368
opened Jun 25, 2025 by
yeahdongcn
Loading…
vulkan: Add fusion support for RMS_NORM+MUL
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14366
opened Jun 24, 2025 by
jeffbolznv
Loading…
llama : add high-throughput mode
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
build: refine toplevel .gitignore
script
Script related
#14355
opened Jun 24, 2025 by
zhouwg
Loading…
1 task done
Add script to test op perf and compare
python
python script changes
script
Script related
#14354
opened Jun 24, 2025 by
yeahdongcn
Loading…
Make the shell scripts cross-platform
devops
improvements to build systems and github actions
examples
script
Script related
server
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#14341
opened Jun 23, 2025 by
vedranmiletic
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:master.