-
Notifications
You must be signed in to change notification settings - Fork 14.2k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
llama: fix RPC for -fit on
ggml
changes relating to the ggml tensor library for machine learning
#18233
opened Dec 20, 2025 by
JohannesGaessler
Loading…
server : implement extra_args support for /models/load endpoint
examples
server
#18232
opened Dec 20, 2025 by
Chrisischris
•
Draft
webui: Fix the header backdrop blur
examples
server
#18230
opened Dec 20, 2025 by
ImadSaddik
Loading…
server: /v1/responses (text generation only)
examples
server
#18227
opened Dec 20, 2025 by
openingnow
Loading…
webui: use server presets as parameter placeholders
examples
server
#18226
opened Dec 20, 2025 by
ServeurpersoCom
Loading…
ggml-metal: guard buffer map slicing
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#18225
opened Dec 20, 2025 by
SzymonPrajs
Loading…
webui: apply webui_settings on first load
examples
server
#18223
opened Dec 20, 2025 by
ServeurpersoCom
Loading…
common : reorganize includes to prioritize vendored deps
#18222
opened Dec 20, 2025 by
aldehir
Loading…
ggml-metal: fix memset range and temp buffer leaks
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#18221
opened Dec 20, 2025 by
SzymonPrajs
Loading…
Make sure that CMAKE will always use JSON headers under vendor directory
examples
server
testing
Everything test related
#18218
opened Dec 20, 2025 by
ThanatosShinji
Loading…
convert: rework ftype heuristics
python
python script changes
#18214
opened Dec 20, 2025 by
taronaeo
Loading…
ggml-metal: fix bf16/f16 matmul kernels
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#18210
opened Dec 20, 2025 by
SzymonPrajs
Loading…
Fix BLAS Compile Definitions
ggml
changes relating to the ggml tensor library for machine learning
#18205
opened Dec 19, 2025 by
DaAwesomeP
Loading…
HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases where a lot of splits would be generated
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18202
opened Dec 19, 2025 by
IMbackK
Loading…
llamafile: add rvv support for sgemm kernels
ggml
changes relating to the ggml tensor library for machine learning
#18199
opened Dec 19, 2025 by
taimur-10x
Loading…
cmake: Added more x86_64 CPU backends when building with changes relating to the ggml tensor library for machine learning
GGML_CPU_ALL_VARIANTS=On
ggml
vulkan: fix im2col overflowing maxworkgroupcount
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#18180
opened Dec 18, 2025 by
jeffbolznv
Loading…
vulkan: Warptile tuning for Intel Xe2/Xe3
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18178
opened Dec 18, 2025 by
virajwad
Loading…
tool/ex/tests: consistently free ctx, then model
examples
testing
Everything test related
#18168
opened Dec 18, 2025 by
JohannesGaessler
Loading…
Adding --direct-io flag for model loading
examples
#18166
opened Dec 18, 2025 by
JTischbein
Loading…
spm: make llama a dynamic library; leave placeholder for ggml/gguf na…
#18165
opened Dec 18, 2025 by
steven-moon
Loading…
ggml-hexagon: gelu optimization
ggml
changes relating to the ggml tensor library for machine learning
#18151
opened Dec 17, 2025 by
joeldushouyu
•
Draft
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.