-
Notifications
You must be signed in to change notification settings - Fork 14.3k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ggml: add ggml_rope_comp
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
server
testing
Everything test related
vulkan: disable events for UMA systems to workaround directio failures
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#18397
opened Dec 26, 2025 by
jeffbolznv
Loading…
ggml-hexagon: optimize activation function
ggml
changes relating to the ggml tensor library for machine learning
#18393
opened Dec 26, 2025 by
joeldushouyu
Loading…
rpc: fix segfault on invalid endpoint format
ggml
changes relating to the ggml tensor library for machine learning
#18387
opened Dec 26, 2025 by
o7si
Loading…
model-conversion : add device option to embd run orig model
examples
python
python script changes
#18386
opened Dec 26, 2025 by
danbev
Loading…
opencl: allow resizing transpose buffers
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
#18384
opened Dec 26, 2025 by
lhez
Loading…
Finally Added Kimi-Linear Support :)
examples
ggml
changes relating to the ggml tensor library for machine learning
model
Model specific
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
#18381
opened Dec 26, 2025 by
Aaryan-Kapoor
Loading…
llama : fix integer overflow in llama_chat_apply_template
examples
testing
Everything test related
#18378
opened Dec 25, 2025 by
yashwantbezawada
Loading…
Fix a crash on multiple active LoRa (issue 18050)
examples
server
#18375
opened Dec 25, 2025 by
byko3y
Loading…
llama_fit_params: return enum for fail vs. error
examples
#18374
opened Dec 25, 2025 by
JohannesGaessler
Loading…
feat: Add model pinning feature to protect critical models from LRU eviction
examples
server
#18373
opened Dec 25, 2025 by
malaiwah
Loading…
Support Youtu-VL Model
examples
python
python script changes
#18367
opened Dec 25, 2025 by
f291400
Loading…
sampling: reuse token data buffer in llama_sampler_sample
#18365
opened Dec 25, 2025 by
JayZenith
Loading…
NLLB-600 language translation implementation
model
Model specific
python
python script changes
#18359
opened Dec 25, 2025 by
Acceldium
Loading…
11 of 16 tasks
feat: Add memory factory hook for custom KV cache implementations
#18357
opened Dec 24, 2025 by
rmarnold
Loading…
[WIP] tool-call: experimental migration of all parsers to peg-parser infra (w/ better test coverage)
documentation
Improvements or additions to documentation
examples
python
python script changes
script
Script related
server
testing
Everything test related
ggml-cpu : add riscv vec dot kernel dispatch based on vlen
ggml
changes relating to the ggml tensor library for machine learning
Work around broken IntelSYCLConfig.cmake in Intel oneAPI 2025.x
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#18345
opened Dec 24, 2025 by
rrsathe
Loading…
common/grammar : replace problematic backtracking regex Everything test related
[\s\S]*
testing
#18342
opened Dec 24, 2025 by
aldehir
Loading…
ggml-cuda : fix INT_MAX overflow in cpy kernels (#18140)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#18340
opened Dec 24, 2025 by
Muhammad-Kamran-Khan
Loading…
android: routine maintenance - Dec 2025
android
Issues specific to Android
examples
#18338
opened Dec 24, 2025 by
naco-siren
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.