Pull requests: ggml-org/llama.cpp


Pull requests list

ggml: add ggml_rope_comp
Labels: Apple Metal, examples, ggml, server, testing
#18401 opened Dec 26, 2025 by ngxson Draft

Fix in the example in examples/retrieval
Labels: examples
#18400 opened Dec 26, 2025 by hectorem2

vulkan: disable events for UMA systems to workaround directio failures
Labels: ggml, Vulkan
#18397 opened Dec 26, 2025 by jeffbolznv

ggml-hexagon: optimize activation function
Labels: ggml
#18393 opened Dec 26, 2025 by joeldushouyu

(proposal) contributing: tighten AI usage policy
#18388 opened Dec 26, 2025 by ngxson

rpc: fix segfault on invalid endpoint format
Labels: ggml
#18387 opened Dec 26, 2025 by o7si

model-conversion : add device option to embd run orig model
Labels: examples, python
#18386 opened Dec 26, 2025 by danbev

opencl: allow resizing transpose buffers
Labels: ggml, OpenCL
#18384 opened Dec 26, 2025 by lhez

Finally Added Kimi-Linear Support :)
Labels: examples, ggml, model, Nvidia GPU, python
#18381 opened Dec 26, 2025 by Aaryan-Kapoor

llama : fix integer overflow in llama_chat_apply_template
Labels: examples, testing
#18378 opened Dec 25, 2025 by yashwantbezawada

llama-fit-params: fix Gemma 3 calculation
#18372 opened Dec 25, 2025 by JohannesGaessler

Support Youtu-VL Model
Labels: examples, python
#18367 opened Dec 25, 2025 by f291400

sampling: reuse token data buffer in llama_sampler_sample
#18365 opened Dec 25, 2025 by JayZenith

NLLB-600 language translation implementation
Labels: model, python
#18359 opened Dec 25, 2025 by Acceldium
11 of 16 tasks

llama-fit-params: fix overflow check
#18354 opened Dec 24, 2025 by JohannesGaessler

[WIP] tool-call: experimental migration of all parsers to peg-parser infra (w/ better test coverage)
Labels: documentation, examples, python, script, server, testing
#18353 opened Dec 24, 2025 by ochafik Draft
7 tasks

ggml-cpu : add riscv vec dot kernel dispatch based on vlen
Labels: ggml
#18348 opened Dec 24, 2025 by xctan Draft

Work around broken IntelSYCLConfig.cmake in Intel oneAPI 2025.x
Labels: ggml, SYCL
#18345 opened Dec 24, 2025 by rrsathe

common/grammar : replace problematic backtracking regex [\s\S]*
Labels: testing
#18342 opened Dec 24, 2025 by aldehir

ggml-cuda : fix INT_MAX overflow in cpy kernels (#18140)
Labels: ggml, Nvidia GPU
#18340 opened Dec 24, 2025 by Muhammad-Kamran-Khan

android: routine maintenance - Dec 2025
Labels: android, examples
#18338 opened Dec 24, 2025 by naco-siren