Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[RFC] ggml: new backend for API Remoting Apple Metal https://en.wikipedia.org/wiki/Metal_(API) build Compilation issues ggml changes relating to the ggml tensor library for machine learning
#17072 opened Nov 7, 2025 by kpouget Loading…
Fix NetBSD compilation error
#17068 opened Nov 7, 2025 by xinitrcn1 Loading…
Add ops needed for new hybrid models: SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17063 opened Nov 6, 2025 by pwilkin Loading…
cmake: add option to build and link BoringSSL build Compilation issues
#17062 opened Nov 6, 2025 by angt Loading…
cuda: extended MMF_ROWS_PER_BLOCK ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17051 opened Nov 6, 2025 by zhang-hui-yulo Loading…
Add MoE dynamic routing with expert caching build Compilation issues documentation Improvements or additions to documentation examples
#17044 opened Nov 6, 2025 by jmangold23 Draft
ggml-cpu: handle 3d tensors in repack mat_mul ggml changes relating to the ggml tensor library for machine learning
#17030 opened Nov 5, 2025 by Alcpz Loading…
tests(test-backend-ops): Test backend ops verbosity testing Everything test related
#17029 opened Nov 5, 2025 by gabe-l-hart Loading…
ci: add Arm-hosted Graviton4 runner devops improvements to build systems and github actions
#17021 opened Nov 5, 2025 by sudhiarm Loading…
sampling : add support for GPU sampling (wip) examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs server testing Everything test related
#17004 opened Nov 4, 2025 by danbev Draft
3 of 9 tasks
Q4/Q8 Tiled Gemm Optimization. ggml changes relating to the ggml tensor library for machine learning
#16999 opened Nov 4, 2025 by shalinib-ibm Loading…
kleidiai: add optimized per-channel kernels for Q8_0 ggml changes relating to the ggml tensor library for machine learning
#16993 opened Nov 4, 2025 by chaxu01 Loading…
CUDA: add stream-based concurrency ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16991 opened Nov 4, 2025 by am17an Loading…
2 tasks
Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related Vulkan Issues specific to the Vulkan backend
#16985 opened Nov 4, 2025 by Phylliida Loading…
Mamba2 SSD Apple Metal https://en.wikipedia.org/wiki/Metal_(API) examples ggml changes relating to the ggml tensor library for machine learning model Model specific Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#16982 opened Nov 3, 2025 by gabe-l-hart Draft
sycl: flash-attention implementation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16969 opened Nov 3, 2025 by ye-NX Loading…
CUDA: add implicit conv3d ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#16948 opened Nov 2, 2025 by bssrdf Loading…
Add e2e tests for embedding raw flag devops improvements to build systems and github actions examples python python script changes testing Everything test related
#16940 opened Nov 2, 2025 by SamMalayek Draft
doc: Windows + clang/ninja build guide format cleanup documentation Improvements or additions to documentation
#16939 opened Nov 2, 2025 by jsjtxietian Loading…
ProTip! Exclude everything labeled bug with -label:bug.