-
Notifications
You must be signed in to change notification settings - Fork 13.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[RFC] ggml: new backend for API Remoting
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#17072
opened Nov 7, 2025 by
kpouget
Loading…
Add ops needed for new hybrid models: SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#17063
opened Nov 6, 2025 by
pwilkin
Loading…
cmake: add option to build and link BoringSSL
build
Compilation issues
#17062
opened Nov 6, 2025 by
angt
Loading…
cuda: extended MMF_ROWS_PER_BLOCK
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17051
opened Nov 6, 2025 by
zhang-hui-yulo
Loading…
fix : Dangling pointer for non-empty trigger words in lazy grammar construction
#17048
opened Nov 6, 2025 by
marek-hradil
Loading…
Add MoE dynamic routing with expert caching
build
Compilation issues
documentation
Improvements or additions to documentation
examples
#17044
opened Nov 6, 2025 by
jmangold23
•
Draft
ggml-cpu: handle 3d tensors in repack mat_mul
ggml
changes relating to the ggml tensor library for machine learning
#17030
opened Nov 5, 2025 by
Alcpz
Loading…
tests(test-backend-ops): Test backend ops verbosity
testing
Everything test related
#17029
opened Nov 5, 2025 by
gabe-l-hart
Loading…
examples(eval-callback): Eval callback verbosity
examples
#17028
opened Nov 5, 2025 by
gabe-l-hart
Loading…
ci: add Arm-hosted Graviton4 runner
devops
improvements to build systems and github actions
#17021
opened Nov 5, 2025 by
sudhiarm
Loading…
sampling : add support for GPU sampling (wip)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
server
testing
Everything test related
Q4/Q8 Tiled Gemm Optimization.
ggml
changes relating to the ggml tensor library for machine learning
#16999
opened Nov 4, 2025 by
shalinib-ibm
Loading…
kleidiai: add optimized per-channel kernels for Q8_0
ggml
changes relating to the ggml tensor library for machine learning
#16993
opened Nov 4, 2025 by
chaxu01
Loading…
CUDA: add stream-based concurrency
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16991
opened Nov 4, 2025 by
am17an
Loading…
2 tasks
Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#16985
opened Nov 4, 2025 by
Phylliida
Loading…
Mamba2 SSD
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
model
Model specific
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16982
opened Nov 3, 2025 by
gabe-l-hart
•
Draft
sycl: flash-attention implementation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#16969
opened Nov 3, 2025 by
ye-NX
Loading…
Refactor llm_chat_template_from_str to avoid throwing exceptions
#16965
opened Nov 3, 2025 by
AnonN10
Loading…
CUDA: add implicit conv3d
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16948
opened Nov 2, 2025 by
bssrdf
Loading…
Add e2e tests for embedding raw flag
devops
improvements to build systems and github actions
examples
python
python script changes
testing
Everything test related
#16940
opened Nov 2, 2025 by
SamMalayek
•
Draft
doc: Windows + clang/ninja build guide format cleanup
documentation
Improvements or additions to documentation
#16939
opened Nov 2, 2025 by
jsjtxietian
Loading…
ProTip!
Exclude everything labeled
bug with -label:bug.