Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Model: Qwen3 Next examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes testing Everything test related
#16095 opened Sep 18, 2025 by pwilkin Loading…
Modern Bert Support python python script changes
#15641 opened Aug 28, 2025 by ryan-mangeno Loading…
llama : add llama_batch_ext android Issues specific to Android examples python python script changes server
#11875 opened Feb 14, 2025 by ngxson Loading…
llama: Attempt to add ModernBert model Model specific python python script changes
#14014 opened Jun 4, 2025 by huydt84 Loading…
add FP8 support to gguf/llama: build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning script Script related Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes testing Everything test related
#10055 opened Oct 26, 2024 by Djip007 Draft
1 of 3 tasks
tool: add convertation of text/parquet to custom format build Compilation issues examples
#14622 opened Jul 10, 2025 by lexasub Loading…
Implementation of a sequence repetition penalty sampler enhancement New feature or request generation quality Quality of model output need feedback Testing and feedback with results are needed
#2593 opened Aug 12, 2023 by KerfuffleV2 Draft
llama : second attempt to refactor vision API examples python python script changes server
#11292 opened Jan 18, 2025 by ngxson Draft
1 of 5 tasks
WIP: Add model merge example demo Demonstrate some concept or idea, not intended to be merged help wanted Needs help from the community
#5741 opened Feb 26, 2024 by ngxson Draft
cuda : Add conv2d Implicit GEMM ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#15805 opened Sep 4, 2025 by bssrdf Loading…
[MPI] Add support for per-node options, thread counts, and layer allocations build Compilation issues examples ggml changes relating to the ggml tensor library for machine learning server
#3334 opened Sep 26, 2023 by AutonomicPerfectionist Draft
2 of 5 tasks
Implement llama-pull tool examples
#16423 opened Oct 4, 2025 by ericcurtin Loading…
llama-cli: add support for reasoning examples
#16603 opened Oct 16, 2025 by bandoti Loading…
support MiniCPM-V-2 demo Demonstrate some concept or idea, not intended to be merged enhancement New feature or request examples python python script changes Review Complexity : High Generally require indepth knowledge of LLMs or GPUs
#6919 opened Apr 26, 2024 by Achazwl Loading…
Layer skipping/self-speculation demo demo Demonstrate some concept or idea, not intended to be merged research 🔬
#3565 opened Oct 10, 2023 by KerfuffleV2 Draft
Add ops needed for new hybrid models: SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17063 opened Nov 6, 2025 by pwilkin Loading…
Server: enable lookup decoding enhancement New feature or request examples Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#6828 opened Apr 22, 2024 by JohannesGaessler Loading…
Introduce New Lookup-Table(LUT)-Based Matrix Multiplication Method ggml changes relating to the ggml tensor library for machine learning python python script changes Tensor Encoding Scheme https://github.com/ggerganov/llama.cpp/wiki/Tensor-Encoding-Schemes
#10181 opened Nov 5, 2024 by QingtaoLi1 Loading…
2 of 4 tasks
ggml-cuda: Vulkan direct conv 2D ported to CUDA ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16088 opened Sep 18, 2025 by etasnadi Loading…
ProTip! Updated in the last three days: updated:>2025-11-07.