Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

ggml-cpu:add RISC-V RVV (Zvfh) optimization for FP16 to FP32 conversion ggml changes relating to the ggml tensor library for machine learning
#17161 opened Nov 11, 2025 by ixgbe Loading…
convert : register UMT5Model architecture for T5 conversion python python script changes
#17160 opened Nov 11, 2025 by levkropp Loading…
vulkan: change graph_compute to be async and enable get_tensor_async ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17158 opened Nov 10, 2025 by jeffbolznv Loading…
HIP: WMMA-MMQ kernels for RDNA 4 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17156 opened Nov 10, 2025 by jiachengjason Draft
llama.android : Rewrite Android binding android Issues specific to Android documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning
#17152 opened Nov 10, 2025 by hanyin-arm Loading…
vendor: split httplib to cpp/h files build Compilation issues examples python python script changes script Script related server
#17150 opened Nov 10, 2025 by ngxson Loading…
Install rpc-server when GGML_RPC is ON. devops improvements to build systems and github actions examples nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#17149 opened Nov 10, 2025 by nbp Loading…
vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17147 opened Nov 10, 2025 by SavicStefan Loading…
metal : make the FA extra sizes consistent Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#17143 opened Nov 10, 2025 by ggerganov Loading…
Add complete Megrez-MoE support: GGUF conversion + inference. model Model specific python python script changes
#17141 opened Nov 10, 2025 by tamarPal Loading…
common : implement parser combinators for chat parsing [WIP] testing Everything test related
#17136 opened Nov 10, 2025 by aldehir Draft
5 of 9 tasks
hexagon: various Op fixes ggml changes relating to the ggml tensor library for machine learning
#17135 opened Nov 10, 2025 by max-krasnyansky Loading…
vulkan: disable rms_norm + mul + rope for old gpus ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17134 opened Nov 10, 2025 by netrunnereve Loading…
SYCL: add full support for ABS unary op documentation Improvements or additions to documentation ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17126 opened Nov 9, 2025 by shani-f Loading…
llama: introduce support for model-embedded sampling parameters python python script changes
#17120 opened Nov 9, 2025 by taronaeo Loading…
rpc : fix alloc size logic Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#17116 opened Nov 9, 2025 by ggerganov Loading…
2 tasks
Refactor: convert_hf_to_gguf.py python python script changes refactoring Refactoring
#17114 opened Nov 9, 2025 by pwilkin Draft
CPU SIMD and pipeline optimizations across vec/mmq/ops/kv-cache/repack ggml changes relating to the ggml tensor library for machine learning
#17113 opened Nov 8, 2025 by NoahOksuz Loading…
CUDA: support F32 kernel type for CONV_TRANSPOSE_2D ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#17094 opened Nov 8, 2025 by AgainstEntropy Loading…
add version to all shared object files examples ggml changes relating to the ggml tensor library for machine learning
#17091 opened Nov 7, 2025 by furrysalamander Loading…
convert: (demo) repacking compressed_tensor format of kimi-k2 python python script changes
#17083 opened Nov 7, 2025 by ngxson Draft
ProTip! Add no:assignee to see everything that’s not assigned.