vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 #17147

SavicStefan · 2025-11-10T14:33:08Z

For q2_K, added the ACC_TYPE_VEC2 implementation, as seen in PR #16203.

Before (master) NVIDIA GeForce RTX 4060 Ti:
MUL_MAT(type_a=q2_K,type_b=f32,m=4096,n=512,k=14336,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1): 190 runs - 5292.84 us/run - 60.13 GFLOP/run - 11.36 TFLOPS

After (PR) NVIDIA GeForce RTX 4060 Ti:
MUL_MAT(type_a=q2_K,type_b=f32,m=4096,n=512,k=14336,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1):                258 runs -  3887.86 us/run -  60.13 GFLOP/run -  15.47 TFLOPS

Which is around +26% peformance increase on us/run.
I also need to try for other types.

Signed-off-by: Stefan Savic <[email protected]>

Vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2

0d26397

Signed-off-by: Stefan Savic <[email protected]>

SavicStefan requested a review from 0cc4m as a code owner November 10, 2025 14:33

DajanaV mentioned this pull request Nov 10, 2025

UPSTREAM PR #17147: Vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 auroralabs-loci/llama.cpp#158

Open

SavicStefan changed the title ~~Vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2~~ vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 Nov 10, 2025

github-actions bot added Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Nov 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 #17147

vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 #17147

SavicStefan commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 #17147

Are you sure you want to change the base?

vulkan: add q2_K implementation in mul_mmq with ACC_TYPE_VEC2 #17147

Conversation

SavicStefan commented Nov 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant