CUDA: add bf16 and i32 to getrows #14529

am17an · 2025-07-04T08:53:07Z

Just add the missing case statements

* origin/master: CUDA: add bf16 and i32 to getrows (ggml-org#14529) vulkan: increase LOAD_VEC_A to 8 (IQ1/IQ2) or 4 (IQ3) (ggml-org#14485) vulkan: fix rms_norm+mul fusion (ggml-org#14545) vulkan: Handle updated FA dim2/3 definition (ggml-org#14518) server : fix assistant prefilling when content is an array (ggml-org#14360) opencl: add GELU_ERF (ggml-org#14476) eval-callback : check for empty input (ggml-org#14539) test-backend-ops: add support for specifying output format (ggml-org#14368) metal : disable fast math in all quantize kernels (ggml-org#14528) batch : add optional for sequential equal split (ggml-org#14511) graph : prepare for 4D mask (ggml-org#14515) batch : add n_used count (ggml-org#14512) CANN: Replace aclrtMemsetSync with aclnnInplaceZero operator (ggml-org#14002) ggml : implement GEGLU_ERF and GEGLU_QUICK ops (ggml-org#14445)

CUDA: add bf16 and i32 to getrows

1700086

am17an requested a review from JohannesGaessler July 4, 2025 08:53

github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Jul 4, 2025

JohannesGaessler approved these changes Jul 7, 2025

View reviewed changes

am17an merged commit b9c3eef into ggml-org:master Jul 7, 2025
48 checks passed

am17an deleted the cuda_bf16_i32_get_rows branch July 7, 2025 13:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA: add bf16 and i32 to getrows #14529

CUDA: add bf16 and i32 to getrows #14529

Uh oh!

am17an commented Jul 4, 2025

Uh oh!

Uh oh!

Uh oh!

CUDA: add bf16 and i32 to getrows #14529

CUDA: add bf16 and i32 to getrows #14529

Uh oh!

Conversation

am17an commented Jul 4, 2025

Uh oh!

Uh oh!

Uh oh!