
llama : print size and type of overridden tensors #13364

Merged
1 commit merged into master on May 8, 2025
Conversation

@slaren (Member) commented May 7, 2025

Small QoL improvement. Example output:

tensor blk.10.exp_probs_b.bias (0 MiB f32) buffer type overridden to CPU
tensor blk.10.ffn_gate_exps.weight (700 MiB iq1_s) buffer type overridden to CPU
tensor blk.10.ffn_down_exps.weight (700 MiB iq1_s) buffer type overridden to CPU
tensor blk.10.ffn_up_exps.weight (700 MiB iq1_s) buffer type overridden to CPU
tensor blk.10.ffn_gate_shexp.weight (9 MiB q5_K) buffer type overridden to CPU
tensor blk.10.ffn_down_shexp.weight (11 MiB q6_K) buffer type overridden to CPU
tensor blk.10.ffn_up_shexp.weight (9 MiB q5_K) buffer type overridden to CPU

@slaren slaren requested a review from Copilot May 7, 2025 19:18

@Copilot Copilot AI left a comment

Pull Request Overview

This PR enhances the debugging output for overridden tensor buffer types by including the tensor’s memory size (in MiB) and data type in the log message. It also corrects the spelling of "overridden" in the log output.

  • Enhanced debug logging for better visibility into tensor properties.
  • Fixed minor spelling error in the log messages.
Comments suppressed due to low confidence (1)

src/llama-model.cpp:1655

  • The updated debug log message improves clarity by including additional tensor details. Consider verifying that the integer division used for calculating the tensor size meets your precision requirements, or switch to floating point arithmetic if a more precise value is desired.
LLAMA_LOG_DEBUG("tensor %s (%zu MiB %s) buffer type overridden to %s\n", tensor_name.c_str(), ggml_nbytes(t_meta) / 1024 / 1024, ggml_type_name(t_meta->type), ggml_backend_buft_name(buft));

@slaren slaren merged commit f061021 into master May 8, 2025
46 checks passed
@slaren slaren deleted the sl/ot-tensor-size branch May 8, 2025 11:15
@ddh0 (Contributor) commented May 9, 2025

yayy thank you!
