1 file changed, +2 −2 lines changed

```diff
@@ -442,8 +442,8 @@ void llama_kv_cache_unified::defrag_sched(float thold) {
 void llama_kv_cache_unified::set_full() {
     n = size;
 
-    // when simulating a full KV cache, the specific value of the "head" pointer is not important because we are not
-    // going to write any data - we just want to measure the memory needed by the graph in such state.
+    // when simulating a full KV cache, the specific value of the "head" pointer is not important because it does not
+    // affect the shapes of the tensors in the compute graph - it only affects the offsets of the K/V views.
     // we should only guarantee that the head position won't cause out-of-bounds view of the K, V tensors, so
     // setting it to 0 is the simplest way to achieve that
     // ref: https://github.com/ggml-org/llama.cpp/issues/13359
```
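To make the reasoning in the updated comment concrete, here is a small, self-contained C++ sketch of the idea. It is not llama.cpp/ggml code: `ToyKVCache`, `make_k_view`, and the offset arithmetic are simplified assumptions used only to illustrate that the view's shape depends on `n` alone, while `head` only shifts the view's offset, so pinning `head` to 0 when `n = size` keeps the view inside the backing buffer.

```cpp
// Toy model of the K-cache view logic described above.
// NOT llama.cpp code; names and layout are simplified assumptions.
#include <cassert>
#include <cstdio>

struct KView {
    int    n_embd;   // elements per cell (part of the shape)
    int    n;        // number of cells covered by the view (part of the shape)
    size_t offset;   // element offset into the backing buffer
};

struct ToyKVCache {
    int size;        // total number of cells in the cache
    int n_embd;      // embedding size per cell
    int head;        // first cell of the current slot
    int n;           // number of cells the compute graph will see

    // The view's shape depends only on n_embd and n; head only moves the offset.
    KView make_k_view() const {
        return { n_embd, n, (size_t) head * (size_t) n_embd };
    }

    // In bounds iff offset + n*n_embd <= size*n_embd, i.e. head + n <= size.
    bool view_in_bounds(const KView & v) const {
        return v.offset + (size_t) v.n * (size_t) v.n_embd
               <= (size_t) size * (size_t) n_embd;
    }

    // Simulate a full cache for worst-case graph measurement, mirroring set_full():
    // n = size gives the view its largest shape, head = 0 keeps it in bounds.
    void set_full() {
        n    = size;
        head = 0;
    }
};

int main() {
    ToyKVCache cache { /*size=*/8, /*n_embd=*/4, /*head=*/5, /*n=*/2 };

    cache.set_full();
    KView v = cache.make_k_view();

    // The view spans the whole cache with offset 0, so no out-of-bounds access.
    std::printf("view: n = %d, offset = %zu, in_bounds = %d\n",
                v.n, v.offset, (int) cache.view_in_bounds(v));
    assert(cache.view_in_bounds(v));

    // With head != 0 and n == size, the same-shaped view would run past the
    // end of the buffer, which is why set_full() sets head to 0.
    ToyKVCache bad = cache;
    bad.head = 3;
    assert(!bad.view_in_bounds(bad.make_k_view()));
    return 0;
}
```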