[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache. #27144

sighingnow · 2025-10-18T06:11:51Z

Requires the FlashMLA patch vllm-project/FlashMLA#7 to land first.

Signed-off-by: Tao He <[email protected]>

gemini-code-assist

Code Review

This pull request fixes an issue with decoding metadata for dense MLA's FP8 K/V cache by introducing a specialized operator. The changes in the Python code correctly route the execution to this new operator when appropriate. However, there is a critical issue in the CMake configuration where the flashmla dependency is pointed to a personal fork. This practice introduces significant risks and should be rectified by merging the required changes into the official upstream repository and updating the commit hash accordingly.

gemini-code-assist · 2025-10-18T06:12:25Z

cmake/external_projects/flashmla.cmake

+        GIT_REPOSITORY https://github.com/sighingnow/FlashMLA
+        GIT_TAG 7af725e6c2a3f0262e5b8573c715411a6d895cae


Pointing the GIT_REPOSITORY to a personal fork (sighingnow/FlashMLA) introduces a significant dependency risk. For project stability, security, and long-term maintainability, dependencies should point to official repositories. The required changes should be merged into the official vllm-project/FlashMLA repository first. Afterward, this pull request can be updated to use the new commit hash from the official repository.

        GIT_REPOSITORY https://github.com/vllm-project/FlashMLA
        GIT_TAG <new_commit_hash_from_official_repo>

[Bugfix] fixes the decoding metadata of dense mla's fp8 kvcache.

30a5293

Signed-off-by: Tao He <[email protected]>

sighingnow requested a review from LucasWilkinson as a code owner October 18, 2025 06:11

gemini-code-assist bot reviewed Oct 18, 2025

mergify bot added ci/build v1 labels Oct 18, 2025
