allow disable flashinfer prefill #25276

luccafong · 2025-09-19T18:26:44Z

Summary: GB200 FlashInfer Prefill is not compatible with CutlassMLA FP8, allowing disable it for now.

Differential Revision: D81994905

facebook-github-bot · 2025-09-19T18:27:02Z

@luccafong has exported this pull request. If you are a Meta employee, you can view the originating diff in D81994905.

gemini-code-assist

Code Review

This pull request introduces a new environment variable, VLLM_DISABLE_FLASHINFER_PREFILL, to provide an option to disable FlashInfer prefill. This change addresses a compatibility issue on GB200 with CutlassMLA FP8. The implementation adds the new flag in vllm/envs.py and correctly uses it in vllm/v1/attention/backends/mla/common.py to control the feature. The default behavior is unchanged. The changes are correct and well-contained.

Summary: GB200 FlashInfer Prefill is not compatible with CutlassMLA FP8, allowing disable it for now. Differential Revision: D81994905 Signed-off-by: Lu Fang <[email protected]>

mgoin · 2025-09-19T21:08:50Z

Seems reasonable for now, thanks

Signed-off-by: Lu Fang <[email protected]>

Signed-off-by: Lu Fang <[email protected]> Signed-off-by: charlifu <[email protected]>

Signed-off-by: Lu Fang <[email protected]> Signed-off-by: yewentao256 <[email protected]>

Signed-off-by: Lu Fang <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

Signed-off-by: Lu Fang <[email protected]>

luccafong requested review from WoosukKwon, alexm-redhat, comaniac, njhill, robertgshaw2-redhat and ywang96 as code owners September 19, 2025 18:26

mergify bot added the v1 label Sep 19, 2025

gemini-code-assist bot reviewed Sep 19, 2025

View reviewed changes

allow disable flashinfer prefill

95df807

Summary: GB200 FlashInfer Prefill is not compatible with CutlassMLA FP8, allowing disable it for now. Differential Revision: D81994905 Signed-off-by: Lu Fang <[email protected]>

luccafong force-pushed the export-D81994905 branch from cc162ce to 95df807 Compare September 19, 2025 20:49

luccafong requested a review from mgoin September 19, 2025 20:52

mgoin approved these changes Sep 19, 2025

View reviewed changes

mgoin enabled auto-merge (squash) September 19, 2025 21:08

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 19, 2025

mgoin merged commit ee7a66d into vllm-project:main Sep 19, 2025
52 checks passed

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

allow disable flashinfer prefill (vllm-project#25276)

ea96ea5

Signed-off-by: Lu Fang <[email protected]>

charlifu pushed a commit to ROCm/vllm that referenced this pull request Sep 25, 2025

allow disable flashinfer prefill (vllm-project#25276)

3abedfb

Signed-off-by: Lu Fang <[email protected]> Signed-off-by: charlifu <[email protected]>

yewentao256 pushed a commit that referenced this pull request Oct 3, 2025

allow disable flashinfer prefill (#25276)

5051270

Signed-off-by: Lu Fang <[email protected]> Signed-off-by: yewentao256 <[email protected]>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

allow disable flashinfer prefill (vllm-project#25276)

aa2e63a

Signed-off-by: Lu Fang <[email protected]> Signed-off-by: xuebwang-amd <[email protected]>

choprahetarth pushed a commit to Tandemn-Labs/vllm that referenced this pull request Oct 11, 2025

allow disable flashinfer prefill (vllm-project#25276)

0d5684f

Signed-off-by: Lu Fang <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

allow disable flashinfer prefill #25276

allow disable flashinfer prefill #25276

Uh oh!

luccafong commented Sep 19, 2025

Uh oh!

facebook-github-bot commented Sep 19, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

mgoin commented Sep 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

allow disable flashinfer prefill #25276

allow disable flashinfer prefill #25276

Uh oh!

Conversation

luccafong commented Sep 19, 2025

Uh oh!

facebook-github-bot commented Sep 19, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

mgoin commented Sep 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants