Conversation

WoosukKwon (Collaborator)

No description provided.

@mergify bot added the `qwen` (Related to Qwen models) label on Sep 11, 2025.
@gemini-code-assist bot (Contributor) left a comment:

Code Review

This pull request adds a new configuration file for the fused Mixture-of-Experts (MoE) kernel, tuned for the NVIDIA H200 GPU. The configuration targets a model with 512 experts and a sharded intermediate size of 64, which, per the pull request title, is likely the Qwen3-Next model. The JSON file maps batch sizes to tuned kernel parameters; its structure is consistent with the existing configurations, and the values appear to come from a tuning run. The change is straightforward, and I found no issues.
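
For context, vLLM's fused MoE tuning configs are JSON files whose top-level keys are batch sizes (M) and whose values are Triton launch parameters. The sketch below shows the expected shape as a Python dict; the field names are the standard ones used by these config files, but the numeric values and the `example_h200_config` name are placeholders, not the tuned numbers added by this PR.

```python
# Illustrative sketch only: keys follow vLLM's fused MoE config format,
# but the values here are placeholders, not the tuned values from this PR.
example_h200_config = {
    "1": {  # batch size M = 1
        "BLOCK_SIZE_M": 16,
        "BLOCK_SIZE_N": 64,
        "BLOCK_SIZE_K": 128,
        "GROUP_SIZE_M": 1,
        "num_warps": 4,
        "num_stages": 3,
    },
    "256": {  # batch size M = 256
        "BLOCK_SIZE_M": 64,
        "BLOCK_SIZE_N": 128,
        "BLOCK_SIZE_K": 64,
        "GROUP_SIZE_M": 16,
        "num_warps": 8,
        "num_stages": 4,
    },
}
```

At runtime, vLLM looks up the entry whose batch size is closest to the current M and launches the fused MoE kernel with those parameters, which is why the file enumerates many batch sizes.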

@simon-mo merged commit c733bd5 into main on Sep 11, 2025 (9 of 13 checks passed).
@simon-mo deleted the woosuk/qwen3-next-h200 branch on September 11, 2025 at 19:40.
@WoosukKwon (Collaborator, Author) commented:

@simon-mo Actually, this only includes the config for TP=8. I will add the TP=1, 2, and 4 configs shortly.
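
As a hedged aside on what those per-TP files would be named: vLLM's fused MoE config filenames follow the pattern `E={E},N={N},device_name={device}.json`, where N is the per-GPU shard of the MoE intermediate size. Assuming Qwen3-Next's moe_intermediate_size is 512 (inferred from N=64 at TP=8 in the review above, not stated in this PR), the expected filenames are:

```python
# Sketch under assumptions: E = 512 experts is stated in the review above;
# MOE_INTERMEDIATE_SIZE = 512 is inferred from N = 64 at TP = 8 and is an
# assumption, not a value confirmed by this PR.
NUM_EXPERTS = 512
MOE_INTERMEDIATE_SIZE = 512  # assumed: 64 * 8

for tp in (1, 2, 4, 8):
    n = MOE_INTERMEDIATE_SIZE // tp  # per-GPU shard of the intermediate size
    print(f"E={NUM_EXPERTS},N={n},device_name=NVIDIA_H200.json")
# TP=8 prints E=512,N=64,device_name=NVIDIA_H200.json, matching this PR.
```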

skyloevil pushed a commit to skyloevil/vllm that referenced this pull request Sep 13, 2025
dsxsteven pushed a commit to dsxsteven/vllm_splitPR that referenced this pull request Sep 15, 2025
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025