[moe][quant] add weight name case for offset #15515

MengqingCao · 2025-03-26T02:10:56Z

This PR adds weight name case for offset, thus making it compatible with some quantization tools that name zero-points as offsets, e.g., modelslim

github-actions · 2025-03-26T02:11:06Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

mgoin

Are you going to contribute an integration for the modelslim format? This seems fine to land, but it is a bit strange to have this fix without actually having the format integrated upstream

MengqingCao · 2025-03-27T01:44:21Z

Are you going to contribute an integration for the modelslim format? This seems fine to land, but it is a bit strange to have this fix without actually having the format integrated upstream

Acctually this is a fix for downstream vllm-ascend, we are integrating modelslim into vllm-ascend through vllm-project/vllm-ascend#391. And modelslim is a quant tool for Ascend NPU, I think we'd better make smallest change in vLLM upstream, thus only adding offset in weight_loader.

Signed-off-by: Mengqing Cao <[email protected]>

mgoin · 2025-03-27T02:06:40Z

I see, thanks for the context. LGTM then!

MengqingCao · 2025-03-27T02:11:12Z

I see, thanks for the context. LGTM then!

Thanks! 👍

Signed-off-by: Mengqing Cao <[email protected]> Signed-off-by: xinyuxiao <[email protected]>

Signed-off-by: Mengqing Cao <[email protected]> Signed-off-by: Louis Ulmer <[email protected]>

Signed-off-by: Mengqing Cao <[email protected]>

Signed-off-by: Mengqing Cao <[email protected]> Signed-off-by: Mu Huai <[email protected]>

jeejeelee requested a review from mgoin March 26, 2025 03:03

mgoin approved these changes Mar 26, 2025

View reviewed changes

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 26, 2025

[moe][quant] add weight name case for offset

6d980ea

Signed-off-by: Mengqing Cao <[email protected]>

MengqingCao force-pushed the offset branch from 6e43479 to 6d980ea Compare March 27, 2025 02:04

mgoin enabled auto-merge (squash) March 27, 2025 02:06

mgoin merged commit fb22be5 into vllm-project:main Mar 27, 2025
35 checks passed

MengqingCao deleted the offset branch March 27, 2025 06:21

MengqingCao mentioned this pull request Mar 27, 2025

feat: add w8a8_dynamic quant & support deepseek quant vllm-project/vllm-ascend#391

Merged

Alex4210987 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Apr 5, 2025

[moe][quant] add weight name case for offset (vllm-project#15515)

5b407ad

Signed-off-by: Mengqing Cao <[email protected]> Signed-off-by: xinyuxiao <[email protected]>

lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025

[moe][quant] add weight name case for offset (vllm-project#15515)

bf7b9ac

Signed-off-by: Mengqing Cao <[email protected]> Signed-off-by: Louis Ulmer <[email protected]>

ckhordiasma mentioned this pull request Apr 17, 2025

[do not merge] pr test for nm changes into 2.20 red-hat-data-services/vllm#107

Closed

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[moe][quant] add weight name case for offset (vllm-project#15515)

4380526

Signed-off-by: Mengqing Cao <[email protected]>

shreyankg pushed a commit to shreyankg/vllm that referenced this pull request May 3, 2025

[moe][quant] add weight name case for offset (vllm-project#15515)

9616132

Signed-off-by: Mengqing Cao <[email protected]>

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

[moe][quant] add weight name case for offset (vllm-project#15515)

1ca6d2f

Signed-off-by: Mengqing Cao <[email protected]> Signed-off-by: Mu Huai <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[moe][quant] add weight name case for offset #15515

[moe][quant] add weight name case for offset #15515

MengqingCao commented Mar 26, 2025

Uh oh!

github-actions bot commented Mar 26, 2025

Uh oh!

mgoin left a comment

Uh oh!

MengqingCao commented Mar 27, 2025

Uh oh!

mgoin commented Mar 27, 2025

Uh oh!

MengqingCao commented Mar 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[moe][quant] add weight name case for offset #15515

[moe][quant] add weight name case for offset #15515

Conversation

MengqingCao commented Mar 26, 2025

Uh oh!

github-actions bot commented Mar 26, 2025

Uh oh!

mgoin left a comment

Choose a reason for hiding this comment

Uh oh!

MengqingCao commented Mar 27, 2025

Uh oh!

mgoin commented Mar 27, 2025

Uh oh!

MengqingCao commented Mar 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants