Disable torch.compile for dynamic rope models in Transformers backend #23738
Conversation
… backend Signed-off-by: Harry Mellor <[email protected]>
… rope Signed-off-by: Harry Mellor <[email protected]>
Code Review
This pull request adds support for the InternVLForConditionalGeneration architecture and disables torch.compile for models with dynamic rope scaling. The changes for the new model support are correct. However, I've identified a critical bug in the implementation for disabling torch.compile where a potential AttributeError can occur. Please see my comment for the suggested fix.
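For context, here is a minimal sketch of the kind of guard such a fix typically involves, assuming the check reads `rope_scaling` from the Hugging Face config; the helper name `uses_dynamic_rope` is illustrative, not the PR's actual code:

```python
# Illustrative sketch, not the PR's actual code: detect dynamic rope scaling
# in a Hugging Face config without risking an AttributeError when the
# attribute is absent.
from transformers import PretrainedConfig


def uses_dynamic_rope(config: PretrainedConfig) -> bool:
    # Not every config defines `rope_scaling`, so use getattr with a default
    # instead of accessing the attribute directly.
    rope_scaling = getattr(config, "rope_scaling", None)
    if not rope_scaling:
        return False
    # HF configs store the scaling strategy under "rope_type" (newer) or
    # "type" (older); "dynamic" marks dynamic NTK scaling.
    rope_type = rope_scaling.get("rope_type", rope_scaling.get("type"))
    return rope_type == "dynamic"
```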
Signed-off-by: Harry Mellor <[email protected]>
tests/models/registry.py
Outdated
"3.5-qwen3moe": "OpenGVLab/InternVL3_5-30B-A3B", # noqa: E501 | ||
"3.5-gptoss": "OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview"}, # noqa: E501 | ||
trust_remote_code=True), | ||
"InternVLForConditionalGeneration": _HfExamplesInfo("OpenGVLab/InternVL3-1B-hf"), # noqa: E501 |
I plan to enable native support through the existing InternS1ForConditionalGeneration with video input support in #23742, because their implementations are exactly the same.
Could we add video support to the Transformers backend instead 👀
Long term, if it's not necessary to reimplement a model in vLLM, we should avoid doing so.
Could we add video support to the Transformers backend instead 👀
I think this needs extra refactoring on the Transformers side first, because IIRC we previously only added a helper function in the processor for images.
Long term, if it's not necessary to reimplement a model in vLLM we should avoid it if possible
Yes, but since InternS1ForConditionalGeneration is already here, we can simply reuse it with video support, and there is no need to disable torch.compile:
"InternVLForConditionalGeneration": ("interns1", "InternS1ForConditionalGeneration")
Yes, but since InternS1ForConditionalGeneration is already here, we can reuse it simply with video support
For now I will remove the InternVL stuff from my PR
no need to disable torch.compile
InternVLChatModel and InternS1ForConditionalGeneration do not support torch.compile in vLLM. Neither of them is decorated with @support_torch_compile.
InternVLChatModel and InternS1ForConditionalGeneration do not support torch.compile in vLLM. Neither of them is decorated with @support_torch_compile.
No, we only compile the text backbone for VLMs (no ViT compilation yet); @support_torch_compile only takes effect at the language model level. And Qwen2 and Qwen3MoE (without MRoPE), which these models use, already support @support_torch_compile.
(There are no VLM implementations wrapped by @support_torch_compile, because we only added it to the backbone.)
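To illustrate the structure being described, here is a minimal, self-contained sketch with a stand-in decorator; the real one is vLLM's support_torch_compile, and the classes below are placeholders, not vLLM's actual models:

```python
# Illustrative sketch: only the language backbone carries the decorator, so
# only it (and its submodules) would be compiled; the VLM wrapper composes it
# without being decorated itself.
import torch
import torch.nn as nn


def support_torch_compile(cls):
    # Stand-in for vLLM's decorator: the real one arranges for the module's
    # forward pass to be compiled; here we just mark the class.
    cls._compiled_backbone = True
    return cls


@support_torch_compile
class LanguageBackbone(nn.Module):
    # Only this module (and its submodules) would be compiled.
    def __init__(self, hidden: int = 64):
        super().__init__()
        self.layers = nn.Sequential(nn.Linear(hidden, hidden), nn.GELU())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.layers(x)


class VisionLanguageModel(nn.Module):
    # The VLM wrapper itself is not decorated, so the vision tower stays eager.
    def __init__(self, hidden: int = 64):
        super().__init__()
        self.vision_tower = nn.Linear(hidden, hidden)   # runs eagerly
        self.language_model = LanguageBackbone(hidden)  # the compiled backbone

    def forward(self, pixels: torch.Tensor) -> torch.Tensor:
        return self.language_model(self.vision_tower(pixels))
```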
Oh, I wasn't aware that's how it worked.
TransformersForMultimodalLM is wrapped with @support_torch_compile and it seems to work fine. In TransformersForMultimodalLM there is no separate backbone to decorate, as we use inheritance to add MM support to TransformersForCausalLM. If we removed @support_torch_compile from TransformersForMultimodalLM, would the @support_torch_compile on TransformersForCausalLM do anything?
Yes, torch.compile will be limited to the modules that have the decorator (and their submodules).
TransformersForMultimodalLM is a subclass of TransformersForCausalLM, not a submodule. Do you know how @support_torch_compile interacts with inheritance?
I think torch.compile on the base class is also applied to subclasses. Maybe @youkaichao can confirm this.
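As a plain-Python illustration of how a class decorator interacts with inheritance (not vLLM-specific, and not how support_torch_compile is actually implemented): a decorator that rewrites a method on the base class is inherited by subclasses unless they override that method.

```python
# Illustrative sketch of class-decorator inheritance semantics in plain Python.
import functools


def mark_compiled(cls):
    # Wrap `forward` on the decorated class; subclasses that do not override
    # `forward` inherit the wrapped version.
    original = cls.forward

    @functools.wraps(original)
    def wrapped(self, *args, **kwargs):
        print(f"compiled path for {type(self).__name__}")
        return original(self, *args, **kwargs)

    cls.forward = wrapped
    return cls


@mark_compiled
class Base:
    def forward(self, x):
        return x + 1


class Child(Base):
    # No `forward` override, so the wrapped Base.forward is inherited.
    pass


class ChildWithOverride(Base):
    def forward(self, x):
        # Overriding `forward` bypasses the wrapper added on Base.
        return x + 2


Base().forward(1)               # prints "compiled path for Base"
Child().forward(1)              # prints "compiled path for Child"
ChildWithOverride().forward(1)  # prints nothing: wrapper bypassed
```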
Title changed: "InternVLForConditionalGeneration and disable torch.compile for dynamic rope models" → "Disable torch.compile for dynamic rope models in Transformers backend"
Signed-off-by: Harry Mellor <[email protected]>
…nd (vllm-project#23738) Signed-off-by: Harry Mellor <[email protected]>
Fix the Transformers backend torch.compile support by disabling it when models with dynamic rope scaling are loaded.
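A minimal sketch of the guard this summary describes, assuming the backend can check the loaded config (for example with a dynamic-rope check like the one sketched earlier) before deciding whether to compile; the names here are illustrative, not the actual vLLM code:

```python
# Illustrative sketch, not vLLM's actual code: compile the module only when
# the loaded model does not use dynamic rope scaling.
import torch
import torch.nn as nn


def maybe_compile(module: nn.Module, uses_dynamic_rope: bool) -> nn.Module:
    if uses_dynamic_rope:
        # Dynamic rope recomputes its rotary frequencies as sequence length
        # grows, which would force graph breaks or repeated recompilation,
        # so the module is left in eager mode.
        return module
    return torch.compile(module)
```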