[Neuron] Add multi-LoRA support for Neuron. #18284
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. 🚀
This pull request has merge conflicts that must be resolved before it can be merged.
Force-pushed from da77995 to 3230f31 (Compare)
This pull request has merge conflicts that must be resolved before it can be merged.
Force-pushed from 3230f31 to 54cbd07 (Compare)
Thanks for contributing @aws-satyajith. The changes look good to me. It would be better if we could support dynamic LoRA loading, so that we don't need to pass lora-modules as an argument.
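For context, the static flow this comment refers to looks roughly like the sketch below: adapters are registered when the engine is constructed, so their paths have to be supplied up front, which is what dynamic loading would avoid. The base model name, adapter path, and device="neuron" setting are placeholders for illustration, not values taken from this PR.

```python
# Minimal sketch of static multi-LoRA usage (placeholder names and paths).
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(
    model="meta-llama/Llama-2-7b-hf",  # placeholder base model
    enable_lora=True,
    max_loras=2,
    device="neuron",                   # assumes a Neuron-enabled build of vLLM
)

# The adapter path is fixed at request time; with dynamic loading, new adapters
# could be attached to a running engine instead of being known at startup.
out = llm.generate(
    "Write a haiku about accelerators.",
    SamplingParams(max_tokens=64),
    lora_request=LoRARequest("sql-adapter", 1, "/path/to/sql_lora"),  # placeholder path
)
print(out[0].outputs[0].text)
```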
@WoosukKwon @simon-mo could we get a quick review on this PR so we can merge before the V0 freeze in 0.9.0? 🙏
This pull request has merge conflicts that must be resolved before it can be merged.
Signed-off-by: Satyajith Chilappagari <[email protected]>
Force-pushed from 54cbd07 to 0790f87 (Compare)
vllm/engine/arg_utils.py (outdated):

    max_lora_rank: int = LoRAConfig.max_lora_rank
    fully_sharded_loras: bool = LoRAConfig.fully_sharded_loras
    max_cpu_loras: Optional[int] = LoRAConfig.max_cpu_loras
    lora_modules: Optional[LoRAModulePath] = None
Can we use override_neuron_config to include the LoRA modules, instead of adding an additional argument lora_modules?
Makes sense. Since the regular usage diverges from dynamic loading anyway, I think moving lora_modules into override_neuron_config is doable. I'll make the change and publish a new revision.
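A hypothetical sketch of what that could look like, assuming the Neuron override dictionary grows a lora_modules entry; the key name and structure here are illustrative only, not the merged interface.

```python
# Hypothetical: folding adapter paths into override_neuron_config instead of a
# separate lora_modules engine argument. Model name and paths are placeholders.
from vllm import LLM

llm = LLM(
    model="meta-llama/Llama-2-7b-hf",   # placeholder base model
    device="neuron",                    # assumes a Neuron-enabled build of vLLM
    enable_lora=True,
    override_neuron_config={
        # Assumed key: map adapter names to checkpoint paths so the Neuron
        # model builder can pick them up at compile time.
        "lora_modules": {
            "sql-adapter": "/path/to/sql_lora",
            "chat-adapter": "/path/to/chat_lora",
        },
    },
)
```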
Considering that most of the changes in this PR are within Neuron, after addressing the comments above, overall LGTM.
Signed-off-by: Satyajith Chilappagari <[email protected]>
Force-pushed from 73a0573 to fefba85 (Compare)
@jeejeelee @simon-mo it seems like all checks are complete; we know that entrypoints has been flaky recently.
This is an automatic vacation reply from QQ Mail.
Hello, I am currently on vacation and unable to reply to your email in person. I will get back to you as soon as possible after my vacation ends.
Signed-off-by: Satyajith Chilappagari <[email protected]>
Signed-off-by: amit <[email protected]>
Add multi-LoRA support for Neuron.
RFC: #15970
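As a rough illustration of the feature (not code from this PR), a multi-adapter workload might look like the sketch below; the base model, adapter names, paths, and device="neuron" setting are assumptions for illustration.

```python
# Sketch: different requests targeting different LoRA adapters on the same engine.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(
    model="meta-llama/Llama-2-7b-hf",  # placeholder base model
    enable_lora=True,
    max_loras=2,
    device="neuron",                   # assumes a Neuron-enabled build of vLLM
)

params = SamplingParams(max_tokens=32)
adapters = [
    LoRARequest("sql-adapter", 1, "/path/to/sql_lora"),    # placeholder path
    LoRARequest("chat-adapter", 2, "/path/to/chat_lora"),  # placeholder path
]

# Each prompt is served with its own adapter by the same engine instance.
for prompt, lora in zip(["Translate to SQL: list all users", "Say hello"], adapters):
    out = llm.generate(prompt, params, lora_request=lora)
    print(lora.lora_name, "->", out[0].outputs[0].text)
```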