ENH: Improve speed of function expanding LoRA scales #11834

BenjaminBossan · 2025-06-30T14:17:57Z

What does this PR do?

Resolves #11816

The following call proved to be a bottleneck when setting a lot of LoRA adapters in diffusers:

diffusers/src/diffusers/loaders/peft.py

Line 482 in cdaf84a

weights = scale_expansion_fn(self, weights)

This is because we would repeatedly call unet.state_dict(), even though in the standard case, it is not necessary:

diffusers/src/diffusers/loaders/unet_loader_utils.py

Line 55 in cdaf84a

unet.state_dict(),

This PR fixes this by deferring this call, so that it is only run when it's necessary, not earlier.

Note: This PR doesn't change the fact that set_adapters becomes slower the more adapters are already loaded, but since it speeds up the whole process by a factor of approximately 10x, set_adapters is much less of a bottleneck.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Resolves huggingface#11816 The following call proved to be a bottleneck when setting a lot of LoRA adapters in diffusers: https://github.com/huggingface/diffusers/blob/cdaf84a708eadf17d731657f4be3fa39d09a12c0/src/diffusers/loaders/peft.py#L482 This is because we would repeatedly call unet.state_dict(), even though in the standard case, it is not necessary: https://github.com/huggingface/diffusers/blob/cdaf84a708eadf17d731657f4be3fa39d09a12c0/src/diffusers/loaders/unet_loader_utils.py#L55 This PR fixes this by deferring this call, so that it is only run when it's necessary, not earlier.

HuggingFaceDocBuilderDev · 2025-06-30T14:24:47Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul

Thanks, LGTM!

Concerned tests:

diffusers/tests/lora/utils.py

Line 1108 in 05e7a85

def test_simple_inference_with_text_denoiser_multi_adapter_block_lora(self):
diffusers/tests/lora/utils.py

Line 1182 in 05e7a85

def test_simple_inference_with_text_denoiser_block_scale_for_all_dict_options(self):

sayakpaul · 2025-06-30T14:40:32Z

src/diffusers/loaders/unet_loader_utils.py

@@ -52,7 +54,7 @@ def _maybe_expand_lora_scales(
            weight_for_adapter,
            blocks_with_transformer,
            transformer_per_block,
-            unet.state_dict(),


Since this is a private function, this should be more than okay to break

sayakpaul · 2025-06-30T14:56:05Z

Thanks for this contribution!

* ENH Improve speed of expanding LoRA scales Resolves huggingface#11816 The following call proved to be a bottleneck when setting a lot of LoRA adapters in diffusers: https://github.com/huggingface/diffusers/blob/cdaf84a708eadf17d731657f4be3fa39d09a12c0/src/diffusers/loaders/peft.py#L482 This is because we would repeatedly call unet.state_dict(), even though in the standard case, it is not necessary: https://github.com/huggingface/diffusers/blob/cdaf84a708eadf17d731657f4be3fa39d09a12c0/src/diffusers/loaders/unet_loader_utils.py#L55 This PR fixes this by deferring this call, so that it is only run when it's necessary, not earlier. * Small fix --------- Co-authored-by: Sayak Paul <[email protected]>

BenjaminBossan added 2 commits June 30, 2025 16:14

Small fix

8bebb06

BenjaminBossan changed the title ~~Enh improve speed expand lora scales~~ ENH: Improve speed of function expanding LoRA scales Jun 30, 2025

BenjaminBossan requested a review from sayakpaul June 30, 2025 14:33

sayakpaul approved these changes Jun 30, 2025

View reviewed changes

Merge branch 'main' into enh-improve-speed-expand-lora-scales

3a8edeb

sayakpaul merged commit 3b079ec into huggingface:main Jun 30, 2025
11 checks passed

BenjaminBossan mentioned this pull request Jul 1, 2025

set_adapters performance degrades with the number of inactive adapters #11816

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ENH: Improve speed of function expanding LoRA scales #11834

ENH: Improve speed of function expanding LoRA scales #11834

Uh oh!

BenjaminBossan commented Jun 30, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Jun 30, 2025

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul Jun 30, 2025

Uh oh!

Uh oh!

sayakpaul commented Jun 30, 2025

Uh oh!

Uh oh!

ENH: Improve speed of function expanding LoRA scales #11834

ENH: Improve speed of function expanding LoRA scales #11834

Uh oh!

Conversation

BenjaminBossan commented Jun 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Uh oh!

HuggingFaceDocBuilderDev commented Jun 30, 2025

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul Jun 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sayakpaul commented Jun 30, 2025

Uh oh!

Uh oh!

BenjaminBossan commented Jun 30, 2025 •

edited

Loading