[LoRA] ensure different LoRA ranks for text encoders can be properly handled #4669
Conversation
The documentation is not available anymore as the PR was closed or merged.
@@ -1344,7 +1352,7 @@ def _modify_text_encoder(
     text_encoder,
     lora_scale=1,
     network_alphas=None,
-    rank=4,
+    rank: Union[Dict[str, int], int] = 4,
To maintain backward compatibility in our training scripts.
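As a quick illustration of why the widened type stays backward compatible, here is a minimal, hypothetical sketch (resolve_rank is not a diffusers function and the key names are made up) of how an argument typed Union[Dict[str, int], int] can serve both old int-only callers and new per-module callers:

```python
from typing import Dict, Union


def resolve_rank(rank: Union[Dict[str, int], int], key: str, default: int = 4) -> int:
    """Return the LoRA rank to use for a given module key.

    An int means "one shared rank for every module" (the old behaviour);
    a dict maps module-specific keys to their own ranks.
    """
    if isinstance(rank, dict):
        return rank.get(key, default)
    return rank


# Old-style call sites that pass a plain int keep working unchanged:
assert resolve_rank(4, "text_model.encoder.layers.0.self_attn.out_proj") == 4

# New-style call sites can assign a different rank per module:
ranks = {"text_model.encoder.layers.0.self_attn.out_proj": 8}
assert resolve_rank(ranks, "text_model.encoder.layers.0.self_attn.out_proj") == 8
```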
+            current_rank_fc1 = rank.pop(f"{name}.fc1.lora_linear_layer.up.weight")
+            current_rank_fc2 = rank.pop(f"{name}.fc2.lora_linear_layer.up.weight")
+
             mlp_module.fc1 = PatchedLoraProjection(
-                mlp_module.fc1, lora_scale, network_alpha=fc1_alpha, rank=rank, dtype=dtype
+                mlp_module.fc1, lora_scale, network_alpha=fc1_alpha, rank=current_rank_fc1, dtype=dtype
             )
             lora_parameters.extend(mlp_module.fc1.lora_linear_layer.parameters())

             mlp_module.fc2 = PatchedLoraProjection(
-                mlp_module.fc2, lora_scale, network_alpha=fc2_alpha, rank=rank, dtype=dtype
+                mlp_module.fc2, lora_scale, network_alpha=fc2_alpha, rank=current_rank_fc2, dtype=dtype
We never allow patching the MLP from our training scripts, so this should be okay.
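To make the mechanics above concrete, here is a standalone sketch with a toy LoRA layer (ToyLoraLinearLayer stands in for diffusers' PatchedLoraProjection / LoRALinearLayer; the key names and feature sizes are illustrative): each MLP projection pops its own rank out of the dict instead of sharing a single int.

```python
import torch.nn as nn


class ToyLoraLinearLayer(nn.Module):
    """Toy LoRA layer: a low-rank down/up projection pair of a given rank."""

    def __init__(self, in_features: int, out_features: int, rank: int):
        super().__init__()
        self.down = nn.Linear(in_features, rank, bias=False)
        self.up = nn.Linear(rank, out_features, bias=False)

    def forward(self, hidden_states):
        return self.up(self.down(hidden_states))


# Hypothetical per-module rank dict, keyed the same way as in the diff above.
rank = {
    "text_model.encoder.layers.0.mlp.fc1.lora_linear_layer.up.weight": 8,
    "text_model.encoder.layers.0.mlp.fc2.lora_linear_layer.up.weight": 16,
}

name = "text_model.encoder.layers.0.mlp"
current_rank_fc1 = rank.pop(f"{name}.fc1.lora_linear_layer.up.weight")
current_rank_fc2 = rank.pop(f"{name}.fc2.lora_linear_layer.up.weight")

# fc1 and fc2 of a CLIP-style MLP have different shapes, and with this change
# they can also carry different LoRA ranks.
lora_fc1 = ToyLoraLinearLayer(768, 3072, rank=current_rank_fc1)
lora_fc2 = ToyLoraLinearLayer(3072, 768, rank=current_rank_fc2)
assert lora_fc1.up.in_features == 8 and lora_fc2.up.in_features == 16
```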
-            rank = text_encoder_lora_state_dict[
-                "text_model.encoder.layers.0.self_attn.out_proj.lora_linear_layer.up.weight"
-            ].shape[1]
+            for name, _ in text_encoder_attn_modules(text_encoder):
+                rank_key = f"{name}.out_proj.lora_linear_layer.up.weight"
+                rank.update({rank_key: text_encoder_lora_state_dict[rank_key].shape[1]})

             patch_mlp = any(".mlp." in key for key in text_encoder_lora_state_dict.keys())
+            if patch_mlp:
+                for name, _ in text_encoder_mlp_modules(text_encoder):
+                    rank_key_fc1 = f"{name}.fc1.lora_linear_layer.up.weight"
+                    rank_key_fc2 = f"{name}.fc2.lora_linear_layer.up.weight"
+                    rank.update({rank_key_fc1: text_encoder_lora_state_dict[rank_key_fc1].shape[1]})
+                    rank.update({rank_key_fc2: text_encoder_lora_state_dict[rank_key_fc2].shape[1]})
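The ranks themselves are read off the second dimension of each LoRA up-weight. Below is a self-contained sketch with dummy tensors (the real loader walks text_encoder_attn_modules / text_encoder_mlp_modules rather than scanning key suffixes, so treat this only as an illustration of where the numbers come from):

```python
import torch

# Dummy LoRA state dict: up-weights have shape (out_features, rank).
text_encoder_lora_state_dict = {
    "text_model.encoder.layers.0.self_attn.out_proj.lora_linear_layer.up.weight": torch.zeros(768, 8),
    "text_model.encoder.layers.0.mlp.fc1.lora_linear_layer.up.weight": torch.zeros(3072, 16),
}

rank = {}
for key, weight in text_encoder_lora_state_dict.items():
    if key.endswith(".lora_linear_layer.up.weight"):
        # The rank is the second dimension of the up projection weight.
        rank[key] = weight.shape[1]

patch_mlp = any(".mlp." in key for key in text_encoder_lora_state_dict)

print(rank)       # out_proj layer -> 8, fc1 layer -> 16
print(patch_mlp)  # True
```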
Would it instead be possible to use _register_state_dict_pre_hook on LoRALinearLayer so they can look at the incoming weights when the state dict is loaded and change the internal weights to the appropriate shape? This allows us to treat the state dict more transparently and avoid having to construct a rank dict by looking at strings in the passed-in state dict.
> This allows us to treat the state dict more transparently and avoid having to construct a rank dict by looking at strings in the passed-in state dict.

_register_state_dict_pre_hook will also need to look at the state dicts if we were to retrieve the ranks, no? What you're suggesting isn't clear to me, so I need some elaboration.
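For readers following along, here is a rough sketch of what the hook-based alternative could look like on a toy LoRA layer, using PyTorch's private _register_load_state_dict_pre_hook (the load-time counterpart of the hook named above). This is only an illustration of the idea under discussion, not diffusers code, and it is not the approach that was merged.

```python
import torch.nn as nn


class ToyLoRALinearLayer(nn.Module):
    """Toy LoRA layer that resizes itself to the rank carried by an incoming state dict."""

    def __init__(self, in_features: int, out_features: int, rank: int = 4):
        super().__init__()
        self.down = nn.Linear(in_features, rank, bias=False)
        self.up = nn.Linear(rank, out_features, bias=False)
        # Runs right before this module's tensors are copied in load_state_dict().
        self._register_load_state_dict_pre_hook(self._resize_to_incoming_rank)

    def _resize_to_incoming_rank(self, state_dict, prefix, *args):
        up_key = prefix + "up.weight"
        if up_key in state_dict:
            incoming_rank = state_dict[up_key].shape[1]
            if incoming_rank != self.up.in_features:
                # Recreate the projections with the rank found in the checkpoint.
                self.down = nn.Linear(self.down.in_features, incoming_rank, bias=False)
                self.up = nn.Linear(incoming_rank, self.up.out_features, bias=False)

    def forward(self, hidden_states):
        return self.up(self.down(hidden_states))


layer = ToyLoRALinearLayer(768, 768, rank=4)
donor = ToyLoRALinearLayer(768, 768, rank=16)
layer.load_state_dict(donor.state_dict())  # rank is adjusted before weights are copied
assert layer.up.in_features == 16
```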
Merging after internal discussions with Will on Slack.
[LoRA] ensure different LoRA ranks for text encoders can be properly handled (huggingface#4669)

* debugging starts
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging ends, but does it?
* more robustness.
Internal thread: https://huggingface.slack.com/archives/C03UQJENJTV/p1692343210344049