
[SD 3.5 Dreambooth LoRA] support configurable training block & layers #9762


Merged
merged 25 commits into huggingface:main from sd-3-5-explorations on Oct 28, 2024

Conversation

@linoytsaban (Collaborator) commented on Oct 24, 2024:

This PR adds

  • `--lora_layers` and `--lora_blocks` to the dreambooth LoRA training script, to allow targeting specific blocks & layers (a rough sketch of how the two flags compose is shown below).
  • Targeting specific layers has generally proven useful, and for SD3.5 specifically, initial thoughts on the matter were brought up here - Stable Diffusion 3.5 Large Fine-tuning Tutorial.
  • Some initial results to demonstrate the differences:

[image: Group 3-14 - initial results comparison]
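As a rough sketch of how the two flags could compose into `peft` target-module patterns (illustrative only; the argument parsing and the values passed here are simplified assumptions, not the exact script code):

```python
from peft import LoraConfig

# Illustrative CLI values (hypothetical choices, not the script defaults):
lora_layers = "attn.to_k,attn.to_q,attn.to_v,attn.to_out.0"  # --lora_layers
lora_blocks = "12,13,14"                                      # --lora_blocks

layers = [layer.strip() for layer in lora_layers.split(",")]
blocks = [int(block.strip()) for block in lora_blocks.split(",")]

# Restrict each requested layer type to the requested transformer blocks.
target_modules = [
    f"transformer_blocks.{block}.{layer}" for block in blocks for layer in layers
]

transformer_lora_config = LoraConfig(
    r=4,
    lora_alpha=4,
    init_lora_weights="gaussian",
    target_modules=target_modules,
)
print(target_modules[:4])
```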

@HuggingFaceDocBuilderDev commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@linoytsaban (Collaborator, Author) commented:

cc @bghira - maybe you explored this and have some insights/thoughts to share too 🤗

@sayakpaul (Member) left a comment:

Thank you! Left a couple of comments. LMK if they make sense or are unclear.

> **Photorealism**
> In preliminary testing, we observed that freezing the last few layers of the architecture significantly improved model training on a photorealistic dataset, preventing the detail degradation that small datasets tend to introduce.
> **Anatomy preservation**
> To dampen any possible degradation of anatomy, training only the attention layers and **not** the adaptive linear layers could help. For reference, below is one of the transformer blocks.
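To make the quoted heuristic a bit more concrete, here is a minimal inspection sketch that lists the attention projections versus the adaptive-norm linear layers inside a single transformer block. It assumes the `SD3Transformer2DModel` layout in diffusers and access to the (gated) SD3.5 checkpoint, and is only an illustration, not part of the PR:

```python
import torch
from diffusers import SD3Transformer2DModel

# Downloads the (gated) SD3.5 transformer weights; any SD3-family checkpoint works here.
transformer = SD3Transformer2DModel.from_pretrained(
    "stabilityai/stable-diffusion-3.5-large",
    subfolder="transformer",
    torch_dtype=torch.bfloat16,
)

block = transformer.transformer_blocks[0]

# Attention projections (the layers the quoted heuristic suggests training).
attention_layers = [name for name, _ in block.named_modules() if name.startswith("attn.")]
# Adaptive-norm linear projections (the layers it suggests leaving frozen).
adaptive_linear_layers = [
    name for name, _ in block.named_modules() if "norm" in name and name.endswith("linear")
]

print("attention layers:", attention_layers)              # attn.to_q, attn.to_k, attn.to_v, attn.to_out.0, attn.add_*_proj, ...
print("adaptive linear layers:", adaptive_linear_layers)  # e.g. norm1.linear
```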
(Member) commented:

Makes total sense to me!

- with `--lora_layers` you can specify the types of layers you wish to train.
By default, the trained layers are -
`"attn.add_k_proj","attn.add_q_proj","attn.add_v_proj", "attn.to_add_out","attn.to_k","attn.to_out.0","attn.to_q","attn.to_v"`
If you wish to have a leaner LoRA / train more blocks over layers you could pass -
(Member) commented:

Leaner LoRA targeting what aspect? From what I understand, this heuristic is for targeting a specific quality, right?

@linoytsaban (Collaborator, Author) commented:

Indeed, the feature was added to allow experimentation with which layers produce the best quality. But since the default (once the PR is merged) will be
`attn.add_k_proj attn.add_q_proj attn.add_v_proj attn.to_add_out attn.to_k attn.to_out.0 attn.to_q attn.to_v`,
which makes every trained block chunkier than the previous default, I wanted to also give as an example the previous setting,
`--lora_layers attn.to_k attn.to_q attn.to_v attn.to_out.0`,
which results in a smaller LoRA. But if it's confusing/unclear I can remove that.
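For reference, a quick sketch of the two target sets compared above (module names copied from the comment); since each targeted module gets its own pair of LoRA matrices, the leaner set attaches half as many adapters per block:

```python
# The new default target set vs. the previous, leaner one (names from the comment above).
default_layers = [
    "attn.add_k_proj", "attn.add_q_proj", "attn.add_v_proj", "attn.to_add_out",
    "attn.to_k", "attn.to_out.0", "attn.to_q", "attn.to_v",
]
leaner_layers = ["attn.to_k", "attn.to_q", "attn.to_v", "attn.to_out.0"]

# The leaner set drops the context ("add_*") projections entirely.
print(sorted(set(default_layers) - set(leaner_layers)))
print(f"{len(default_layers)} vs. {len(leaner_layers)} targeted modules per block")
```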

Comment on lines +167 to +172
```python
# when not training the text encoder, all the parameters in the state dict should start
# with `"transformer"` in their names.
# In this test, only params of transformer block 0 should be in the state dict
starts_with_transformer = all(
    key.startswith("transformer.transformer_blocks.0") for key in lora_state_dict.keys()
)
```
(Member) commented:

Super!

@linoytsaban requested a review from @sayakpaul on October 28, 2024, 11:22
`attn.add_k_proj,attn.add_q_proj,attn.add_v_proj,attn.to_add_out,attn.to_k,attn.to_out.0,attn.to_q,attn.to_v`
If you wish to have a leaner LoRA / train more blocks over layers you could pass -
```diff
--lora_layers attn.to_k,attn.to_q,attn.to_v,attn.to_out.0
```
(Member) commented:

Are we deleting this block, or are we suggesting to add it? In case of addition, it should be like so: `+ --lora_layers attn.to_k,attn.to_q,attn.to_v,attn.to_out.0` under the diff block.


```python
# when not training the text encoder, all the parameters in the state dict should start
# with `"transformer"` in their names.
# In this test, only params of transformer block 0 should be in the state dict
```
(Member) commented:

Revisit the comment?

--train_batch_size 1
--gradient_accumulation_steps 1
--max_train_steps 2
--lora_blocks 0
(Member) commented:

I would make the value we're passing here a constant so that it's clear from the test what we're varying. Does that make sense?
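A minimal sketch of what that suggestion could look like (the constant and variable names are hypothetical, and `lora_state_dict` is stubbed so the snippet runs standalone):

```python
# Hypothetical constant so the test makes explicit which block is being varied.
LORA_BLOCK_TO_TRAIN = 0

test_args = [
    "--train_batch_size", "1",
    "--gradient_accumulation_steps", "1",
    "--max_train_steps", "2",
    "--lora_blocks", str(LORA_BLOCK_TO_TRAIN),
]

# Stand-in for the LoRA state dict the real test loads after training.
lora_state_dict = {"transformer.transformer_blocks.0.attn.to_q.lora_A.weight": None}

# The assertion references the same constant, tying the check to the arg above.
expected_prefix = f"transformer.transformer_blocks.{LORA_BLOCK_TO_TRAIN}"
assert all(key.startswith(expected_prefix) for key in lora_state_dict)
```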

--train_batch_size 1
--gradient_accumulation_steps 1
--max_train_steps 2
--lora_layers attn.to_k
@sayakpaul (Member) left a comment:

Excellent work here. I love it, especially the tests! I left only minor comments, LMK if they make sense.

@linoytsaban merged commit db5b6a9 into huggingface:main on Oct 28, 2024
8 checks passed
@linoytsaban deleted the sd-3-5-explorations branch on November 26, 2024
sayakpaul pushed a commit that referenced this pull request on Dec 23, 2024
…#9762)

* configurable layers
* configurable layers
* update README
* style
* add test
* style
* add layer test, update readme, add nargs
* readme
* test style
* remove print, change nargs
* test arg change
* style
* revert nargs 2/2
* address sayaks comments
* style
* address sayaks comments