device map legacy attention block weight conversion #3804
Conversation
@@ -78,6 +78,7 @@ def __init__(
        self.upcast_softmax = upcast_softmax
        self.rescale_output_factor = rescale_output_factor
        self.residual_connection = residual_connection
        self.dropout = dropout
Added so we can re-create the dropout module when converting back to the new weight format.
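For context, a minimal sketch of how the stored probability might be used; the class shape and helper name here are illustrative, not the actual diffusers implementation:

import torch
import torch.nn as nn

class Attention(nn.Module):
    # Illustrative sketch only, not the real diffusers Attention class.
    def __init__(self, query_dim: int, dropout: float = 0.0):
        super().__init__()
        # nn.Dropout only exposes `.p`, so keeping the raw float around
        # makes it trivial to rebuild the output block when converting
        # deprecated attention weights to the new format.
        self.dropout = dropout
        self.to_out = nn.ModuleList([nn.Linear(query_dim, query_dim), nn.Dropout(dropout)])

    def rebuild_to_out(self, proj_weight: torch.Tensor, proj_bias: torch.Tensor):
        # Hypothetical helper: re-create `to_out` from deprecated
        # `proj_attn` weights, reusing the stored dropout probability.
        linear = nn.Linear(proj_weight.shape[1], proj_weight.shape[0])
        linear.weight.data = proj_weight
        linear.bias.data = proj_bias
        self.to_out = nn.ModuleList([linear, nn.Dropout(self.dropout)])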
The documentation is not available anymore as the PR was closed or merged.
# (which look like they should be private variables?), so we can't use the standard hooks
# to rename parameters on load. We need to mimic the original weight names so the correct
# attributes are available. After we have loaded the weights, we convert the deprecated
# names to the new non-deprecated names. Then we _greatly encourage_ the user to convert
Do we have guidance available for users on how they should perform the conversion?
Ah, never mind. I guess you meant that once we load the old attention block weight names and run the conversion internally, we suggest users save the pipeline. Right?
Yeah, just this blurb here:
f"Taking `{str(e)}` while using `accelerate.load_checkpoint_and_dispatch` to mean {pretrained_model_name_or_path}"
" was saved with deprecated attention block weight names. We will load it with the deprecated attention block"
" names and convert them on the fly to the new attention block format. Please re-save the model after this conversion,"
" so we don't have to do the on the fly renaming in the future. If the model is from a hub checkpoint,"
" please also re-upload it or open a PR on the original repository."
pipe = DiffusionPipeline.from_pretrained("hf-internal-testing/tiny-stable-diffusion-pipe", safety_checker=None)

pre_conversion = pipe(
    "foo",
Killer prompt.
Concrete! Nice tests.
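The excerpt above is truncated; roughly, the test is a before/after numerical comparison around a save/reload round trip. A sketch of that shape, with illustrative step counts and tolerances:

import tempfile

import numpy as np
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "hf-internal-testing/tiny-stable-diffusion-pipe", safety_checker=None
)

generator = torch.Generator("cpu").manual_seed(0)
pre_conversion = pipe(
    "foo", num_inference_steps=2, generator=generator, output_type="np"
).images

with tempfile.TemporaryDirectory() as tmpdir:
    # save_pretrained writes the converted (non-deprecated) weight names,
    # so the reloaded pipeline takes the normal loading path.
    pipe.save_pretrained(tmpdir)
    pipe = DiffusionPipeline.from_pretrained(tmpdir, safety_checker=None)

generator = torch.Generator("cpu").manual_seed(0)
post_conversion = pipe(
    "foo", num_inference_steps=2, generator=generator, output_type="np"
).images

# The conversion must not change the numerics.
assert np.allclose(pre_conversion, post_conversion, atol=1e-4)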
# names to the new non-deprecated names. Then we _greatly encourage_ the user to convert
# the weights so we don't have to do this again.

if "'Attention' object has no attribute" in str(e):
pretty hacky, but OK! Let's leave it for now :-)
yeah I cringed while writing this 🙃
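For context, the hack amounts to a catch-and-retry keyed off the error message. A simplified sketch, not the exact diffusers code; the helper methods on `model` are named hypothetically:

from accelerate import load_checkpoint_and_dispatch

def load_with_legacy_fallback(model, checkpoint_path, device_map):
    try:
        return load_checkpoint_and_dispatch(model, checkpoint_path, device_map=device_map)
    except AttributeError as e:
        # accelerate assigns tensors by attribute path, so deprecated key
        # names like "...query.weight" raise on the new Attention module.
        # There is no public hook to remap keys here, hence the string match.
        if "'Attention' object has no attribute" not in str(e):
            raise
        model._temp_convert_self_to_deprecated_attention_blocks()  # hypothetical
        loaded = load_checkpoint_and_dispatch(model, checkpoint_path, device_map=device_map)
        model._undo_temp_convert_self_to_deprecated_attention_blocks()  # hypothetical
        return loaded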
Force-pushed from 0aab8ca to 60aa2e6
re: #3740
This is not ideal, but AFAIK it's the only way to solve this without exposing new functionality in accelerate.