[AudioLDM 2] Pipeline fixes #4738

sanchit-gandhi · 2023-08-23T11:17:37Z

What does this PR do?

Pipeline fixes:

Use return_dict=False in the UNet to allow for torch.compile
~~Use the ImagePipelineOutput when we return latents (not the AudioPipelineOutput)~~

Test fixes:

Fast tests: don't override the dtype test (all CLAP params now respect the default dtype)
Slow tests: Use Hub checkpoints (not local paths!)

src/diffusers/pipelines/audioldm2/pipeline_audioldm2.py

HuggingFaceDocBuilderDev · 2023-08-23T11:41:21Z

The documentation is not available anymore as the PR was closed or merged.

sanchit-gandhi · 2023-08-23T11:45:42Z

tests/pipelines/audioldm2/test_audioldm2.py

@@ -469,29 +469,6 @@ def test_save_load_optional_components(self):
        # increase tolerance from 1e-4 -> 2e-4 to account for large composite model
        super().test_save_load_optional_components(expected_max_difference=2e-4)

-    def test_to_dtype(self):


Now that huggingface/transformers#25682 is merged, overriding this test won't be necessary - CLAP will always have the default dtype of float32

sanchit-gandhi · 2023-08-23T13:19:06Z

Ready for review @sayakpaul!

* fix docs * fix unet docs * use image output for latents * fix hub checkpoints * fix pipeline example * update example * return_dict = False * revert image pipeline output * revert doc changes * remove dtype test * make style * remove docstring updates * remove unet docstring update * Empty commit to re-trigger CI * fix cpu offload * fix dtype test * add offload test

sanchit-gandhi added 5 commits August 23, 2023 12:12

fix docs

7c0c9ac

fix unet docs

c52100b

use image output for latents

cf78776

fix hub checkpoints

7ff4b46

fix pipeline example

6f90747

sanchit-gandhi commented Aug 23, 2023

View reviewed changes

src/diffusers/pipelines/audioldm2/pipeline_audioldm2.py Outdated Show resolved Hide resolved

sanchit-gandhi added 3 commits August 23, 2023 12:22

update example

a0f6ac5

return_dict = False

addb98e

revert image pipeline output

e150450

sanchit-gandhi changed the title ~~[AudioLDM 2] Pipeline + doc fixes~~ [AudioLDM 2] Pipeline fixes Aug 23, 2023

revert doc changes

b296431

remove dtype test

1c84471

sanchit-gandhi commented Aug 23, 2023

View reviewed changes

sanchit-gandhi marked this pull request as ready for review August 23, 2023 12:19

sanchit-gandhi added 3 commits August 23, 2023 13:24

make style

5d420c9

remove docstring updates

d0e22e4

remove unet docstring update

4efa070

patrickvonplaten approved these changes Aug 23, 2023

View reviewed changes

sanchit-gandhi added 4 commits August 24, 2023 15:12

Empty commit to re-trigger CI

0fb7aad

fix cpu offload

76a52b5

fix dtype test

f4b7499

add offload test

6def17a

sanchit-gandhi merged commit 29a11c2 into huggingface:main Aug 25, 2023

sanchit-gandhi deleted the audioldm2-fix branch August 25, 2023 10:38

sanchit-gandhi mentioned this pull request Sep 13, 2023

Return effective attention mask in Wav2Vec2BaseModelOutput huggingface/transformers#25471

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AudioLDM 2] Pipeline fixes #4738

[AudioLDM 2] Pipeline fixes #4738

Uh oh!

sanchit-gandhi commented Aug 23, 2023 •

edited

Loading

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Aug 23, 2023 •

edited

Loading

Uh oh!

sanchit-gandhi Aug 23, 2023 •

edited

Loading

Uh oh!

sanchit-gandhi commented Aug 23, 2023

Uh oh!

Uh oh!

[AudioLDM 2] Pipeline fixes #4738

[AudioLDM 2] Pipeline fixes #4738

Uh oh!

Conversation

sanchit-gandhi commented Aug 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Aug 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sanchit-gandhi Aug 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sanchit-gandhi commented Aug 23, 2023

Uh oh!

Uh oh!

sanchit-gandhi commented Aug 23, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Aug 23, 2023 •

edited

Loading

sanchit-gandhi Aug 23, 2023 •

edited

Loading