Add guidance start/stop #3770

holwech · 2023-06-13T08:47:54Z

Adds guidance start/stop to the img2img and text2img Stable Diffusion pipeline classe. This is a setting that is available in automatic1111. This PR is heavily inspired by this community pipeline that implements the same functionality.

I've used and tested the PR on my own project and it seems to work fine.

patrickvonplaten

Looks like a nice addition to me ,could we add some tests and also add it to the controlnet inpainting pipeline?

Tests should be added here:
https://github.com/huggingface/diffusers/tree/main/tests/pipelines/controlnet
Inpainting pipeline is here:
https://github.com/huggingface/diffusers/blob/main/src/diffusers/pipelines/controlnet/pipeline_controlnet_inpaint.py

sayakpaul

Seems to be a nice addition. Would be cool to also see some results with and without it.

And as mentioned by @patrickvonplaten let's add some test cases for it to ensure robustness and feature compatibility.

julien-blanchon · 2023-06-15T20:42:02Z

controlnet_guidance_start and controlnet_guidance_end don't work for MultiControlnet. It should accept list/tuple for each controlnet

holwech · 2023-06-22T08:19:27Z

Thanks for the feedback. Seems like implementing this is slightly more complex than I anticipated, but I will try to look at this in the coming weeks. If anyone wants this implemented asap, feel free to take it over.

Todos

Write tests
Add guidance start/end to inpainting
Add guidance start/end to MultiControlNet

holwech · 2023-06-22T08:26:48Z

Here is a before and after example:

modern apartment, grass, rocks, mountains, norway, summer, vray, hyper-realistic, high resolution, highly detailed
 Negative prompt: ugly, bad, jpeg artifacts, vignette
 Steps: 40, Sampler: DPM++ 2M, CFG scale: 7, Seed: 3313150245, Size: 896x512, Model: v1_realisticVisionV20_v20, ControlNet 0: "preprocessor: lineart_standard (from white bg & black line), model: control_v11p_sd15_lineart [43d4be0d], weight: 1, starting/ending: (0, 1), resize mode: Crop and Resize, pixel perfect: False, control mode: Balanced, preprocessor params: (512, 64, 64)", ControlNet 1: "preprocessor: depth_midas, model: control_v11f1p_sd15_depth [cfd03158], weight: 0.1, starting/ending: (0, 1), resize mode: Crop and Resize, pixel perfect: False, control mode: Balanced, preprocessor params: (512, 64, 64)", Version: v1.3.2

modern apartment, grass, rocks, mountains, norway, summer, vray, hyper-realistic, high resolution, highly detailed
 Negative prompt: ugly, bad, jpeg artifacts, vignette
 Steps: 40, Sampler: DPM++ 2M, CFG scale: 7, Seed: 1996722551, Size: 896x512, Model: v1_realisticVisionV20_v20, ControlNet 0: "preprocessor: lineart_standard (from white bg & black line), model: control_v11p_sd15_lineart [43d4be0d], weight: 1, starting/ending: (0.05, 0.7), resize mode: Crop and Resize, pixel perfect: False, control mode: Balanced, preprocessor params: (512, 64, 64)", ControlNet 1: "preprocessor: depth_midas, model: control_v11f1p_sd15_depth [cfd03158], weight: 0.1, starting/ending: (0.05, 0.7), resize mode: Crop and Resize, pixel perfect: False, control mode: Balanced, preprocessor params: (512, 64, 64)", Version: v1.3.2

I find it's quite useful to set guidance end to a number lower than 1. This allows the model to clean up any artifacts caused by controlnet in the last steps of the process. By setting guidance start larger than 0, details that don't exist in the input images are added.

holwech · 2023-06-23T08:38:23Z

I've updated the PR to support multicontrolnet, can someone have a look at the changes and let me know if you are fine with the changes? I will add some tests afterwards if we agree to proceed.

cylwin · 2023-06-26T16:52:52Z

We need this for our qrcode controlnet demo.

HuggingFaceDocBuilderDev · 2023-06-26T19:09:54Z

The documentation is not available anymore as the PR was closed or merged.

PhilSad · 2023-06-26T20:42:42Z

Thanks for your PR! Here is a demo colab that shows how to use it to generate qrcodes.

I also made the lib sdqrcode to generate ai qrcodes with this method with a simple one-liner (colab)

patrickvonplaten · 2023-06-26T23:03:49Z

Hey @holwech,

Sorry for fiddling soo much with your PR. Some tests were missing and I think it's better if we try to not change the controlnet files. Here I just take advantage of the fact that zero'ing out the controlnet result is the same as skipping it. The gain we might get from skipping on controlnet is negligible IMO so design-wise I think it's cleaner to just zero it out.

* Add guidance start/stop * Add guidance start/stop to inpaint class * Black formatting * Add support for guidance for multicontrolnet * Add inclusive end * Improve design * correct imports * Finish * Finish all * Correct more * make style --------- Co-authored-by: Patrick von Platen <[email protected]>

Add guidance start/stop

911efba

patrickvonplaten reviewed Jun 15, 2023

View reviewed changes

patrickvonplaten requested review from yiyixuxu and sayakpaul June 15, 2023 12:56

patrickvonplaten mentioned this pull request Jun 15, 2023

[New feature] Add ControlNet conditioning_scale scheduling to StableDiffusionControlNetPipeline #3759

Closed

sayakpaul reviewed Jun 15, 2023

View reviewed changes

holwech and others added 4 commits June 22, 2023 13:03

Merge branch 'huggingface:main' into main

cea10c7

Add guidance start/stop to inpaint class

597dc75

Black formatting

659a8a1

Add support for guidance for multicontrolnet

bbb9a39

holwech requested a review from patrickvonplaten June 23, 2023 09:02

Add inclusive end

5257fec

patrickvonplaten added 6 commits June 26, 2023 23:33

Improve design

19d8d0c

correct imports

522ebcf

Finish

5e4a46d

Finish all

c1f34db

Correct more

10771a1

make style

e0d07f6

patrickvonplaten merged commit 9a45d7f into huggingface:main Jun 26, 2023

VV-A-VV mentioned this pull request Aug 1, 2023

Delete the duplicate code for the contolnet img 2 img #4411

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add guidance start/stop #3770

Add guidance start/stop #3770

Uh oh!

holwech commented Jun 13, 2023

Uh oh!

patrickvonplaten left a comment

Uh oh!

sayakpaul left a comment

Uh oh!

julien-blanchon commented Jun 15, 2023

Uh oh!

holwech commented Jun 22, 2023 •

edited

Loading

Uh oh!

holwech commented Jun 22, 2023 •

edited

Loading

Uh oh!

holwech commented Jun 23, 2023

Uh oh!

cylwin commented Jun 26, 2023

Uh oh!

HuggingFaceDocBuilderDev commented Jun 26, 2023 •

edited

Loading

Uh oh!

PhilSad commented Jun 26, 2023

Uh oh!

patrickvonplaten commented Jun 26, 2023

Uh oh!

Uh oh!

Add guidance start/stop #3770

Add guidance start/stop #3770

Uh oh!

Conversation

holwech commented Jun 13, 2023

Uh oh!

patrickvonplaten left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

julien-blanchon commented Jun 15, 2023

Uh oh!

holwech commented Jun 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

holwech commented Jun 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

holwech commented Jun 23, 2023

Uh oh!

cylwin commented Jun 26, 2023

Uh oh!

HuggingFaceDocBuilderDev commented Jun 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PhilSad commented Jun 26, 2023

Uh oh!

patrickvonplaten commented Jun 26, 2023

Uh oh!

Uh oh!

holwech commented Jun 22, 2023 •

edited

Loading

holwech commented Jun 22, 2023 •

edited

Loading

HuggingFaceDocBuilderDev commented Jun 26, 2023 •

edited

Loading