update expected results of slow tests #268
Conversation
Thanks for the fixes @kashif!
@patrickvonplaten IIRC you wanted to double-check something before updating the VE tests?
I can run the tests now on a GPU to double-check as well... will report soon
The documentation is not available anymore as the PR was closed or merged.
Hmm, looks like the
Ah yeah, lemme run the pipeline real quick to make sure this all still works
tests/test_scheduler.py (Outdated)

```diff
@@ -714,7 +714,7 @@ def test_full_loop_no_noise(self):
     result_sum = torch.sum(torch.abs(sample))
     result_mean = torch.mean(torch.abs(sample))

-    assert abs(result_sum.item() - 14379591680.0) < 1e-2
+    assert abs(result_sum.item() - 14379589632.0) < 1e-2
```
Circle CI seemed to be happy with 14379591680.0: https://github.com/huggingface/diffusers/runs/8068605613?check_suite_focus=true So I'm not sure about this change => maybe we should increase the tolerance here instead?
Yes, I was thinking that too, so perhaps abs < 2.5e3 or so?
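A quick sanity check of why the 1e-2 tolerance can never pass across GPUs at this magnitude (a NumPy sketch; the two sums are the values quoted in this thread):

```python
import numpy as np

# Two fp32 result sums of ~1.4e10 reported by different GPUs in this thread.
result_a = 14379591680.0
result_b = 14379589632.0

# Near 1.4e10, the spacing between adjacent representable fp32 values is 1024,
# so an absolute tolerance of 1e-2 is far below what fp32 can even resolve.
assert np.spacing(np.float32(result_a)) == 1024.0

# The two GPU results differ by two ulps (2048.0): abs(...) < 1e-2 can never
# hold across devices, while the proposed 2.5e3 tolerance does.
diff = abs(result_a - result_b)
assert diff == 2048.0
assert diff < 2.5e3

# A relative comparison expresses the same intent without a magnitude-specific
# constant (the PR's commit list shows np.isclose was eventually adopted).
assert np.isclose(result_a, result_b, rtol=1e-6)
```

The takeaway: for large-magnitude sums, either scale the absolute tolerance to the value's ulp, or use a relative tolerance.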
tests/test_scheduler.py (Outdated)

```diff
@@ -714,8 +714,8 @@ def test_full_loop_no_noise(self):
     result_sum = torch.sum(torch.abs(sample))
     result_mean = torch.mean(torch.abs(sample))

-    assert abs(result_sum.item() - 14379591680.0) < 1e-2
-    assert abs(result_mean.item() - 18723426.0) < 1e-3
+    assert abs(result_sum.item() - 14379591680.0) < 2.5e3
```
Can we add a comment here that says the results are flaky depending on the GPU, and that this is why we have such high tolerances?
Actually, upon taking a second look, I think the reason is that we don't make the test deterministic with generators: both the DDPM and Score-VE schedulers make use of `torch.randn(...)`.
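A minimal sketch of that fix: route every random draw through an explicitly seeded generator object instead of the global RNG, so repeated runs produce identical noise. NumPy is used here for portability; in the schedulers the analogous call would be `torch.randn(shape, generator=torch.Generator().manual_seed(seed))`. The `sample_noise` helper is hypothetical, for illustration only.

```python
import numpy as np

def sample_noise(shape, generator):
    # Every stochastic step pulls from the passed-in generator,
    # never from the global RNG, so the caller controls determinism.
    return generator.standard_normal(shape)

# Two independently constructed generators with the same seed...
gen1 = np.random.default_rng(0)
gen2 = np.random.default_rng(0)

a = sample_noise((4, 4), gen1)
b = sample_noise((4, 4), gen2)

# ...produce bit-identical noise, so a test comparing against
# fixed expected values becomes reproducible across runs.
assert np.array_equal(a, b)
```

The same pattern applies per-test: construct a fresh seeded generator at the top of each test so earlier draws can't shift later ones.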
@natolambert @patrickvonplaten I tested now on a bunch of servers and a laptop, and all the slow tests are passing for me...
Great, it has been a bit since I tested them all, so hopefully it's more stable now!
It'd be nice if we make the random scheduler tests deterministic by passing the generator correctly. Regarding the pipeline test it'd be nice to reduce the number of steps to speed it up a bit :-)
I think merging #289 can unblock this PR here.
@kashif looks like a couple of VE tests are failing now, could you check please? https://github.com/huggingface/diffusers/runs/8158641999?check_suite_focus=true When those are back, I think we can carefully merge this :)
Sure! Let me get it over the line.
Should we still try to get this one merged?
Yes please! I believe some of the images on the hub might need updating? Let me fix the conflict.
Can we fix the quality checks here and then run all slow tests to be sure?
Awesome, thanks for making all the changes! Feel free to merge whenever :-)
* update expected results of slow tests
* relax sum and mean tests
* Print shapes when reporting exception
* formatting
* fix sentence
* relax test_stable_diffusion_fast_ddim for gpu fp16
* relax flakey tests on GPU
* added comment on large tolerences
* black
* format
* set scheduler seed
* added generator
* use np.isclose
* set num_inference_steps to 50
* fix dep. warning
* update expected_slice
* preprocess if image
* updated expected results
* updated expected from CI
* pass generator to VAE
* undo change back to orig
* use orignal
* revert back the expected on cpu
* revert back values for CPU
* more undo
* update result after using gen
* update mean
* set generator for mps
* update expected on CI server
* undo
* use new seed every time
* cpu manual seed
* reduce num_inference_steps
* style
* use generator for randn

Co-authored-by: Patrick von Platen <[email protected]>
Updated the two failing slow tests