Speed improvement with new ROCm attention #8137
- Can you try a bigger resolution to see if it OOMs?
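If you want to probe that without running a full generation, here is a minimal sketch using naive attention (explicit score matrix, the memory-hungry case) as a stand-in for the model; the shapes are illustrative assumptions, not ComfyUI's actual code path:

```python
# Sketch: probe growing latent sizes until naive attention runs out of memory.
# Batch/head/dim values below are assumptions, not measured from the thread.
import torch

for side in (64, 96, 128, 152):          # latent side length; pixels = side * 8 for SDXL
    tokens = side * side
    try:
        q = torch.randn(2, 10, tokens, 64, device="cuda", dtype=torch.float16)
        scores = q @ q.transpose(-2, -1)  # (2, 10, tokens, tokens) attention score matrix
        torch.cuda.synchronize()
        print(f"{side * 8}x{side * 8}: ok, peak {torch.cuda.max_memory_allocated() / 2**30:.1f} GiB")
        del q, scores
        torch.cuda.empty_cache()
    except torch.cuda.OutOfMemoryError:
        print(f"{side * 8}x{side * 8}: OOM")
        break
```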
- While we're at it: quad attention is significantly faster on a 1216x832 SDXL (DPM++ sampler) than PyTorch cross attention and Flash Attention 2 (Triton backend) for my 6800 XT. Interestingly, I was getting better speeds with Forge than with Comfy for some reason (but Forge broke recently and I couldn't be bothered to fix it). This is on ROCm 6.4.1, the latest Ubuntu (with amdgpu-dkms modified to compile on the 6.14 kernel), and today's PyTorch nightly for ROCm 6.4. A rough standalone timing sketch follows below.
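For anyone who wants to reproduce this kind of backend comparison outside a UI, here is a rough sketch that times PyTorch's `scaled_dot_product_attention` per backend; the head count and head dim are assumptions approximating an SDXL attention layer at 1216x832, not the exact workload above:

```python
# Sketch: compare PyTorch SDPA backends at roughly SDXL-at-1216x832 scale.
import time

import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel  # PyTorch 2.3+

tokens = (1216 // 8) * (832 // 8)  # latent tokens at this resolution
q = torch.randn(2, 10, tokens, 64, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

for backend in (SDPBackend.MATH, SDPBackend.EFFICIENT_ATTENTION, SDPBackend.FLASH_ATTENTION):
    try:
        with sdpa_kernel(backend):
            F.scaled_dot_product_attention(q, k, v)   # warm-up
            torch.cuda.synchronize()                  # the "cuda" namespace also covers ROCm builds
            t0 = time.perf_counter()
            for _ in range(20):
                F.scaled_dot_product_attention(q, k, v)
            torch.cuda.synchronize()
        print(f"{backend.name}: {(time.perf_counter() - t0) / 20 * 1e3:.2f} ms/call")
    except RuntimeError as err:                       # backend unavailable (or OOM) on this build
        print(f"{backend.name}: unavailable ({err})")
```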
- Since commit 08368f8 asks us to share speed improvements, here I am.
  I was somewhat surprised to see there's an actual speed improvement, considering I am using a Radeon VII, which doesn't even have tensor cores.
  512x512, batch size 2, 20 steps, SDXL:
  Original sub-quadratic attention: 21.0 seconds
  New PyTorch cross attention: 19.2 seconds
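If anyone else wants to confirm which fused SDPA kernels their ROCm build exposes before benchmarking, a small sketch (standard PyTorch introspection calls, nothing specific to this thread):

```python
# Sketch: report the PyTorch build and which SDPA backends are currently enabled.
import torch

print(torch.__version__, "HIP:", torch.version.hip)  # torch.version.hip is None on non-ROCm builds
print("flash_sdp enabled:", torch.backends.cuda.flash_sdp_enabled())
print("mem_efficient_sdp enabled:", torch.backends.cuda.mem_efficient_sdp_enabled())
print("math_sdp enabled:", torch.backends.cuda.math_sdp_enabled())
```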