You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
rhjdvsgsgks
changed the title
Misc. bug: performance regression after fd123cfead49eb32e386e26b8ef7a6d41554dda5
Misc. bug: vulkan: performance regression after fd123cfead49eb32e386e26b8ef7a6d41554dda5
Mar 24, 2025
This is a little bit of a damned if you do, damned if you don't situation. Defaulting to smaller allocations fixes a number of OOM crashes due to fragmentation or driver problems, many of which have been reported over the last year. If it reduces performance slightly, that's regrettable, but I can't just go back to the old behaviour.
Please check if setting GGML_VK_SUBALLOCATION_BLOCK_SIZE=2147483648 fixes your performance regression already, otherwise you can go back to the old behaviour and performance by setting GGML_VK_SUBALLOCATION_BLOCK_SIZE=4294967296.
Name and Version
fd123cf
Operating systems
Linux
Which llama.cpp modules do you know to be affected?
vulkan backend
Command line
Problem description & steps to reproduce
build: a53f7f7 (4908)
build: fd123cf (4909)
First Bad Commit
fd123cf
Relevant log output
The text was updated successfully, but these errors were encountered: