vulkan: implement several ops relevant for ggml_opt #11769

remyoudompheng · 2025-02-09T10:09:33Z

This PR implements several GGML ops that may be relevant for #10544: SUM, ARGMAX, SUB, COUNT_EQUAL, OPT_STEP_ADAMW, and REPEAT_BACK.
With these patches, test-opt can be run using the Vulkan backend, with a few failures that may be caused by rounding issues.
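
To illustrate why rounding alone can make a GPU reduction disagree with a CPU loop, here is a minimal sketch, not taken from the PR. It assumes IEEE-754 single precision and compares a left-to-right sum with a pairwise (tree-shaped) sum, which is loosely how a parallel reduction groups its additions. The function names are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Left-to-right accumulation, as a simple sequential loop would do.
float sum_sequential(const std::vector<float>& v) {
    float acc = 0.0f;
    for (float x : v) {
        acc += x;
    }
    return acc;
}

// Pairwise (tree) reduction over v[lo, hi): the additions are grouped
// differently, similar in spirit to a parallel reduction.
float sum_pairwise(const std::vector<float>& v, std::size_t lo, std::size_t hi) {
    assert(lo <= hi && hi <= v.size());
    if (hi - lo == 0) return 0.0f;
    if (hi - lo == 1) return v[lo];
    const std::size_t mid = lo + (hi - lo) / 2;
    return sum_pairwise(v, lo, mid) + sum_pairwise(v, mid, hi);
}
```

For the input `{1e8f, 1.0f, -1e8f, 1.0f}` the sequential sum is 1 while the pairwise sum is 0, because `1e8f + 1.0f` rounds back to `1e8f` in single precision. Neither result is "wrong"; they differ only in the order of the additions.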

Several issues were identified in test-backend-ops:

SUB was not tested at all
REPEAT_BACK has a few cases not supported by the CPU backend (crash with -b CPU)

Several issues were identified in Vulkan CHECK_RESULTS mode:

RWKV_WKV6 was crashing
various buffers were not freed

ggml/src/ggml-vulkan/vulkan-shaders/count_equal.comp

jeffbolznv

The tests all pass on my system.

ggml/src/ggml-vulkan/ggml-vulkan.cpp

jeffbolznv · 2025-02-13T19:43:28Z

ggml/src/ggml-vulkan/vulkan-shaders/repeat_back.comp

+        for (uint i2 = i12; i2 < p.ne02; i2 += p.ne12) {
+            for (uint i1 = i11; i1 < p.ne01; i1 += p.ne11) {
+                for (uint i0 = i10; i0 < p.ne00; i0 += p.ne10) {
+                    acc += data_a[i3*p.nb03 + i2*p.nb02 + i1*p.nb01 + i0*p.nb00];


Is get_aoffset() needed here? (I don't know)
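
For readers following the loop nest in the hunk above, here is a hedged CPU-side sketch of the accumulation it performs: each destination element sums every source element whose index is congruent to it modulo the destination shape. This is an illustration only, assuming contiguous tensors with dimension 0 varying fastest; `Shape4` and `repeat_back_ref` are hypothetical names, not ggml API, and the sketch ignores the buffer-offset question raised in the comment.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not ggml API): 4-D shape, ne[0] is the fastest-varying dim.
struct Shape4 {
    std::size_t ne[4];
};

// Reference accumulation: dst[i10,i11,i12,i13] is the sum of all src elements
// at indices i0 = i10 + k*d.ne[0], i1 = i11 + k*d.ne[1], and so on.
std::vector<float> repeat_back_ref(const std::vector<float>& src, Shape4 s, Shape4 d) {
    for (int k = 0; k < 4; ++k) {
        // The source shape must be a whole multiple of the destination shape.
        assert(d.ne[k] > 0 && s.ne[k] % d.ne[k] == 0);
    }
    assert(src.size() == s.ne[0] * s.ne[1] * s.ne[2] * s.ne[3]);

    std::vector<float> dst(d.ne[0] * d.ne[1] * d.ne[2] * d.ne[3], 0.0f);
    for (std::size_t i13 = 0; i13 < d.ne[3]; ++i13)
    for (std::size_t i12 = 0; i12 < d.ne[2]; ++i12)
    for (std::size_t i11 = 0; i11 < d.ne[1]; ++i11)
    for (std::size_t i10 = 0; i10 < d.ne[0]; ++i10) {
        float acc = 0.0f;
        // Same stepping pattern as the shader loops: start at the dst index,
        // step by the dst extent, stop at the src extent.
        for (std::size_t i3 = i13; i3 < s.ne[3]; i3 += d.ne[3])
        for (std::size_t i2 = i12; i2 < s.ne[2]; i2 += d.ne[2])
        for (std::size_t i1 = i11; i1 < s.ne[1]; i1 += d.ne[1])
        for (std::size_t i0 = i10; i0 < s.ne[0]; i0 += d.ne[0]) {
            acc += src[((i3 * s.ne[2] + i2) * s.ne[1] + i1) * s.ne[0] + i0];
        }
        dst[((i13 * d.ne[2] + i12) * d.ne[1] + i11) * d.ne[0] + i10] = acc;
    }
    return dst;
}
```

For example, reducing a source of shape 4x1x1x1 holding `{1, 2, 3, 4}` to a destination of shape 2x1x1x1 gives `{1 + 3, 2 + 4}`.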

0cc4m · 2025-02-15T08:30:43Z

The Intel crash can be ignored. Once you resolve the memset, this can be merged.
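
As background on why the memset matters for COUNT_EQUAL, here is a rough sketch. It assumes the count is built by adding per-workgroup partial counts into a single output value; that structure is an assumption for illustration, not a description of the actual shader. Under that assumption the output must be cleared before every run, otherwise stale contents leak into the result. `count_equal_accumulate` is a hypothetical name.

```cpp
#include <algorithm>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Each simulated "workgroup" counts matches in its chunk and adds the partial
// count to a shared output. The function never clears `out` itself.
void count_equal_accumulate(const std::vector<int32_t>& a,
                            const std::vector<int32_t>& b,
                            std::size_t chunk,
                            std::atomic<int64_t>& out) {
    assert(a.size() == b.size() && chunk > 0);
    for (std::size_t start = 0; start < a.size(); start += chunk) {
        const std::size_t end = std::min(start + chunk, a.size());
        int64_t partial = 0;
        for (std::size_t i = start; i < end; ++i) {
            if (a[i] == b[i]) {
                ++partial;
            }
        }
        // Accumulate into the shared result, as an atomic add would on a GPU.
        out.fetch_add(partial, std::memory_order_relaxed);
    }
}
```

The caller is expected to store 0 into `out` before calling the function, which plays the role of the memset or buffer fill discussed here.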

remyoudompheng · 2025-02-15T10:07:48Z

Thanks for the review.
Let me know if 3d506e5 is the proper way to proceed.

ggml/src/ggml-vulkan/ggml-vulkan.cpp

netrunnereve · 2025-02-16T17:26:22Z

The tests are passing for me on GCN. It would definitely be cool to be able to do some finetuning on Vulkan once GGML gets training support.

remyoudompheng · 2025-02-16T22:21:32Z

Rebased the branch to resolve conflicts in ggml-vulkan.cpp.

* vulkan: support memset_tensor
* vulkan: support GGML_OP_SUM
* vulkan: implement GGML_OP_ARGMAX
* vulkan: implement GGML_OP_SUB
* vulkan: implement GGML_OP_COUNT_EQUAL
* vulkan: implement GGML_OP_OPT_STEP_ADAMW
* vulkan: fix check_results RWKV_WKV6 crash and memory leaks
* vulkan: implement GGML_OP_REPEAT_BACK
* tests: remove invalid test-backend-ops REPEAT_BACK tests
* vulkan: fix COUNT_EQUAL memset using a fillBuffer command

github-actions bot added the testing, Vulkan, and ggml labels Feb 9, 2025

0cc4m reviewed Feb 10, 2025

jeffbolznv reviewed Feb 14, 2025

jeffbolznv reviewed Feb 15, 2025

ttkciar mentioned this pull request Feb 16, 2025

llama/ggml: add LLM training support #10544

Merged

remyoudompheng force-pushed the vulkan-ggmlopt-pr branch from 3d506e5 to 34792b3 Compare February 16, 2025 12:43

jeffbolznv approved these changes Feb 16, 2025

remyoudompheng mentioned this pull request Feb 16, 2025

vulkan: implement more backpropagation operators #11914

Merged

remyoudompheng added 10 commits February 16, 2025 23:17

vulkan: support memset_tensor

dca13cb

vulkan: support GGML_OP_SUM

9de61b5

vulkan: implement GGML_OP_ARGMAX

f354a78

vulkan: implement GGML_OP_SUB

eacca58

vulkan: implement GGML_OP_COUNT_EQUAL

7a6cb87

vulkan: implement GGML_OP_OPT_STEP_ADAMW

750e1a4

vulkan: fix check_results RWKV_WKV6 crash and memory leaks

6f58276

vulkan: implement GGML_OP_REPEAT_BACK

4908031

tests: remove invalid test-backend-ops REPEAT_BACK tests

e22079c

vulkan: fix COUNT_EQUAL memset using a fillBuffer command

e172d2d

remyoudompheng force-pushed the vulkan-ggmlopt-pr branch from b94a44c to e172d2d Compare February 16, 2025 22:21

0cc4m approved these changes Feb 17, 2025

0cc4m merged commit 2eea03d into ggml-org:master Feb 17, 2025
46 checks passed
