Skip to content

Tags: silveroxides/llama.cpp

Tags

b5371

Toggle b5371's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
quantize : improve tensor-type pattern matching (ggml-org#13033)

b5370

Toggle b5370's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
clip : clip.h become private API (⚠️ breaking change) (ggml-org#13510)

b3713

Toggle b3713's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
llama : minor sampling refactor (2) (ggml-org#9386)

b3711

Toggle b3711's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
CUDA: fix variable name conflict for Windows build (ggml-org#9382)

b3707

Toggle b3707's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend …

…by submitting smaller cmdbuffers early. (ggml-org#9118)

* Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early.

* fix compile issues

* Fix issues where the last submit wasn't executed or handled properly.

* remove trailing whitespace

* Repair GGML_VULKAN_CHECK_RESULTS

* Increase submit counter only if actual work has been submitted and increase submit count to 100.

* Fix some nodes are not checked with GGML_VULKAN_CHECK_RESULTS enabled.

b3706

Toggle b3706's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cuda : fix FA Q src index (1 -> 0) (ggml-org#9374)

b3705

Toggle b3705's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
common : bring back missing args, add env var duplication check (ggml…

…-org#9375)

* common : bring back missing args

* move duplication check to test-arg-parser

* add check for duplicated env var

* correct default values

b3704

Toggle b3704's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
common : restore --n-gpu-layers (ggml-org#9371)

b3703

Toggle b3703's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
llama : refactor samplers internal implementation (ggml-org#9370)

b3702

Toggle b3702's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[SYCL] add check malloc result on device (ggml-org#9346)

* add check malloc result on device

* update for review comments, check all malloc_device() result

---------

Co-authored-by: arthw <[email protected]>