Tags · silveroxides/llama.cpp

b5371

quantize : improve tensor-type pattern matching (ggml-org#13033)

May 13, 2025
e5c834f
zip
tar.gz
Downloads

b5370

clip : clip.h become private API (⚠️ breaking change) (ggml-org#13510)

May 13, 2025
71bdbdb
zip
tar.gz
Downloads

b3713

llama : minor sampling refactor (2) (ggml-org#9386)

Sep 9, 2024
5fb5e24
zip
tar.gz

b3711

CUDA: fix variable name conflict for Windows build (ggml-org#9382)

Sep 9, 2024
8e6e2fb
zip
tar.gz

b3707

Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend …

…by submitting smaller cmdbuffers early. (ggml-org#9118)

* Overlap cmdbuffer creation and cmdbuffer execution in Vulkan backend by submitting smaller cmdbuffers early.

* fix compile issues

* Fix issues where the last submit wasn't executed or handled properly.

* remove trailing whitespace

* Repair GGML_VULKAN_CHECK_RESULTS

* Increase submit counter only if actual work has been submitted and increase submit count to 100.

* Fix some nodes are not checked with GGML_VULKAN_CHECK_RESULTS enabled.

Sep 8, 2024
daa9623
zip
tar.gz

b3706

cuda : fix FA Q src index (1 -> 0) (ggml-org#9374)

Sep 8, 2024
e079bff
zip
tar.gz

b3705

common : bring back missing args, add env var duplication check (ggml…

…-org#9375)

* common : bring back missing args

* move duplication check to test-arg-parser

* add check for duplicated env var

* correct default values

Sep 8, 2024
3f7ccfd
zip
tar.gz

b3704

common : restore --n-gpu-layers (ggml-org#9371)

Sep 8, 2024
a249843
zip
tar.gz

b3703

llama : refactor samplers internal implementation (ggml-org#9370)

Sep 8, 2024
19f4a7b
zip
tar.gz

b3702

[SYCL] add check malloc result on device (ggml-org#9346)

* add check malloc result on device

* update for review comments, check all malloc_device() result

---------

Co-authored-by: arthw <[email protected]>

Sep 8, 2024
2a358fb
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

b5371

b5370

b3713

b3711

b3707

b3706

b3705

b3704

b3703

b3702

Tags: silveroxides/llama.cpp