Tags: CodeLinaro/llama.cpp

b5255

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ci: fix cross-compile sync issues (ggml-org#12804)

b5098

convert : ability to lazy-load safetensors remotely without downloading to disk (ggml-org#12820)

* gguf util : add SafetensorRemote

* fix style

* convert: add --remote option

* convert : allow using lazy remote tensors

It's a bit slow for now since everything is blocking and single-threaded.

* correct metadata.name

* small style fix

* support HF_TOKEN

* convert : use writeable buffer for remote lazy tensors

* convert : fix flake8 lint regarding lambda assignment

* multithreaded download

* multithread: print debug

* fix style

* Revert "multithreaded download"

This reverts commit 42fc895.

* bring back _get_request_headers

---------

Co-authored-by: Francis Couture-Harpin
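
The lazy remote loading described in the b5098 message depends on the safetensors file layout: an 8-byte little-endian header length, a JSON header giving each tensor's `data_offsets`, then the raw tensor data. The sketch below shows how one tensor's bytes could be fetched by byte range without downloading the whole file. It is an illustration only, not the `SafetensorRemote` code from the commit. The names `read_header`, `read_tensor_bytes`, `make_memory_fetch`, and `make_http_fetch` are hypothetical, and the use of a `Bearer` authorization header with `HF_TOKEN` is an assumption.

```python
import json
import struct
import urllib.request
from typing import Callable, Optional, Tuple

# fetch(start, end) returns bytes [start, end) of the file (end is exclusive).
Fetch = Callable[[int, int], bytes]


def read_header(fetch: Fetch) -> Tuple[dict, int]:
    """Return (parsed JSON header, offset at which tensor data begins)."""
    # The first 8 bytes hold the header length as a little-endian uint64.
    (header_len,) = struct.unpack("<Q", fetch(0, 8))
    header = json.loads(fetch(8, 8 + header_len))
    return header, 8 + header_len


def read_tensor_bytes(fetch: Fetch, header: dict, data_start: int, name: str) -> bytes:
    """Fetch only the byte range that belongs to one tensor."""
    # data_offsets are relative to the start of the data section.
    begin, end = header[name]["data_offsets"]
    return fetch(data_start + begin, data_start + end)


def make_memory_fetch(blob: bytes) -> Fetch:
    """In-memory stand-in for a remote file, useful for testing."""
    def fetch(start: int, end: int) -> bytes:
        return blob[start:end]
    return fetch


def make_http_fetch(url: str, token: Optional[str] = None) -> Fetch:
    """HTTP variant (assumed): one blocking Range request per call."""
    def fetch(start: int, end: int) -> bytes:
        # HTTP byte ranges are inclusive, hence end - 1.
        headers = {"Range": f"bytes={start}-{end - 1}"}
        if token:
            headers["Authorization"] = f"Bearer {token}"
        req = urllib.request.Request(url, headers=headers)
        with urllib.request.urlopen(req) as resp:
            return resp.read()
    return fetch
```

Each call issues a single blocking request, which matches the commit's remark that the approach is slow while everything is blocking and single-threaded.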

b5022

opencl : fix memory allocation size (ggml-org#12649)

issue:
#17 (comment)

This patch fixes the memory allocation size so that it does not
exceed the maximum allocation size of the OpenCL device.

b4967

SYCL: implement memset ggml backend buffer interface (ggml-org#12580)

* SYCL: implement memset ggml backend buffer interface

* use GGML_ABORT macro

* Do not wait for all queues to finish for memset operation

b4951

opencl: simplify kernel embedding logic in cmakefile (ggml-org#12503)

Co-authored-by: Max Krasnyansky

b4903

vulkan: Add N/2 and N/4 optimized paths in coopmat2 shader (ggml-org#12312)

b4719

llguidance build fixes for Windows (ggml-org#11664)

* setup windows linking for llguidance; thanks @phil-scott-78

* add build instructions for windows and update script link

* change VS Community link from DE to EN

* whitespace fix

b4717

cuda : add ampere to the list of default architectures (ggml-org#11870)

b4667

Make logging more verbose (ggml-org#11714)

Debugged an issue with a user who was on a read-only filesystem.

Signed-off-by: Eric Curtin

b4649

vulkan: optimize coopmat2 iq2/iq3 callbacks (ggml-org#11521)

* vulkan: optimize coopmat2 iq2/iq3 callbacks

* build: trigger CI on GLSL compute shader changes