Skip to content

Tags: ochafik/llama.cpp

Tags

b5410

Toggle b5410's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
llguidance : official v0.7.20 release (no actual changes) [noci] (ggm…

…l-org#13594)

b5401

Toggle b5401's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
minja: sync (qwen3) (ggml-org#13573)

* minja: sync google/minja@f06140f

- google/minja#67 (@grf53)
- google/minja#66 (@taha-yassine)
- google/minja#63 (@grf53)
- google/minja#58

---------

Co-authored-by: ochafik <[email protected]>

b5400

Toggle b5400's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
gguf : use ggml log system (ggml-org#13571)

* gguf : use ggml log system

* llama : remove unnecessary new lines in exception messages

b5392

Toggle b5392's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
server : proper error handling for missing elements in messages array…

… (OpenAI compatible backend) (ggml-org#13540)

b5387

Toggle b5387's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
`common`: add partial regex support (ggml-org#12808)

* move string_find_partial_stop & string_ends_with to common

* add common_regex (supports partial matches)

Co-authored-by: Georgi Gerganov <[email protected]>

* Update common/regex-partial.cpp

Co-authored-by: Georgi Gerganov <[email protected]>

* Update common/regex-partial.cpp

Co-authored-by: Georgi Gerganov <[email protected]>

* Update common/regex-partial.h

Co-authored-by: Georgi Gerganov <[email protected]>

* partial regex: add missing iterator end checks

* string utils: use string_views

* direct throw to avoid ggml.h include

* regex-partial: replace missed ggml_asserts

---------

Co-authored-by: ochafik <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>

b5382

Toggle b5382's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
CUDA: faster Deepseek FA, add Turing support (ggml-org#13435)

b5152

Toggle b5152's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
SYCL: Refactor and enable FP16 in binary broadcast OPs (ggml-org#12975)

* SYCL: refactor move to a separate file

* Fix binbcast

* Remove duplicates

* fix include formatting

* fix typo

b5117

Toggle b5117's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
sycl: Support sycl_ext_oneapi_limited_graph (ggml-org#12873)

The current usage of the SYCL-Graph extension checks for
the `sycl_ext_oneapi_graph` device aspect. However, it is also
possible to support `sycl_ext_oneapi_limied_graph` devices that
don't support update

b5072

Toggle b5072's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
hellaswag: display estimated score confidence interval (ggml-org#12797)

b5054

Toggle b5054's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
sync: minja (ggml-org#12739)

* sync: minja

google/minja#57

* fix json include