Tags: withcatai/node-llama-cpp

v3.14.2

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: `semantic-release` retry (#518)

v3.14.1

Verified

fix(Vulkan): include integrated GPU memory (#516)

* fix(Vulkan): include integrated GPU memory - adapt to a change in `llama.cpp`
* fix(Vulkan): deduplicate the same device coming from different drivers
* fix: adapt Llama chat wrappers to breaking `llama.cpp` changes
* fix: internal log level
* docs(Vulkan): recommend installing LLVM on Windows

v3.14.0

Verified

test: fix tests (#509)

v3.13.0

Verified

feat: Seed OSS support (#502)

v3.12.4

Verified

test: fix tests (#499)

v3.12.3

Verified

fix: split prebuilt CUDA binaries into 2 npm modules (#495)

v3.12.2

Verified

fix: CUDA 13 support (#494)

fix: prebuilt binaries CUDA 13 support

v3.12.1

Verified

fix: completion config (#490)

* fix: more flexible model message prompt completion config
* feat(Electron template): improve scroll

v3.12.0

Verified

feat: `gpt-oss` support (#487)

* feat: `gpt-oss` support
* fix: Qwen3 memory estimation

v3.11.0

Verified

build: fix CI config (#483)

* build: update CUDA version in the CI
* fix: add missing GGUF types