#llama-cpp #bindings #low-level #cuda #moe #offloading #safe-api #llama-cpp-2

sys shimmy-llama-cpp-sys-2

Low-level bindings to llama.cpp with MoE CPU offloading support

Owned by Michael-A-Kuykendall.

1 unstable release

0.1.123	Oct 9, 2025

#1798 in Machine learning

417 downloads per month
Used in 2 crates (via shimmy-llama-cpp-2)

MIT/Apache

9.5MB
174K SLoC

llama-cpp-sys

Raw bindings to llama.cpp with CUDA support.

See llama-cpp-2 for a safe API.

Dependencies