This project uses the find_package() CMake command to conveniently include llama.cpp in projects that live outside of its source tree.
llama.cpp must first be built and installed with CMake. A minimal example follows; see the official llama.cpp build documentation for more detail.
```bash
git clone https://github.com/ggerganov/llama.cpp
# Or clone my fork with the slim version:
# git clone https://github.com/cshbli/llama.cpp.git
cd llama.cpp
cmake -S . -B build
cmake --build build
cmake --install build --prefix inst
```
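Once installed, an out-of-tree project can pick the library up from CMake, typically with `find_package(llama REQUIRED)` followed by `target_link_libraries(<your-target> PRIVATE llama)`, while pointing `CMAKE_PREFIX_PATH` at the `inst` prefix created above; the exact package and target names come from the `llama-config.cmake` your llama.cpp version installs, so verify them against your install. A minimal consumer program might look like the following sketch. `llama_backend_init()`, `llama_print_system_info()`, and `llama_backend_free()` are real llama.h calls, but their signatures have shifted between releases, so check your installed header:

```cpp
// main.cpp: a minimal out-of-tree consumer (a sketch; llama.h API names
// drift between llama.cpp releases, so check the header you installed).
#include <cstdio>

#include "llama.h"

int main() {
    llama_backend_init();  // one-time backend setup (older releases took a NUMA flag)

    // Print the CPU/GPU features llama.cpp detected at build time.
    std::printf("%s\n", llama_print_system_info());

    llama_backend_free();  // release backend resources
    return 0;
}
```

Configure the consumer with something like `cmake -S . -B build -DCMAKE_PREFIX_PATH=/path/to/llama.cpp/inst` so that find_package() can locate the installed package.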
Although llama.cpp is implemented in C++, it deliberately exposes a plain C interface (llama.h). This design has several advantages:

- **Ensures Compatibility**
A C interface ensures compatibility with a broader range of programming languages and tools, as C APIs are:
- Easily callable from other languages such as Python, Java, Rust, or even C++ itself.
- Universally supported in cross-language bindings and FFI (Foreign Function Interfaces).
For example:
- Libraries with a C API can be easily wrapped for Python (e.g., via `ctypes` or `cffi`); the sketch after this list shows the symbol-lookup mechanism those tools rely on.
- They are also compatible with systems or frameworks that require C-style linkage.

In contrast, C++ classes and features like inheritance, templates, or virtual functions can be complex to map to other languages.
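To make the FFI point concrete: tools like `ctypes` ultimately resolve exported C symbols by their plain, unmangled names at runtime. The C++ sketch below does the same thing by hand with POSIX `dlopen`/`dlsym`; the shared-library path is an assumption (point it at the library under your install prefix), while `llama_print_system_info` is a real llama.h function:

```cpp
// Sketch: resolving a flat C symbol by name at runtime, the same mechanism
// Python's ctypes/cffi rely on. POSIX-only (link with -ldl on older glibc).
#include <cstdio>
#include <dlfcn.h>

int main() {
    // Hypothetical path; use the library installed under your prefix.
    void *lib = dlopen("libllama.so", RTLD_NOW);
    if (!lib) {
        std::fprintf(stderr, "dlopen failed: %s\n", dlerror());
        return 1;
    }

    // C linkage means the exported name is exactly "llama_print_system_info".
    using sysinfo_fn = const char *(*)(void);
    auto sysinfo = reinterpret_cast<sysinfo_fn>(dlsym(lib, "llama_print_system_info"));
    if (sysinfo) {
        std::printf("%s\n", sysinfo());
    }

    dlclose(lib);
    return 0;
}
```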
- **Simplified Binary Interface (ABI Stability)**
C has a more stable ABI (Application Binary Interface) compared to C++:
- Different C++ compilers (e.g., GCC, Clang, MSVC) often produce incompatible name mangling for symbols, making binaries less portable.
- By exposing a flat C API, llama.cpp avoids these issues and ensures consistent symbol names, even when used with different compilers or environments.
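A small illustration of the mangling issue; the mangled names in the comments are what GCC/Clang and MSVC typically emit for this signature:

```cpp
// C++ linkage: the compiler encodes the signature into the symbol name, and
// the encoding differs between compilers (roughly _Z7cpp_addii on GCC/Clang
// vs. ?cpp_add@@YAHHH@Z on MSVC), so binaries don't mix and match cleanly.
int cpp_add(int a, int b) { return a + b; }

// C linkage: the exported symbol is exactly "c_add" under every compiler.
// llama.h gets the same effect by wrapping its declarations in extern "C"
// when compiled as C++.
extern "C" int c_add(int a, int b) { return a + b; }
```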
- **Easier for Embedding in Low-Level Systems**
C interfaces are ideal for:
- Embedding the library in low-level environments like game engines, operating systems, or hardware platforms where C is dominant.
- Avoiding C++ abstractions like exceptions or RTTI (Run-Time Type Information), which can add overhead and complexity that may not be desirable for performance-critical applications like llama.cpp.
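One practical consequence: a C interface reports failures through status codes and out-parameters instead of exceptions, so it stays usable from hosts built with `-fno-exceptions`. The `engine_*` names below are hypothetical, purely to show the shape of the pattern; they are not llama.h API:

```cpp
// Sketch of C-style error handling: status code plus out-parameter,
// no exceptions to propagate across the library boundary.
#include <cstdio>
#include <cstdlib>

struct engine { int id; };  // stands in for an opaque handle

// Hypothetical C-linkage functions (not llama.h API).
extern "C" int engine_open(const char *path, engine **out) {  // 0 on success
    if (path == nullptr || out == nullptr) return -1;
    *out = static_cast<engine *>(std::malloc(sizeof(engine)));
    return *out != nullptr ? 0 : -1;
}

extern "C" void engine_close(engine *e) {
    std::free(e);
}

int main() {
    engine *e = nullptr;
    if (engine_open("model.bin", &e) != 0) {  // check the status code
        std::fprintf(stderr, "open failed\n");
        return 1;
    }
    engine_close(e);
    return 0;
}
```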
- **Lightweight and Minimalist Design**
A C interface aligns with the lightweight philosophy of llama.cpp:
- It avoids introducing complex object-oriented abstractions that might make the code harder to understand, debug, or optimize.
- A flat procedural API is easier for developers to trace and use in performance-critical scenarios like machine learning inference.
- **User-Controlled Encapsulation**
Instead of enforcing encapsulation through C++ classes, llama.cpp lets developers implement their own abstractions:
- A user of the library can wrap the C API in their preferred C++ classes or use it directly, giving them flexibility over how the library is integrated.
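For example, a C++ consumer can layer its own RAII type over the flat API. The sketch below wraps backend setup and teardown; `llama_backend_init`/`llama_backend_free` are real llama.h calls, though their exact signatures vary by release:

```cpp
// Sketch: a user-defined RAII wrapper over the flat C API. The library does
// not impose this class; callers choose their own level of abstraction.
#include "llama.h"

class LlamaBackend {
public:
    LlamaBackend()  { llama_backend_init(); }   // acquire in the constructor
    ~LlamaBackend() { llama_backend_free(); }   // release in the destructor

    // A process-wide resource, so forbid copies.
    LlamaBackend(const LlamaBackend &) = delete;
    LlamaBackend &operator=(const LlamaBackend &) = delete;
};

int main() {
    LlamaBackend backend;  // torn down automatically on scope exit
    // ... load a model and run inference through the C API here ...
    return 0;
}
```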