Browse free open source LLM Inference tools and projects for Linux below. Use the toggles on the left to filter open source LLM Inference tools by OS, license, language, programming language, and project status.
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
OpenVINO™ Toolkit repository
Sparsity-aware deep learning inference runtime for CPUs
Large Language Model Text Generation Inference
Bolt is a deep learning library with high performance
Libraries for applying sparsification recipes to neural networks
Self-contained Machine Learning and Natural Language Processing lib
An easy-to-use LLMs quantization package with user-friendly apis
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
A Unified Library for Parameter-Efficient Learning
Efficient few-shot learning with Sentence Transformers
Fast and user-friendly runtime for transformer inference