Python bindings for llama.cpp
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Agentic, Reasoning, and Coding (ARC) foundation models
Contexts Optical Compression
Port of Facebook's LLaMA model in C/C++
Open-source, high-performance AI model with advanced reasoning
Qwen3 is the large language model series developed by the Qwen team
Powerful AI language model (MoE) optimized for efficiency and performance
Qwen3-Coder is the code version of Qwen3
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Qwen-Image is a powerful image generation foundation model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Revolutionizing Database Interactions with Private LLM Technology
Renderer for the harmony response format to be used with gpt-oss
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Capable of understanding text, audio, vision, video
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Unified Multimodal Understanding and Generation Models
Repo of Qwen2-Audio chat & pretrained large audio language model
Inference framework for 1-bit LLMs
Pushing the Limits of Mathematical Reasoning in Open Language Models
Phi-3.5 for Mac: Locally-run Vision and Language Models
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Mixture-of-Experts Vision-Language Models for Advanced Multimodal