An AI-powered security review GitHub Action using Claude
Tool for exploring and debugging transformer model behaviors
Inference framework for 1-bit LLMs
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
FAIR Sequence Modeling Toolkit 2
Dataset of GPT-2 outputs for research in detection, biases, and more
Pushing the Limits of Mathematical Reasoning in Open Language Models
Phi-3.5 for Mac: Locally-run Vision and Language Models
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A state-of-the-art open visual language model
Open-source large language model family from Tencent Hunyuan
Chat & pretrained large vision language model
DeepSeek Coder: Let the Code Write Itself
Chat & pretrained large audio language model proposed by Alibaba Cloud
A series of math-specific large language models of our Qwen2 series
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Qwen2.5-VL is the multimodal large language model series
A Family of Open Foundation Models for Code Intelligence
Towards Real-World Vision-Language Understanding
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
GLM-4-Voice | End-to-End Chinese-English Conversational Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
CLIP, Predict the most relevant text snippet given an image
Qwen3-omni is a natively end-to-end, omni-modal LLM