ICLR2024 Spotlight: curation/training code, metadata, distribution
Language modeling in a sentence representation space
High-resolution models for human tasks
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
The ChatGPT Retrieval Plugin lets you easily find personal documents
Research code artifacts for Code World Model (CWM)
Diffusion Transformer with Fine-Grained Chinese Understanding
Expert System Tool
Resources, corpora, and tools for Chinese natural language processing
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Python package for easily interfacing with chat apps
Repo for external large-scale work
fast C++ library for linear algebra & scientific computing
Official PyTorch Implementation of "Scalable Diffusion Models"
ECLiPSe Constraint Logic Programming System
Code for the paper Fine-Tuning Language Models from Human Preferences
Open-source, high-performance Mixture-of-Experts large language model
Open-Source Financial Large Language Models!
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
A computer vision framework to create and deploy apps in minutes
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Open Multilingual Multimodal Chat LMs
Implementation of model parallel autoregressive transformers on GPUs