Stars
Source code for AAAI'25 paper "Component-Level Segmentation for Oracle Bone Inscription Decipherment"
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
A 13B large language model developed by Baichuan Intelligent Technology
Type less, code more: Cody is an AI code assistant that uses advanced search and codebase context to help you write and fix code.
LLMFlows - Simple, Explicit and Transparent LLM Apps
A multi-modal AI model that can generate high-quality novel videos from text, images, or video clips.
An Open-source Toolkit for LLM Development
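A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are relatively cheap to train, covering base models, domain-specific fine-tunes and applications, datasets, and tutorials.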
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Unlock the Power of LLM: Explore These Datasets to Train Your Own ChatGPT!
[MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models
Large Scale Chinese Corpus for NLP
Making large AI models cheaper, faster and more accessible
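An introductory LangChain tutorial in Chinese
Uses the peft library to perform efficient 4-bit QLoRA fine-tuning of chatGLM-6B/chatGLM2-6B, then merges the LoRA model with the base model and applies 4-bit quantization.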
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
An open-source tool-augmented conversational language model from Fudan University
ChatGLM2-6B: An Open Bilingual Chat LLM
Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.