
-
Institute of Computing Technology, CAS
- Beijing
Highlights
- Pro
Starred repositories
[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet
Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.
RLHF中文手册 - 详细解析RLHF全流程优化阶段,涵盖指令调优、奖励模型训练,以及拒绝采样、强化学习和直接对齐算法等关键技术。
Quarto template for Chinese academic writing
Train transformer language models with reinforcement learning.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl
Production-ready platform for agentic workflow development.
LangChain 的中文入门教程
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Official implementation for LaCo (EMNLP 2024 Findings)
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"
800,000 step-level correctness labels on LLM solutions to MATH problems
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
A paper list of some recent works about Token Compress for Vit and VLM
Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath